0% found this document useful (0 votes)

20 views91 pages

Introduction to Data Science: (Khoa học dữ liệu)

computer network

Uploaded by

ngocmaicute0509

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views91 pages

Introduction to Data Science: (Khoa học dữ liệu)

computer network

Uploaded by

ngocmaicute0509

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 91

Introduction to Data Science

(Khoa học dữ liệu)

Image representation

Nguyen Thi Oanh

Hanoi University of Science and Technology
oanhnt@soict.hust.edu.vn

SOICT, HUST, 2024

1
About me
• Dr. Nguyen Thi Oanh
• Department of Computer science, SoICT, HUST
• Email:
‒ oanhnt@soict.hust.edu.vn
‒ oanh.nguyenthi@hust.edu.vn
• Office:
‒ 706 - B1 (working office) / 1001-B1
• Teaching:
‒ Computer vision, image processing
‒ Databases, database labs
‒ Intro to DS, Intro to ICT
• Research:
‒ Semantic segmentation (on medical images)
‒ Domain adaptation for semantic segmentation
‒ Action recognition (with multi-view, multi-modality
‒ Image representation and retrieval

2
Plan

• Introduction
• Digital images and basic operations
‒ histogram, brightness, contrast, color, texture, …

• Convolution and Filters

‒ noisy remove,
‒ edge detectors

• Feature extraction: local and global descriptor

3
Computer Vision ?

• Image Processing
‒ Work with image as a matrix
‒ Input: image ➔ output: image
‒ Help human to examine / modify images
• Computer Vision
‒ Make computers understand images and video
‒ Images and video are a source of information on the
reality
What kind of scene?
Where are the cars?
How far is the building?
…
4
Computer Vision and Applications

• Images, video are everywhere

• Video, images:
‒ Riche information

➔Hot topic, especially

When we talk every day
about AI with smart city,
mart home, smart …

5
How vision is used now

• Understand the image Facebook's suggestion

Smile detection: smart camera

• Camera can automatically trip the shutter at the right instant to catch the
perfect expression

Source: Derek Hoiem, Computer vision, CS 543 / ECE 549, University of Illinois
6
How vision is used now

• Login without a password, but with biometrics (fingerprint,

iris, face,…

Fingerprint scanners on many new laptops, Face recognition systems now

beginning to appear more widely
other devices
http://www.sensiblevision.com/

Source: Derek Hoiem, Computer vision, CS 543 / ECE 549, University of Illinois

7
How vision is used now

• Object recognition (on mobile phones)

Point & Find, Nokia

Google Goggles

Source: Derek Hoiem, Computer vision, CS 543 / ECE 549, University of Illinois
8
How vision is used now
• Content-based image retrieval

9
How vision is used now

• Earth View, Google earth (3D modeling from lots of 2D

images): automatic building generation + hand modeled
buildings (Golden Gate bridge or Sydney Opera house)

Microsoft's Virtual Earth

10
How vision is used now

• Panorama stitching:

Source:http://miseaupoint.org/blog/en/wp-content/uploads/2014/01/photo_stitching.jpg

11
How vision is used now

• Smart cars → autonomous vehicles

Mobileye: vision systems currently in many cars

“In mid 2010 Mobileye will launch a world's first application of full emergency
braking for collision mitigation for pedestrians where
vision is the key technology for detecting pedestrians
Source: Derek Hoiem, Computer vision, CS 543 / ECE 549, University of Illinois

12
How vision is used now

• Games / robots: 2_on_1_melee2

http://www.robocup.org/

Vision-based interaction game

(Microsoft's Kinect)

Robot vacuum cleaner

13
What we will talk about?

• 2 types information we would like to extract from images :

‒ Matrix 3D information
‒ Semantic Information
3D building?

Where is it?
Text in the picture,
How to represent what does it mean?
the image content ? Are there any person
in the picture?

Feature extraction Learning

Pre-processing Model
Deep Learning

17
Digital images ?

• What can we see on the picture?

‒ A car?
• What does the machine see?
‒ Image is a matrix of pixels
‒ Image N x M : N x M matrix
‒ 1 pixel (gray levels):
• An intensity value: 0-255
• Black: 0
• White: 255

18
Digital images ?

• For an image I x

‒ Index (0,0): Top left corner

‒ I(x,y): intensity of pixel at
the position (x,y) y

19
Digital images ?

• Principal type of images

‒ Binary image:
- I(x,y)  {0 , 1}
- 1 pixel: 1 bit
‒ Gray image:
- I(x,y)  [0..255]
- 1 pixel: 8 bits (1 byte)
‒ Color image
- IR(x,y), IG(x,y) IB(x,y)  [0..255]
- 1 pixel: 24 bits (3 bytes )
‒ Other : multi-spectre, depth
image,…

20
Color image in RGB space

It exists other color spaces:

Lab, HSV, …

21
Image histogram

 Histogram is a graphical representation of the repartition of

colours among the pixels of a numeric image.

22
Image histogram

 Histogram
 Should be normalized by dividing all elements to total number
of pixels in the image

p 𝑖 𝜖 [0,1]

255
෍𝑝 𝑖 = 1
𝑖=0

23
Image histogram

• Histogram
‒ Only statistic information
‒ No indication about the location of pixel (no spatial
information) ➔ Different images can have the same
histogram

24
Image Brightness
• Brightness of a grayscale image is the average intensity
of all pixels in an image
‒ refers to the overall lightness or darkness of the image

25
Contrast
• The contrast of a grayscale image indicate how easily
object in the image can be distinguished
• Many different equations for contrast exist
‒ Standard deviation of intensity values of pixels in the
image

‒ Difference between intensity value maximum et minimum

26
Brightness/Contrast vs histogram

Narrow histogram

Broad histogram

27
Contrast Enhancement
• Modify pixel intensities to obtain higher contrast
• There are several methods:
‒ Linear stretching of intensity range:
• Linear transform
• Linear transform with saturation
• Piecewise linear transform

‒ Non-linear transform (Gama correction)

‒ Histogram equalization (Cân bằng histogram)

28
Linear stretching 𝑠 = 𝑠𝑚𝑖𝑛 + 𝑟 − 𝑟𝑚𝑖𝑛
𝑠𝑚𝑎𝑥 − 𝑠𝑚𝑖𝑛
𝑟𝑚𝑎𝑥 − 𝑟𝑚𝑖𝑛

𝑠 = 𝑎. 𝑟 + 𝑠𝑚𝑖𝑛 − 𝑎. 𝑟𝑚𝑖𝑛 ,
𝑠 −𝑠
where a = 𝑟𝑚𝑎𝑥− 𝑟𝑚𝑖𝑛
𝑚𝑎𝑥 𝑚𝑖𝑛

If 𝑠𝑚𝑖𝑛 = 0; 𝑠𝑚𝑎𝑥 = 255

255
𝑠 = 𝑟 − 𝑟𝑚𝑖𝑛
𝑟𝑚𝑎𝑥 − 𝑟𝑚𝑖𝑛

29
Linear stretching

Intensity range = [0,255]

No efficace?

30
Histogram equalization
• Change histogram of modified image into uniform
distribution

• No parameters. OpenCV:cv2.equalizeHist(img)

31
Histogram equalization

32
Gama correction
Non linear transformation
• The general form of power-law transformation is:
𝑠 = 𝑐. 𝑟 
‒  > 1: compress values in dark area, while expanding values in light
area
‒  < 1 : expand values in dark area, while compressing values in light
area

s : new value
r : normalized old values
to [0, 1]
(r = old intensity/(L-1))
c : scaling constant
corresponding to the bit size
used
(c = L-1 = 255)

33
Gama correction

For grayscale image

of 8 bits:

𝑟 𝛾
𝑠 = 255.
255

34
Color Image histogram

• Intensity histogram:
‒ Convert color image to
grayscale
=> Compute histogram of gray
scale image
• Individual Color Channel
Histograms:
3 histograms for (R,G,B)
• 3D histogram:
a color identified by 3 values. Not
usually because of big elements

Source: https://web.cs.wpi.edu/~emmanuel

35
RGB (Red – Green - Blue)

• Used in storage and display

• R = G = B: gray level
• Any color
= r*R + g*G + b*B
‒ Strongly correlated channels
‒ Non-perceptual
‒ No separation between
intensity and color

36
Human Vision

• Two types of light-sensitive photoreceptors (on

retina) Cones
cone-shaped
less sensitive
operate in high light
color vision
Rods
rod-shaped
highly sensitive
operate at night
gray-scale vision

37
Human Vision ➔ Camera
• Three kinds of cones
- each cone is able to detect a range of colors
- labeled by the color at which they are most sensitive
.

440 530 560 nm.

RELATIVE ABSORBANCE (%)

100
S M L

400 450 500 550 600 650

WAVELENGTH (nm.)

38
HSV (Hue – Saturation- Value)

• The Hue-Saturation-Value (HSV) color space is useful

for segmentation and recognition
‒ Non-linear conversion
‒ Visual representation of colors

• We identify for a pixel:

‒ The pixel intensity (value)
‒ The pixel color (hue + saturation)
• RGB does not have this seperation

39
HSV (Hue – Saturation- Value)

• Hue (H) is coded as an angle

between 0 and 360
• Saturation (S) is coded as a
radius between 0 and 1
‒ S = 0 : gray
‒ S = 1 : pure color
• Value (V) = MAX (Red,
Green, Blue)

40
HSV (Hue – Saturation- Value)

• If we know the color of the object we are looking for, we

can model it using a hue interval
• Take care, because it is an angle (periodic value)
‒ Hue < 60° means nothing
• Is 350° smaller or bigger than 60°?
‒ Define an interval: 350° < Hue < 60° (for example)
• This interval is valid if Saturation > threshold (otherwise
gray level)
• This is independant of Value , which is more sensible to
light conditions

41
Lab color space

• The Lab system (sometimes Lab*) is based on a study

from human vision
‒ independant from all technologies
‒ presenting colors as seen by the human eyes
• Colors are defined using 3 values
‒ L is the luminance, going from 0% (black) to 100%
(white)
‒ a* represents an axis going from green (negative
value, -127) to red (positive value, +127)
‒ b* represents an axis going from blue (negative value,
-127) to yellow (positive value,+127)

42
Lab color space

43
Color conversions

• Convert between color spaces

• OpenCV:
‒ https://docs.opencv.org/4.0.0/de/d25/imgproc_color_conversions
.html
‒ Function: cv::cvtColor (InputArray src, OutputArray dst, int
code, int dstCn=0)
• converts an image from one color space to another
• code: conversion code (COLOR_RGB2HSV,
COLOR_RGB2HSV, COLOR_BGR2Lab, …)

44
Color space vs. illumination conditions

• collected 10 images of the cube under varying illumination

conditions

• separately cropped every color to get 6 datasets for the 6

different colors

Changes in color due to varying Illumination conditions

• Compute the density plot: Check the distribution of a particular
color say, blue or yellow in different color spaces. The density
plot or the 2D Histogram gives an idea about the variations in
values for a given color
Source: Vikas Gupta, Learn OpenCV
45
Similar illumination: very compact

Fig.: Density Plot showing the variation of values in color

channels for 2 similar bright images of blue color
Source: Vikas Gupta, Learn OpenCV
46
Similar illumination: very compact

Fig.: Density Plot showing the variation of values in color channels for
2 similar bright images of yellow color
Source: Vikas Gupta, Learn OpenCV
47
Different illumination

Fig.: Density Plot showing the variation of values in color channels

under varying illumination for the blue color Source: Vikas Gupta, Learn OpenCV
48
Different illumination

• Different illumination:

Fig.: Density Plot showing the variation of values in color channels

under varying illumination for the yellow color
Source: Vikas Gupta, Learn OpenCV
49
Color space vs illumination conditions

• Different illumination:
‒ RGB space: the variation in the value of channels is very
hight
‒ HSV: compact in H. Only H contains information about the
absolute color ➔ a choix
‒ YCrCb, LAB: compact in CrCb and in AB
• Higher level of compactness is in LAB
‒ Convert to other color spaces (OpenCV):
• cvtColor(bgr, ycb, COLOR_BGR2YCrCb);
• cvtColor(bgr, hsv, COLOR_BGR2HSV);
• cvtColor(bgr, lab, COLOR_BGR2Lab);

50
Spatial convolution

• Image filtering : For each pixel, compute function of local neighborhood

and output a new value
‒ Same function applied at each position
‒ Output and input image are typically the same size
• Convolution : Linear filtering, function is a weighted sum/difference of
pixel values
I' = I * K
• Really important!
‒ Enhance images: Denoise, smooth, increase contrast, etc.
‒ Extract information from images:
• Texture, edges, distinctive points, etc.
‒ Detect patterns
• Template matching

51
Spatial convolution

Mask (kernel)
Original image Filtered image

52
Spatial convolution

• New value of a pixel(i,j) is a weighted sum of its neigbors

K: convolution kernel,
mask, filter, …

- Flip the kernel both horizontally and

vertically (filter rotated 180 degrees).
- Put the center of kernel at each pixel (i,j) of
the image.
- Multiply each element of the kernel with its
corresponding element of the image matrix
- Sum up all product outputs

53
Spatial convolution

• New value of a pixel(i,j) is a weighted sum of its neigbors

Source: http://machinelearninguru.com

54
Spatial convolution

I' = I * K

55
Spatial convolution

I' = I * K

56
Spatial convolution

I' = I * K

57
Spatial convolution

• Border problem?
‒ Zero padding in the input matrix
‒ reflect across border:
• f(-x,y) = f(x,y)
• f(-x,-y) = f(x,y)
‒…

58
Some kernels

• 2D spatial convolution
‒ is mostly used in image processing for feature extraction
‒ And is also the core block of Convolutional Neural Networks
(CNNs)
• Each kernel has its own effect and is useful for a specific
task such as
‒ blurring (noise removing): mean filter, gaussian filter, …
‒ Sharpening,
‒ edge detection: sobel, prewitt, laplace
‒ …..

60
Some kernels

0 0 0
* 0 1 0
0 0 0

Original image Filtered image

(no change)

0 0 0
* 1 0 0
0 0 0

Filtered image
Original image
(shifted left by 1 pixel)
Source: David Lowe
61
Some kernels

• Box filter (mean filter): low-pass filter

‒ Replace each pixel with an average of
its neigborhood
‒ Achieve smoothing effect

Original image Filtered image Filtered image

with box size 5x5 with box size 11x11

62
Some kernels

 Gaussian filter : ): low-pass filter

0.003 0.013 0.022 0.013 0.003
0.013 0.059 0.097 0.059 0.013
0.022 0.097 0.159 0.097 0.022
0.013 0.059 0.097 0.059 0.013
0.003 0.013 0.022 0.013 0.003
Gaussian filter with size 5 x5 , sigma =1
Gaussian function in 3D

Rule for Gaussian filter:

set filter half-width to about 3σ
Gaussian image
Sigma = 0.5 ➔ mask size: 3x3
63
Some kernels

• Gaussian filter

Original image Filtered image Filtered image

with box size 5x5 with box size 11x11

65
Some kernels

• Sobel

Vertical Edge
(absolute value)
66
Some kernels

• Sobel

Horizontal Edge
(absolute value)
67
Edge detection

• Edges are corresponding to:

‒ Maximums of the first derivative
‒ Zero-crossing in the second derivative

68
Edge detection with first derivatives

• Compute the convolution between the image and the first

derivatives kernels
‒ Kernels: Sobel, Prewitt, Robert
‒ Implemented in OpenCV library

• Find local extrema

‒ Edge composed of pixels having maximum/minimum value of
the first derivatives of image
‒ Can use a threshold to detect edge rapidly
‒ Can make several steps to obtain the optimal edge: Canny
detector (implemented in OpenCV)

69
Edge detection with first derivatives

• Filters used to compute the first

derivatives of image
‒ Robert

‒ Prewitt
• less sensitive to noise
• Smoothing with mean filter,
then compute1st derivative
‒ Sobel:
• less sensitive to noise
• Smoothing with gaussian,
then computing1st derivative
y x
70
Edge detection with first derivatives

71
Image derivatives

• 1st derivatives :

First derivative of
I* Ix image with
respect to x

First derivative of
I* Iy image with
respect to y

Image gradient

72
Image gradient

• An image gradient is a directional change in the intensity

or color in an image
• For each pixel in the image: Gx, Gy
➔Form a gradient vector (Gx, Gy) :
- Important information to describe the image content
- Gradient Magnitude = (Gx)2 + (Gy)2 ≈|Gx| + |Gy|
- Gradient Direction = arctan(Gy/Gx)

Blue lines represent the

gradient direction: from
brightest to darkest

73
Edge detection with second derivatives

• Compute the second derivative

‒ Apply the Laplacian filter on the image I

[ 0 1 0
1 −4 1
0 1 0 ] Or
[ 1 1 1
1 −8 1
1 1 1 ]
• Find zero-crossing

74
Laplacian filter - Second derivative

• Discrete approximations for the Laplacian function

‒ One convolution matrix
[ 0 1 0
1 −4 1
0 1 0 ] [ 1 1 1
1 −8 1
1 1 1 ]

75
OpenCV

• Blurring: GaussianBlur, boxFilter,...

• First derivatives: cv.Sobel(), Scharr,...
• Second derivative: cv.Laplacian(),...
• Canny edge detector: optimal detector
‒ https://docs.opencv.org/4.x/da/d22/tutorial_py_canny.html
‒ cv.Canny()

76
Image representation

77
Feature extraction

• Two types of features are extracted from the image:

‒ local and global features (descriptors)
• Global features
‒ Describe the image as a whole to the generalize the entire
object
‒ Include contour representations, shape descriptors, and
texture features
‒ Examples: Invariant Moments (Hu, Zernike), Histogram
Oriented Gradients (HOG), PHOG, and Co-HOG,...
• Local feature:
‒ the local features describe the image patches (key points in
the image) of an object
‒ represents the texture/color in an image patch
‒ Examples: SIFT, SURF, LBP, BRISK, MSER and FREAK, …
78
Feature extraction

• Global features

256 bins intensity histogram

Pyramid Histogram of Oriented Gradients

16 bins intensity histogram
Source:http://www.robots.ox.ac.uk/~vgg/research/caltech/phog.html

79
Feature extraction

• Local features: how to determine image patches / local

regions

Dividing into
patches with Keypoint detection
regular grid
Image segmentation

Without knowledge about

image content Based on the content of image

81
Feature extraction

• Image segmentation
‒ Thresholding
‒ Split and merge
‒ Region growing
‒ Watershed
‒…

82
Feature extraction

• Keypoint detectors:
‒ DoG /SIFT detector
‒ Harris corner detector
‒ Moravec
‒ …
• Local features: computed in
local regions associated to each
keypoints:
- SIFT,
- SURF(Speeded Up Robust
Features),
- PCA-SIFT
- LBP, BRISK, MSER and
FREAK, …

83
Keypoint detector

• Harris corner detector

‒ https://docs.opencv.org/3.4/dc/d
0d/tutorial_py_features_harris.h
tml
‒ Invariant under translation,
rotation, but not scaling
• Harris-Laplace detector:
‒ https://docs.opencv.org/4.0.1/d
1/dad/classcv_1_1xfeatures2d_
1_1HarrisLaplaceFeatureDetec
tor.html
‒ Invariant under translation,
rotation, scaling

84
Keypoint detector: DoG/SIFT detector

• Find local extrema in

space-scale DoG:
‒ DoG ~ Laplace of
Gaussian
‒ extrema in second
derivatives

A SIFT keypoint : {x, y, scale, dominant orientation}

Source: Distinctive Image Features from Scale-Invariant Keypoints – IJCV 2004

85
Feature extraction : Good feature?

• Compact
• Invariant to
‒ geometric transformation
‒ Camera viewpoint
‒ Lighting condition

• Best performant local feature: SIFT (David Lowe)

86
Feature extraction : SIFT feature

Blur the image Compute orientation

Compute gradients in
using the scale of histogram in 8
respect to the keypoint
the keypoint directions over 4x4
orientation(rotation
(scale invariance) sample regions ➔ 128 D
invariance)

Source: Distinctive Image Features from Scale-Invariant Keypoints – IJCV 2004

http://campar.in.tum.de/twiki/pub/Chair/TeachingWs13TDCV/feature_descriptors.pdf

87
Other detectors and descriptors

Popular features: SURF, HOG, SIFT

http://campar.in.tum.de/twiki/pub/Chair/TeachingWs13TDCV/feature_descriptors.p
df

Summary some local features:

http://www.cse.iitm.ac.in/~vplab/courses/CV_DIP/PDF/Feature_Detectors_and_Descri
ptors.pdf

88
Feature extraction : OpenCV

• SIFT & SURF:

‒ Patented algorithms
‒ They are free to use fro academic / research purposes
‒ You should technically be getting permission to use them in
commercial applications
• From OpenCV 3.0, patented algorithms are
‒ removed from standard package,
‒ putted into non-free module (opencv-contrib, not installed by
default). From version 4.4, sift is free (in the standard package)
• Free alternatives to sift, surf:
‒ ORB (Oriented FAST and Rotated Brief)
‒ BRIEF, BRISK, FREAK, KAZE and AKAZE

89
Feature extraction : OpenCV

• SIFT
sift = cv.xfeatures2d.SIFT_create() // for version before v 4.4.
sift = cv. SIFT_create() // for version from v 4.4.
‒ sift.detect() function finds the keypoint in the images
‒ sift.compute() which computes the descriptors from the keypoints
kp = sift.detect(gray,None)
kp,des = sift.compute(gray,kp)
‒ Find keypoints and descriptors in a single step
sift.detectAndCompute()
kp, des = sift.detectAndCompute(gray,None)
‒ https://docs.opencv.org/3.4/da/df5/tutorial_py_sift_intro.html
• SURF: similar

90
Feature extraction : OpenCV

• SURF: similar
>>> img = cv.imread('fly.png',0)
# Create SURF object. You can specify params here or later.
# Here I set Hessian Threshold to 400
>>> surf = cv.xfeatures2d.SURF_create(400)
# Find keypoints and descriptors directly
>>> kp, des = surf.detectAndCompute(img,None)
>>> len(kp)
699

‒ https://docs.opencv.org/3.4/df/dd2/tutorial_py_surf_intro.html

91
Origine: Bag-of-words models
• Orderless document representation: frequencies of
words from a dictionary Salton & McGill (1983)

US Presidential Speeches Tag Cloud

http://chir.ag/phernalia/preztags/
92
Bags of features for object recognition
• Works pretty well for image-level classification and for
recognizing object instances

face, flowers, building

Csurka et al. (2004), Willamowski et al. (2005), Grauman & Darrell (2005), Sivic et al. (2003, 2005) 93
Bag of features: outline
1. Extract features OpenCV:
BOWImgDescriptorExtractor Class
2. Learn “visual vocabulary”
3. Quantize features using visual vocabulary
4. Represent images by frequencies of
“visual words”

94
Higher semantic vision problem
1. Image representation
- Pixel level
- Region level
- Image level

2. Classification: ML techniques
- Pixel level ➔ segmentation
- Region level ➔ detection
- Image level ➔ classification/recognition

95
References
• CVIP tool to explore the power of computer processing of digital images: Many methods in image
processing and computer vision have been implemented
‒ https://cviptools.ece.siue.edu/
• Library: OpenCV, with C/C++, Python and Java interfaces. OpenCV was designed for computational
efficiency and with a strong focus on real-time application: https://opencv.org/
• Books:
‒ Rafael C. Gonzalez, Richard Eugene Woods, Digital Image Processing, 2nd edition, Prentice -Hall,
2002: Chap 3 (spatial operators), 6 (Color spaces)
‒ Richard Szeliski, Computer Vision: Algorithms and Applications, Springer, 2010.
http://szeliski.org/Book/
• Articles:
‒ SIFT (DoG detector and SIFT descriptor): https://www.cs.ubc.ca/~lowe/keypoints/
‒ SURF: Herbert Bay, Andreas Ess, Tinne Tuytelaars, and Luc Van Gool, "Speeded Up Robust
Features", ETH Zurich, Katholieke Universiteit Leuven
‒ GLOH: Krystian Mikolajczyk and Cordelia Schmid "A performance evaluation of local descriptors",
IEEE Transactions on Pattern Analysis and Machine Intelligence, 10, 27, pp 1615--1630, 2005.
‒ PHOG: http://www.robots.ox.ac.uk/~vgg/research/caltech/phog.html
• https://www.learnopencv.com/ : many examples with code in C++/ Python and clear explanation

96
Thank you for
your attention!

Computer Vision Class 10 Notes
100% (5)
Computer Vision Class 10 Notes
7 pages
CV #1 Course Introduction-1
No ratings yet
CV #1 Course Introduction-1
61 pages
Computer Vision ch1
No ratings yet
Computer Vision ch1
80 pages
Lecture 1 AI Summary
No ratings yet
Lecture 1 AI Summary
31 pages
Ch-3 Image AnalysisComputer Vision
No ratings yet
Ch-3 Image AnalysisComputer Vision
88 pages
Unit 1
No ratings yet
Unit 1
179 pages
1 Intro
No ratings yet
1 Intro
103 pages
Computer Vision Intorduction
No ratings yet
Computer Vision Intorduction
57 pages
Lec00 Intro For Web Highlighted
No ratings yet
Lec00 Intro For Web Highlighted
72 pages
Lect1 PDF
100% (1)
Lect1 PDF
45 pages
Ai CV Notes
No ratings yet
Ai CV Notes
6 pages
Lec01 CT Intro
No ratings yet
Lec01 CT Intro
61 pages
Prerequisites: What Is Computer Vision? Vision For Measurement
No ratings yet
Prerequisites: What Is Computer Vision? Vision For Measurement
8 pages
Lec01 Intro
No ratings yet
Lec01 Intro
61 pages
CV Module 1
No ratings yet
CV Module 1
166 pages
Lec00 Intro For Web
No ratings yet
Lec00 Intro For Web
81 pages
Computer Vision Class 10 AI Notes CBSE
No ratings yet
Computer Vision Class 10 AI Notes CBSE
8 pages
What Computer Vision With The OpenCV
100% (5)
What Computer Vision With The OpenCV
137 pages
Computer Vision
No ratings yet
Computer Vision
52 pages
Computer Vision Part1
No ratings yet
Computer Vision Part1
96 pages
CV Unit 1 Overview of Computer Vison and Application
No ratings yet
CV Unit 1 Overview of Computer Vison and Application
51 pages
Computer Vision SM-1
No ratings yet
Computer Vision SM-1
26 pages
CS5330 F22 Lectures
No ratings yet
CS5330 F22 Lectures
116 pages
Ch1 TDMA Image Processing
No ratings yet
Ch1 TDMA Image Processing
34 pages
CV - Unit 1
No ratings yet
CV - Unit 1
14 pages
Computer Vision
No ratings yet
Computer Vision
29 pages
PDF Joiner
No ratings yet
PDF Joiner
38 pages
T2310 TDS3651 L01 Introduction
No ratings yet
T2310 TDS3651 L01 Introduction
73 pages
Computer Vision Notes
No ratings yet
Computer Vision Notes
4 pages
Computer Vision: Facial Recognition
No ratings yet
Computer Vision: Facial Recognition
9 pages
Digital Image Processing
No ratings yet
Digital Image Processing
30 pages
Computer Vision
No ratings yet
Computer Vision
19 pages
Unit 1
No ratings yet
Unit 1
200 pages
CS7.505: Computer Vision: Spring 2022
No ratings yet
CS7.505: Computer Vision: Spring 2022
46 pages
803 (A) Image Processing and Computer Vision#: Subject In-Charge: Prof Shilpa Sharma
No ratings yet
803 (A) Image Processing and Computer Vision#: Subject In-Charge: Prof Shilpa Sharma
44 pages
Computer Vision Class X
No ratings yet
Computer Vision Class X
39 pages
00CV Intro Full
No ratings yet
00CV Intro Full
58 pages
Computer Vision
100% (1)
Computer Vision
48 pages
Computer Vision
No ratings yet
Computer Vision
15 pages
Computer Vision Introduction
No ratings yet
Computer Vision Introduction
42 pages
CS 474 Lec 01 Introduction
No ratings yet
CS 474 Lec 01 Introduction
69 pages
AI 10th Grade Pdfs
No ratings yet
AI 10th Grade Pdfs
30 pages
CV SVD L01 P1 Intro
No ratings yet
CV SVD L01 P1 Intro
35 pages
Computer Vision and Image Processing (Updated)
No ratings yet
Computer Vision and Image Processing (Updated)
165 pages
Computer Vision
No ratings yet
Computer Vision
13 pages
CV Gtu Answers
No ratings yet
CV Gtu Answers
56 pages
AI-Computer Vision
No ratings yet
AI-Computer Vision
16 pages
1 Intro Visión Artificial
No ratings yet
1 Intro Visión Artificial
50 pages
Unit-5 Computer Vision
No ratings yet
Unit-5 Computer Vision
3 pages
C10 - Ai - Computer Vision
No ratings yet
C10 - Ai - Computer Vision
40 pages
Lec 00
No ratings yet
Lec 00
76 pages
Computer Vision
No ratings yet
Computer Vision
4 pages
Computer Vision: Linda Shapiro
No ratings yet
Computer Vision: Linda Shapiro
73 pages
DL4CV Week01 Part01
No ratings yet
DL4CV Week01 Part01
35 pages
1a. Introduction
No ratings yet
1a. Introduction
32 pages
Unit1 CV
No ratings yet
Unit1 CV
44 pages
Computer Vision
No ratings yet
Computer Vision
7 pages
Chapter 1 - Introduction To CV
No ratings yet
Chapter 1 - Introduction To CV
49 pages
Lec 01 CompVision N DIP Intro
No ratings yet
Lec 01 CompVision N DIP Intro
91 pages
Object Detection: Advances, Applications, and Algorithms
From Everand
Object Detection: Advances, Applications, and Algorithms
Fouad Sabry
No ratings yet
Jetson Nano
100% (1)
Jetson Nano
349 pages
01 Basics 01ML 02
No ratings yet
01 Basics 01ML 02
35 pages
3.3. Smoothing Spatial Filtering
No ratings yet
3.3. Smoothing Spatial Filtering
60 pages
Unit-2@IP (Ritik Chauhan)
No ratings yet
Unit-2@IP (Ritik Chauhan)
10 pages
CUDA Application For Canny Edge Detection
No ratings yet
CUDA Application For Canny Edge Detection
12 pages
AD8703 Basic of Computer Vision UNIT 1
No ratings yet
AD8703 Basic of Computer Vision UNIT 1
65 pages
Classification of Endangered Bird Species of Nepal Using Deep Learning
No ratings yet
Classification of Endangered Bird Species of Nepal Using Deep Learning
43 pages
Google Aiml
No ratings yet
Google Aiml
50 pages
Project Report
No ratings yet
Project Report
40 pages
CHAP 5 Morphological Image Processing
No ratings yet
CHAP 5 Morphological Image Processing
103 pages
02 Springer Paper Template
No ratings yet
02 Springer Paper Template
17 pages
Understanding of Convolutional Neural Network (CNN)
No ratings yet
Understanding of Convolutional Neural Network (CNN)
9 pages
Computer Vision Important Questions Answers 250322 101712
No ratings yet
Computer Vision Important Questions Answers 250322 101712
26 pages
Chapter 3 Multidimensional Grids A 2023 Programming Massively Parallel Pro
No ratings yet
Chapter 3 Multidimensional Grids A 2023 Programming Massively Parallel Pro
22 pages
New CNN-Based Predictor For Reversible Data Hiding
No ratings yet
New CNN-Based Predictor For Reversible Data Hiding
5 pages
AI Facilitators Handbook X
No ratings yet
AI Facilitators Handbook X
42 pages
CNN Students
No ratings yet
CNN Students
170 pages
Ijrpr Paper Studentsize
No ratings yet
Ijrpr Paper Studentsize
12 pages
Domnic Object Detecion Basics
No ratings yet
Domnic Object Detecion Basics
62 pages
CNN
No ratings yet
CNN
62 pages
REF2 - Basic Image Processing
No ratings yet
REF2 - Basic Image Processing
18 pages
Convolution Neural Network (CNN) Unit 2: Dr. Kavita R Singh
No ratings yet
Convolution Neural Network (CNN) Unit 2: Dr. Kavita R Singh
65 pages
Unit 2
No ratings yet
Unit 2
45 pages
Convolution Operation
No ratings yet
Convolution Operation
23 pages
Ai Theory Curriculum
No ratings yet
Ai Theory Curriculum
8 pages
Edge Detection in Image Processing
No ratings yet
Edge Detection in Image Processing
9 pages
SGM4-Study Guide For Module 4
No ratings yet
SGM4-Study Guide For Module 4
15 pages
Unit - 2
No ratings yet
Unit - 2
51 pages
Image Tampering Localization Using A Dense Fully Convolutional Network
No ratings yet
Image Tampering Localization Using A Dense Fully Convolutional Network
14 pages
9 CNN-1
No ratings yet
9 CNN-1
89 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Introduction to Data Science: (Khoa học dữ liệu)

Uploaded by

Introduction to Data Science: (Khoa học dữ liệu)

Uploaded by

Introduction to Data Science

(Khoa học dữ liệu)

Nguyen Thi Oanh

SOICT, HUST, 2024

• Convolution and Filters

• Feature extraction: local and global descriptor

• Images, video are everywhere

➔Hot topic, especially

• Understand the image Facebook's suggestion

Smile detection: smart camera

• Login without a password, but with biometrics (fingerprint,

Fingerprint scanners on many new laptops, Face recognition systems now

• Object recognition (on mobile phones)

Point & Find, Nokia

• Earth View, Google earth (3D modeling from lots of 2D

Microsoft's Virtual Earth

• Smart cars → autonomous vehicles

Mobileye: vision systems currently in many cars

• Games / robots: 2_on_1_melee2

Vision-based interaction game

Robot vacuum cleaner

• 2 types information we would like to extract from images :

Feature extraction Learning

• What can we see on the picture?

‒ Index (0,0): Top left corner

• Principal type of images

It exists other color spaces:

 Histogram is a graphical representation of the repartition of

‒ Difference between intensity value maximum et minimum

‒ Non-linear transform (Gama correction)

‒ Histogram equalization (Cân bằng histogram)

If 𝑠𝑚𝑖𝑛 = 0; 𝑠𝑚𝑎𝑥 = 255

Intensity range = [0,255]

For grayscale image

• Used in storage and display

• Two types of light-sensitive photoreceptors (on

440 530 560 nm.

400 450 500 550 600 650

• The Hue-Saturation-Value (HSV) color space is useful

• We identify for a pixel:

• Hue (H) is coded as an angle

• If we know the color of the object we are looking for, we

• The Lab system (sometimes L*a*b*) is based on a study

• Convert between color spaces

• collected 10 images of the cube under varying illumination

• separately cropped every color to get 6 datasets for the 6

Changes in color due to varying Illumination conditions

Fig.: Density Plot showing the variation of values in color

Fig.: Density Plot showing the variation of values in color channels

Fig.: Density Plot showing the variation of values in color channels

• Image filtering : For each pixel, compute function of local neighborhood

• New value of a pixel(i,j) is a weighted sum of its neigbors

- Flip the kernel both horizontally and

• New value of a pixel(i,j) is a weighted sum of its neigbors

Original image Filtered image

• Box filter (mean filter): low-pass filter

Original image Filtered image Filtered image

 Gaussian filter : ): low-pass filter

Rule for Gaussian filter:

Original image Filtered image Filtered image

• Edges are corresponding to:

• Compute the convolution between the image and the first

• Find local extrema

• Filters used to compute the first

• An image gradient is a directional change in the intensity

Blue lines represent the

• Compute the second derivative

• Discrete approximations for the Laplacian function

• Blurring: GaussianBlur, boxFilter,...

• Two types of features are extracted from the image:

256 bins intensity histogram

Pyramid Histogram of Oriented Gradients

• Local features: how to determine image patches / local

Without knowledge about

• Harris corner detector

• Find local extrema in

• The Lab system (sometimes Lab*) is based on a study