INT345 Computer Vision
INT345 Computer Vision
Computer Vision
Lecture #0
Agenda, Roadmap & Expectations
Course details
• LTP – 2 0 2
• Credits – 3.0
• Mode – BYOD
• Text Book
– COMPUTER VISION: A MODERN APPROACH by
DAVID A. FORSYTH, Pearson Education India
– PROGRAMMING COMPUTER VISION WITH
PYTHON: TOOLS AND ALGORITHMS FOR
ANALYSING IMAGES by JAN ERIK SOLEM, O'REILLY
Course Assessment Model
• Marks break up*
• Attendance 5
• CA 25
– Project
– 1 Best out of 2 Test-code based
• MTE 20
• ETE 50
• Total 100
Computer Vision
Make computers understand images and video.
Is that a
queen or a
bishop?
Why computer vision matters?
•Examples of state-of-the-art
Optical character recognition (OCR)
Technology to convert scanned docs to text
• If you have a scanner, it probably came with OCR software
LaneHawk by EvolutionRobotics
“A smart camera is flush-mounted in the checkout lane, continuously watching for
items. When an item is detected and recognized, the cashier verifies the quantity of
items that were found under the basket, and continues to close the transaction. The
item can remain under the basket, and with LaneHawk,you are assured to get paid for
it… “
Vision-based biometrics
“How the Afghan Girl was Identified by Her Iris Patterns” Read the story
wikipedia
Login without a password…
• Mobileye
– Vision systems currently in high-end BMW, GM, Volvo models
– By 2010: 70% of car manufacturers.
Medical imaging
• Image Enhancement
– Histogram equalization
– Contrast enhancement
– Thresholding
– The EM algorithm
– Frequency domain techniques
• Image Filtering
– Spatial filtering
– Fourier transform
– Geometric transformations
The course contents: Unit 2
• 2-D Projective Geometry
– Planar geometry
– The 2D projective plane
– Projective transformations
– Homography properties
• Camera Geometry
– Pinhole camera
– Cameras with lenses
– CCD cameras
– General projective cameras
– Affine cameras
The course contents: Unit 3
• Stereo Geometry
– Epipolar geometry
– Fundamental matrix
– Motion of camera
– Normalized 8-point algorithm
– Linear triangulation methods
– Geometric error cost function
The course contents: Unit 4
• Feature Detection and Description
– Feature detection
– Harris operator
– Feature matching
– Scale invariant feature transform
– Speeded up robust features
– Hessian operator
– FAST
– BRIEF
– ORB
– HOG
– Texture descriptors
– Image descriptors
– Applications of descriptors
24
What are Cohorts
• Computer Vision
by Prof. Jayanta Mukhopadhyay ,
IIT Kharagpur
Skill Set
EMPLOYABILITY
Advanced Python Programming