0% found this document useful (0 votes)
11 views32 pages

1a. Introduction

Computer vision is a branch of artificial intelligence that enables computers to interpret and understand visual data. Key tasks include image classification, object detection, and facial recognition, with applications in healthcare, autonomous vehicles, security, and more. The field has evolved significantly since its inception in the 1960s, with advancements in technology leading to various practical implementations today.

Uploaded by

atik
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views32 pages

1a. Introduction

Computer vision is a branch of artificial intelligence that enables computers to interpret and understand visual data. Key tasks include image classification, object detection, and facial recognition, with applications in healthcare, autonomous vehicles, security, and more. The field has evolved significantly since its inception in the 1960s, with advancements in technology leading to various practical implementations today.

Uploaded by

atik
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 32

Computer Vision

Instructor: Dr. Mohammad Khairul Islam


Professor, CSE, CU
Slide Courtesy: James Hays Image by
kirkh.deviantart.com
What is Computer Vision
• Computer vision is a field of artificial intelligence
(AI) that enables computers to interpret and
make decisions based on visual data from the
world.
• This involves the automatic extraction, analysis,
and understanding of useful information from a
single image or a sequence of images.
Computer Vision: Tasks
 Image Classification: Identifying what an image represents.
 Object Detection: Locating and identifying objects within an
image.
 Image Segmentation: Dividing an image into multiple segments
or regions to simplify analysis.
 Facial Recognition: Identifying or verifying individuals based on
their facial features.
 Pattern Recognition: Detecting patterns and regularities in data.
 3D Reconstruction: Reconstructing three-dimensional models
from two-dimensional images.
 Scene Understanding: Comprehending the context and
environment of a scene.
Computer Vision: Applications
 Healthcare: Assisting in diagnosing diseases from medical
images (e.g., MRI, X-rays).
 Autonomous Vehicles: Helping self-driving cars understand and
navigate their surroundings.
 Security: Enhancing surveillance systems through automatic
detection of unusual activities.
 Retail: Implementing automated checkout systems and
personalized shopping experiences.
 Manufacturing: Performing quality inspection and automation in
production lines.
 Agriculture: Monitoring crop health and managing farming
operations through drone imagery.
Computer Vision
Make computers understand images and video.

What kind of scene?

Where are the cars?

How far is the


building?


Vision is really hard
• Vision is an amazing feat of natural intelligence

– Visual cortex occupies about 50% of Macaque brain


– More human brain devoted to vision than anything else
Is that a
queen or a
bishop?

Obviously relative to the human brain,


the macaque monkey brain is roughly
1/12 of the volume (average monkey
brain volume 100 cm3, adult human
brain volume 1200 cm3) and, therefore,
our relative voxel resolution is not
comparable
Why Computer Vision Matters

Safety Health Security

Comfort Fun Access


Ridiculously brief history of computer vision

• 1966: Minsky assigns computer vision as an


undergrad summer project
• 1960’s: interpretation of synthetic worlds
• 1970’s: some progress on interpreting selected Guzman ‘68

images
• 1980’s: ANNs come and go; shift toward geometry
and increased mathematical rigor or hardship
• 1990’s: face recognition; statistical analysis in Ohta Kanade ‘78
vogue
• 2000’s: broader recognition; large annotated
datasets available; video processing starts
• 2030’s: robot uprising?
Turk and Pentland ‘91
How vision is used now
• Examples of state-of-the-art
Optical character recognition (OCR)
Technology to convert scanned docs to text
• If you have a scanner, it probably came with OCR software

Digit recognition, AT&T labs License plate readers


http://www.research.att.com/~yann/ http://en.wikipedia.org/wiki/Automatic_number_plate_recognition
Face detection
• Many new digital cameras now detect faces
– Canon, Sony, Fuji, …
Smile detection

Sony Cyber-shot® T70 Digital Still Camera


Object Recognition (in supermarkets)

LaneHawk by EvolutionRobotics
“A smart camera is flush-mounted in the checkout lane, continuously
watching for items. When an item is detected and recognized, the
cashier verifies the quantity of items that were found under the basket,
and continues to close the transaction. The item can remain under the
basket, and with LaneHawk,you are assured to get paid for it… “
Vision-based biometrics

“How the Afghan Girl was Identified by Her Iris Patterns” Read the story
wikipedia
Login without a password…

Fingerprint scanners on Face recognition systems now


many new laptops, beginning to appear more widely
other devices http://www.sensiblevision.com/
Object recognition (in mobile phones)

Point & Find, Nokia


Google Goggles
Special effects: shape capture

The Matrix
Special effects: motion capture

Pirates of the Carribean, Industrial Light and Magic


Sports

Sportvision first down line


Nice explanation on www.howstuffworks.com

http://www.sportvision.com/video.html
Smart cars

• Mobileye
– Vision systems currently in high-end BMW, GM, Volvo models

Slide content courtesy of Amnon Shashua


Google cars

Oct 9, 2010. "Google Cars Drive Themselves, in Traffic". The New York Times. John Markoff
June 24, 2011. "Nevada state law paves the way for driverless cars". Financial Post. Christine Dobby
Aug 9, 2011, "Human error blamed after Google's driverless car sparks five-vehicle crash". The
Star (Toronto)
Interactive Games: Kinect
• Object Recognition:
http://www.youtube.com/watch?feature=iv&v=fQ59dXOo63o
• Mario: http://www.youtube.com/watch?v=8CTJL5lUjHg
• 3D: http://www.youtube.com/watch?v=7QrnwoO1-8A
• Robot: http://www.youtube.com/watch?v=w8BmgtMKFbY
Vision in space

NASA'S Mars Exploration Rover Spirit captured this westward view from atop
a low plateau where Spirit spent the closing months of 2007.

Vision systems (JPL) used for several tasks


• Panorama stitching
• 3D terrain modeling
• Obstacle detection, position tracking
• For more, read “Computer Vision on Mars” by Matthies et al.
Industrial robots

Vision-guided robots position nut runners on wheels


Mobile robots

NASA’s Mars Spirit Rover


http://en.wikipedia.org/wiki/Spirit_rover http://www.robocup.org/

Saxena et al. 2008


STAIR at Stanford
Medical imaging

Image guided surgery


3D imaging
Grimson et al., MIT
MRI, CT
Computer Vision and Nearby Fields
• Computer Graphics: Models to Images
• Comp. Photography: Images to Images
• Computer Vision: Images to Models
Scope of Computer Vision
Computer Vision Robotics

Machine
Human Computer
Learning
Interaction

Image Processing
Graphics
Feature Matching Medical Imaging
Recognition
Computational
Photography
Neuroscience

Optics
Course Topics
• Interpreting Intensities
– What determines the brightness and color of a pixel?
– How can we use image filters to extract meaningful information from the
image?
• Correspondence and Alignment
– How can we find corresponding points in objects or scenes?
– How can we estimate the transformation between them?
• Grouping and Segmentation
– How can we group pixels into meaningful regions?
• Categorization and Object Recognition
– How can we represent images and categorize them?
– How can we recognize categories of objects?
• Advanced Topics
– Action recognition, 3D scenes and context, human-in-the-loop vision…
Textbook

http://szeliski.org/Book/

• Ref: Computer Vision: A modern Approach by


David A. Forsyth & Jean Ponce, Prentice Hall

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy