Lec00 Intro For Web Highlighted

Uploaded by

abbasahmer734

CS5670: Intro to Computer Vision

(Cornell Tech)
Depth from a single image
Visualizing scenes from tourist photos
Reconstructing dynamic 3D scenes

DynIBaR: Neural Dynamic Image-Based Rendering [https://dynibar.github.io/]
Zhengqi Li, Qianqian Wang, Forrester Cole, Richard Tucker, Noah Snavely
CVPR 2023
Today
1. What is computer vision?

2. Why study computer vision?

3. Course overview

4. Images & image filtering [time permitting]

Today
• Readings
– Szeliski, Chapter 1 (Introduction)
Every image tells a story
• Goal of computer vision: perceive the “story” behind the picture
• Compute properties of the world
– 3D shape
– Names of people or objects
– What happened?
The goal of computer vision
Can computers match human perception?
• Yes and no (mainly no)
– computers can be better at “easy” things
– humans are better at “hard” things
• But huge progress
– Accelerating in the last five years due to deep learning
– What is considered “hard” keeps changing
Human perception has its shortcomings

https://twitter.com/pickover/status/1460275132958662657/
But humans can tell a lot about a scene from a little information…

Source: “80 million tiny images” by Torralba, et al.

The goal of computer vision
The goal of computer vision
• Compute the 3D shape of the world

ZED 2i Camera
The goal of computer vision
• Recognize objects and people

Terminator 2, 1991
slide credit: Fei-Fei, Fergus & Torralba
[Figure: image with labeled regions: sky, building, flag, face, banner, wall, street lamp, bus, cars]
slide credit: Fei-Fei, Fergus & Torralba

The goal of computer vision
• “Enhance” images
The goal of computer vision
• Forensics

Source: Nayar and Nishino, “Eyes for Relighting”

The goal of computer vision
• Improve photos (“Computational Photography”)

Super-resolution (source: 2d3)
Depth of field on cell phone camera (source: Google Research Blog)
Removing objects (Google Magic Eraser)
Low-light photography (credit: Hasinoff et al., SIGGRAPH ASIA 2016)
April 10, 2019
Why study computer vision?
• Billions of images/videos captured per day

• Huge number of potential applications

• The next slides show the current state of the art
Optical character recognition (OCR)
• If you have a scanner, it probably came with OCR software

Digit recognition, AT&T labs (1990s): http://yann.lecun.com/exdb/lenet/
License plate readers: http://en.wikipedia.org/wiki/Automatic_number_plate_recognition

Sudoku grabber
http://sudokugrab.blogspot.com/

Automatic check processing

Face detection

• Nearly all cameras detect faces in real time
– (Why?)
Face analysis and recognition
Vision-based biometrics

Who is she? Source: S. Seitz

Vision-based biometrics

“How the Afghan Girl was Identified by Her Iris Patterns” (Read the story)

Source: S. Seitz
Login without a password

Fingerprint scanners on many new smartphones and other devices
Face unlock on Apple iPhone X
See also http://www.sensiblevision.com/
New York Times, Jan. 18, 2020
by Kashmir Hill
Bird identification

Merlin Bird ID (based on Cornell Tech technology!)

Special effects: shape capture

The Matrix movies, ESC Entertainment, XYZRGB, NRC

Source: S. Seitz
Special effects: motion capture

Pirates of the Caribbean, Industrial Light and Magic
Source: S. Seitz

3D face tracking w/ consumer cameras

Snapchat Lenses

Face2Face system (Thies et

Image synthesis

Karras, et al., Progressive Growing of GANs for Improved Quality, Stability, and Variation, ICLR
Which face is real?

https://www.whichfaceisreal.com/
Image synthesis

“An astronaut riding a horse in a photorealistic style” – DALL-E 2
“A photo of a Corgi dog riding a bike in Times Square. It is wearing sunglasses and a beach hat” – Imagen
Sports

Sportvision first down line

Explanation on www.howstuffworks.com

Source: S. Seitz
Smart cars

• Mobileye
• Tesla Autopilot
• Safety features in many cars
Self-driving cars

Waymo
Robotics

NASA’s Mars Curiosity Rover: https://en.wikipedia.org/wiki/Curiosity_(rover)
Amazon Picking Challenge: http://www.robocup2016.org/en/events/amazon-picking-challenge/

Amazon Prime Air Amazon Scout

Medical imaging

3D imaging (MRI, CT)
Skin cancer classification with deep learning: https://cs.stanford.edu/people/esteva/nature/
Virtual & Augmented Reality

6DoF head tracking
Hand & body tracking
3D scene understanding
3D-360 video capture

Current state of the art
• You just saw many examples of current systems.
– Many of these are less than 5 years old

• Computer vision is an active research area, and rapidly changing
– Many new apps in the next 5 years
– Deep learning and generative methods powering many modern applications
• Many startups across a dizzying array of areas
– Generative AI, robotics, autonomous vehicles, medical imaging, construction, inspection, VR/AR, …
Why is computer vision difficult?

Viewpoint variation

Credit: Flickr user michaelpaul

Scale
Illumination
Why is computer vision difficult?

Motion (Source: S. Lazebnik)

Intra-class variation

Background clutter Occlusion

Challenges: local ambiguity

slide credit: Fei-Fei, Fergus & Torralba

But there are lots of visual cues we can use…

Source: S. Lazebnik
Bottom line
• Perception is an inherently ambiguous problem
– Many different 3D scenes could have given rise to a given 2D image
– We often must use prior knowledge about the world’s structure
Artist Julian Beever with his anamorphic Coke bottle
Image source: F. Durand
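The ambiguity described in the bottom-line slide can be made concrete with a small sketch. It assumes an idealized pinhole camera at the origin looking down the Z axis; the function name `project` and the sample points are hypothetical and not part of the original slides. Two 3D points on the same viewing ray, one near and one four times farther away, land on the same image location.

```python
def project(point, focal_length=1.0):
    """Pinhole projection of a 3D point (X, Y, Z), with Z > 0, onto the image plane.

    Assumption for illustration: camera at the origin, looking along +Z.
    """
    x, y, z = point
    if z <= 0:
        raise ValueError("point must be in front of the camera (Z > 0)")
    return (focal_length * x / z, focal_length * y / z)


# A nearby point and a point four times farther along the same viewing ray.
near_point = (1.0, 0.5, 2.0)
far_point = tuple(4.0 * c for c in near_point)

print(project(near_point))
print(project(far_point))
```

Both calls return the same image coordinates, so the image alone cannot tell the two points apart. Resolving that requires extra cues or prior knowledge, which is the point the slide makes.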
CS5670: Introduction to Computer Vision

• Project-based course whose goal is to teach you the basics of computer vision – image processing, geometry, recognition – in a hands-on way
Course requirements
• Prerequisites
– Data structures
– Good working knowledge of Python programming
– Linear algebra
– Vector calculus

• Course does not assume prior imaging experience
– computer vision, image processing, graphics, etc.
Course overview
(tentative)
1. Low-level vision
– image processing, edge detection, feature detection, cameras, image formation
2. Geometry & appearance
– projective geometry, stereo, structure from motion, optimization, lighting & materials
3. Recognition & generative models
– object classification, deep learning,
1. Low-level vision
• Basic image processing and image formation

Filtering, edge detection
Feature extraction
Image formation

Project: Hybrid images
Project: Feature detection and matching
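As a rough illustration of the filtering and edge detection topics listed for this unit, the sketch below slides a small kernel over a grayscale image stored as a list of rows. It is a minimal pure-Python version under stated assumptions, not the course's implementation: the function name `filter2d`, the zero padding at the border, and the choice of a horizontal Sobel kernel are all illustrative.

```python
def filter2d(image, kernel):
    """Cross-correlate a grayscale image (list of rows) with an odd-sized kernel.

    Assumption for illustration: pixels outside the image count as zero.
    """
    h, w = len(image), len(image[0])
    kh, kw = len(kernel), len(kernel[0])
    ry, rx = kh // 2, kw // 2
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            acc = 0.0
            for j in range(kh):
                for i in range(kw):
                    yy, xx = y + j - ry, x + i - rx
                    if 0 <= yy < h and 0 <= xx < w:
                        acc += kernel[j][i] * image[yy][xx]
            out[y][x] = acc
    return out


# Toy image: dark left half, bright right half, i.e. one vertical edge.
image = [[0, 0, 1, 1] for _ in range(4)]

# Horizontal Sobel kernel: responds to left-to-right intensity changes.
sobel_x = [[-1, 0, 1],
           [-2, 0, 2],
           [-1, 0, 1]]

edges = filter2d(image, sobel_x)
for row in edges:
    print(row)
```

The response is largest in the two columns next to the dark-to-bright transition and zero in the flat region on the left. The nonzero values in the last column come from the zero padding, which is one reason border handling is a design choice in practice.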
2. Geometry & appearance

Image credit: IDS Imaging

Projective geometry
Stereo vision
Multi-view stereo
Structure from motion

Project: Creating panoramas
Project: 3D reconstruction
3. Recognition, Deep Learning & Generative Models

“dog”

Image classification
Convolutional Neural Networks

“a class watching a computer vision lecture at Cornell Tech”

Image generation
Project: Neural Radiance Fields (NeRFs)
Questions?
