CS436 CS5310 EE513 L01 Introduction

The document outlines the course CS436/CS5310/EE513 on Computer Vision Fundamentals, taught by Murtaza Taj at LUMS, covering topics such as feature detection, visual recognition, and geometric transformations. It includes a tentative course outline, reading materials, and evaluation criteria for assignments and projects. The goal of the course is to enable students to make useful decisions about real physical objects and scenes based on sensed images.


CS436/CS5310/EE513

Computer Vision Fundamentals

Murtaza Taj
murtaza.taj@lums.edu.pk

Lecture 1: Introduction
Mon, 04th Sep 2023
Introduction
! Murtaza Taj (PhD)
! PhD from Queen Mary University of London in Dec 2009
! Joined LUMS in Jan 2011
! Faculty Director Technology for People Initiative
! Director Computer Vision & Graphics Lab
! Research: 2D & 3D Scene Understanding, Image Processing,
Computer Graphics, Machine Learning

tpi.lums.edu.pk cvlab.lums.edu.pk
Computer Vision & Graphics Lab (CVG Lab)
https://cvlab.lums.edu.pk

! Remote Sensing
! Image matching
! Retrieval, classification, object detection

! Digital Cultural Heritage

! 2D/3D Scene Understanding
! 3D Object Retrieval, Point cloud segmentation

! Medical Imaging
! Generative Adversarial Networks
! Image classification

! Others
! Self-driving cars (End-to-end, How well am I driving)
! Visual Question Generation (NLP), Pose Estimation
Murtaza Taj
Co-founder Groopic Inc. CA
What is the core business of Google & Facebook?
Murtaza Taj
Co-founder Ingrain Media Inc. CA
Course Outline
Reading
! Text
! Computer Vision: Algorithms and Applications, by Richard Szeliski
This is the draft of a recently written textbook, available in PDF form at http://szeliski.org/Book/

! Introductory Techniques for 3D Computer Vision, by Emanuele Trucco and Alessandro Verri
Very useful, especially for topics related to geometry

! Journals
! IEEE Transactions on Pattern Analysis and Machine Intelligence
! ACM Transactions on Graphics (TOG)

! Conferences
! IEEE CVPR, ICCV, ECCV
! SIGGRAPH
Tentative Course Outline
Topic                                                    Lectures   Reading
Introduction                                             1          Szeliski Ch 1
• Course Introduction, policies, etc
• Overview of Computer Vision
• Why are computer vision problems hard?
• Examples of successful computer vision applications
• Overview of course topics
Feature Detection                                        2          Szeliski Ch 4
• Edge Detection/Convolution                                        Trucco Ch 4-5
• Filter Banks
Visual Recognition                                       4
• Deep Learning & CNN (Tutorial)
• Object Classification (ImageNet, LeNet etc)
• Object Localization

Overlap with CS437 DL? (around 4-5 lectures)
Tentative Course Outline
Topic                                                    Lectures   Reading
Geometric Transformations and Camera Models              10-18      Szeliski Ch 2
• 2D transformations
• Estimating 2D Transformation
• 3D transformations
• Camera Models
• Camera Calibration
Dense Motion Estimation and Image Stitching              19-21      Szeliski Ch 8-9
• Optical Flow
• Pyramids
• Parametric Methods for Image Alignment
Structure from Motion                                    22-23      Szeliski Ch 7
• Rigid SFM (Factorization Method)
Stereo                                                   24-26      Trucco Ch 7-8
• Basic Formulation
• Epipolar Constraint
• Estimation of Fundamental Matrix
Tentative Course Outline
Overlap with CS452 CG? (around 4-5 lectures)
Course Introduction
! Programming Environment
! Python (OpenCV, TensorFlow, Keras, PyTorch)

! Assignments
! Written and Programming assignments (approx. 2+3)
! Associated report (discussion on results)

! Project
! Project is a simple extension of assignments with some room for innovation
! Project evaluation meetings at regular intervals
! In a group of 2

Instrument     Weight
Assignments    30%
Quizzes        5%
Project        20%
Mid-term       20%
Exam           25%
Any Questions?
Introduction
Slide Credits
! CS 436 - LUMS, Dr. Sohaib Khan
! CS131 - Stanford, Fei Fei Li
! CS231 - Stanford, A. Karpathy
! UC Berkeley, Jitendra Malik
! and many more
Introduction
! Sight is our primary sensation
! 80% of our first 12 years of learning is
through vision
! 30% of neurons in brain’s cortex are
dedicated to vision, compared to 8% for
touch, 2% for hearing

! Human Experience
What is the goal of Computer Vision?

“The goal of Computer Vision is to make useful decisions about real physical objects and scenes based on sensed images”

Image Processing, Computer Graphics, Computer Vision
2D & 3D Scene Understanding

! Image Processing: Image IN, Image OUT
! Computer Graphics: Symbolic Info IN, Image OUT
! Computer Vision: Image IN, Symbolic Decision OUT
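The input/output distinction between the three fields can be illustrated with a minimal Python sketch. The function names (`invert`, `render_square`, `contains_bright_object`) and the tiny 8x8 test image are hypothetical and chosen only for illustration; they are not taken from the slides.

```python
import numpy as np

def invert(image):
    """Image processing: image in, image out (photographic negative)."""
    return 255 - image

def render_square(size, top, left, side):
    """Computer graphics: symbolic description in, image out."""
    image = np.zeros((size, size), dtype=np.uint8)
    image[top:top + side, left:left + side] = 255  # draw a white square
    return image

def contains_bright_object(image, threshold=128):
    """Computer vision: image in, symbolic decision out."""
    return bool((image > threshold).any())

scene = render_square(size=8, top=2, left=3, side=4)  # symbols -> image
negative = invert(scene)                              # image -> image
decision = contains_bright_object(scene)              # image -> decision
print(decision)
```

Real vision systems make far richer decisions than a brightness test, but the signature stays the same: pixels go in, and a symbolic answer comes out.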
What is Computer Vision?

Slide acknowledgement: Prof. Fei Fei Li’s CS131 class at Stanford

Interpretation
! Scene understanding

! Multi-view Geometry
Scene Understanding
What we would like to infer…

Will Person B put some money into Person C’s tip bag?
What kind of information can we extract from an image?
! 3D Information
! Semantic Information
Safe City Project
Traffic E-Challan
Multi-view Geometry
Camera Projection
! 3D to 2D projection

Diagram: Real object (3D), Camera with Optical centre, Photograph (2D), Laws of Optics
Slide credit: Kenton Anderson
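A 3D-to-2D projection can be sketched with the ideal pinhole model, assuming the optical centre is at the origin and the camera looks along the Z axis, so a point (X, Y, Z) maps to (fX/Z, fY/Z). The function name `project_points` and the focal length value are assumptions for illustration, not part of the slides.

```python
import numpy as np

def project_points(points_3d, focal_length):
    """Project 3D camera-frame points onto the image plane (ideal pinhole).

    Assumes the optical centre is at the origin and the camera looks along +Z.
    """
    points_3d = np.asarray(points_3d, dtype=float)
    z = points_3d[:, 2]
    if np.any(z <= 0):
        raise ValueError("points must lie in front of the camera (Z > 0)")
    # Perspective division: x = f * X / Z, y = f * Y / Z
    return focal_length * points_3d[:, :2] / z[:, None]

# Two different 3D points on the same ray through the optical centre
points = [(1.0, 2.0, 4.0), (2.0, 4.0, 8.0)]
image_points = project_points(points, focal_length=2.0)
print(image_points)
```

Both points land on the same image location, which shows why depth is lost in a single photograph and why 2D-to-3D reconstruction needs additional views or assumptions.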
Geometric Transformations
! 2D-to-2D (image-to-image)
! 3D-to-3D (world-to-world)
! 3D-to-2D (camera model)
! 2D-to-3D (3D reconstruction)
! Shape from Stereo
! Structure from Motion
! Single View Reconstruction
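As one concrete case of a 2D-to-2D (image-to-image) transformation, the sketch below builds a rotation plus translation as a 3x3 matrix in homogeneous coordinates and applies it to a few points. The helper names `rigid_transform_2d` and `apply_transform` are hypothetical; the slides do not prescribe a particular implementation.

```python
import numpy as np

def rigid_transform_2d(angle_deg, tx, ty):
    """Build a 3x3 homogeneous matrix: rotate by angle_deg, then translate."""
    theta = np.deg2rad(angle_deg)
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s, tx],
                     [s,  c, ty],
                     [0.0, 0.0, 1.0]])

def apply_transform(matrix, points):
    """Apply a 3x3 homogeneous transform to an (N, 2) array of 2D points."""
    points = np.asarray(points, dtype=float)
    homogeneous = np.hstack([points, np.ones((len(points), 1))])
    mapped = homogeneous @ matrix.T
    # Divide by the last coordinate so the same code also handles projective maps
    return mapped[:, :2] / mapped[:, 2:3]

transform = rigid_transform_2d(angle_deg=90, tx=1.0, ty=0.0)
moved = apply_transform(transform, [(0.0, 0.0), (1.0, 0.0)])
print(np.round(moved, 6))
```

Writing the transformation as a single matrix means that composing several transformations reduces to matrix multiplication.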
Image plane-to-Image plane
Pakistan Super League
Vision for Robotics, Space exploration

! Vision system used for:

! Panorama stitching
! 3D terrain modelling
! Obstacle detection, position tracking
! …
Shape from Stereo

Source: http://www-robotics.jpl.nasa.gov
Multi-camera Surveillance System
Computer Vision

Convolution, Stereo, Transformations, Feature extraction, Image Recognition
Camera Model, Multi-View Geometry, Scene Understanding
Face Recognition, Machine Learning, 3D Reconstruction
Mobile Apps, Photogrammetry, Structure from Motion
Shop Analytics, Surveillance, Object Detection, Animated Movies
Innovation, Startups
Key Questions
! How is a 3D world projected onto a 2D image by a camera?

! How are multiple images of the same world related to each other?

! How can we reconstruct the 3D world from images?

! What objects are present in the scene and where?
Next …
! Edge Detection
Additional Slides
The Complexity of Perception
Why is computer vision hard?
! Computers are good at numerical processing

! Humans are good at perceptual processing

! We want to use a computer to mimic human perception… which is complex to understand
Perception

Ref: Light and Vision: LIFE Science Library

Perception
What is this?
Recognition Helps Reorganization
Any Questions?
Writing Programs that “See”

An Example
Representing a Digital Image
! It is natural to represent image as a matrix
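A minimal sketch of the matrix representation, assuming NumPy and a made-up 2x3 grayscale image with 8-bit intensities (0 is black, 255 is white). The pixel values are illustrative only.

```python
import numpy as np

# A tiny grayscale image: 2 rows, 3 columns, one intensity per pixel
image = np.array([[0,   50, 100],
                  [150, 200, 250]], dtype=np.uint8)

rows, cols = image.shape   # matrix dimensions = image height and width
pixel = image[1, 2]        # indexing is [row, column], i.e. [y, x]

# Brighten by 20, using a wider type so values above 255 clip instead of wrapping
brighter = np.clip(image.astype(np.int16) + 20, 0, 255).astype(np.uint8)

print(rows, cols, pixel)
print(brighter)
```

A colour image is usually stored the same way with a third axis for the channels, so its shape becomes (rows, columns, 3).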
The goal of Computer Vision - Image Understanding

Slide acknowledgement: Prof. Fei Fei Li’s CS131 class at Stanford

What kind of information can we extract from an image?
