0% found this document useful (0 votes)

33 views50 pages

Week1_Lecture2

Lecture notes of CV801

Uploaded by

Abrham Gebreselasie

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

33 views50 pages

Week1_Lecture2

Lecture notes of CV801

Uploaded by

Abrham Gebreselasie

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 50

CV801: Advanced Computer Vision

Week 1 Lecture 2
Class Participation and Peer-Review (10% Weightage)
Class-participation: 5%
• In-person Attendance: 3%.
• Full mark: In-person attendance in 18 out of 30 lectures AND 7 out of 15 labs

• Reading research papers in advance, and providing correct answers for the in-class room Quizzes-2%

Peer Review: 5%
• Participate in the discussions related to project presentations and paper presentations of other
students: 1%
• 1-page review report on Projects of other groups ( Each person write two peer-review report): 4%

2
Introduction and Overview of Computer Vision
What is Computer Vision?

• Ability of computers
• To understand visual data
• For example, images, videos…

• Automate tasks
• Which human visual system can perform
What is Computer Vision?
• To extract “meaning” from pixels. To bridge the gap between image pixels and
“meaning” (semantic)!

What we see!
What computer sees!
What do we have here?

Seems easy ……..

Wrong! Vision is Hard
• Vision is an amazing feature of natural intelligence
• Around 50% of neural tissues of human brain is directly or indirectly
related to vision, which assists in visual learning.

Hardware perspective:
Is that a Massive digital data collections
queen or a
bishop?
Why Study Computer Vision?
• Engineering point of view - Computer Vision helps to solve many
practical problems: business potential
• Scientific point of view - Human kind of visual system is one of
the grand challenges of Artificial Intelligence (AI)
AI itself is a grand challenge of computing
• Massive visual data on internet

More than 70 million photos are shared on Instagram every day (more than 50 billion photos in total)

300 million images a day (More than 350 billion photos in total)

Business potential Substantial Commercial Interest

• Google
• Meta AI/Facebook
• Apple

Autonomous Driving Security Computer vision

Health
technology can
improve our lives

Biometric Access Comfort: Robot Fun: Virtual Avatar

Why Study Computer Vision?

12
Why Study Computer Vision?

• CVPR conference ranking (Engineering) as of 2024

13
Why Study Computer Vision?
• CVPR papers
2023 2024
Why Study Computer Vision?
Substantial Commercial Interest

List of CVPR 2022 sponsors

CV801 Topics vs Major topics in CVPR 2023

• Covering 8 Out of 12 top CVPR 2023 topics

• Covering ~12 topics

16
Acceptance Rate for Each Topic: CVPR 2024

17
Common Computer Vision Tasks

18
Common Computer Vision Tasks
Image Categorization/Recognition:

CAT
Common Computer Vision Tasks

Scene Recognition:
Is this an outdoor image?
21
Activity Recognition

Activity:
What is this person doing in this image?
Common Computer Vision Tasks: Detection

Detection:
Where is a car in this image?
Common Computer Vision Tasks: Detection

24
Semantic Segmentation

GRASS, CAT, TREE, SKY

25
Instance Segmentation

DOG, DOG, CAT

26
Common Computer Vision Tasks: Segmentation

Semantic Object Instance

Classification
Segmentation Detection Segmentation

CAT GRASS, CAT, TREE, DOG, DOG, CAT DOG, DOG, CAT
SKY

No spatial extent No objects, just pixels Multiple Objects

Video Instance Segmentation

28
Research Paper Presentations (10% Weightage)
Objective
• Learn to systematically introduce a research topic
• Improve teaching and presentation skills
• Involve in critical discussions about research papers
How to Select a Topic?
• Suggested topics.
• Specialized Applications of Segmentation: Eg. medical image segmentation (~3 presentations)
• Vision Foundation Models: Segment Anything Model (SAM) (~2 presentations)
• Efficient Architectures for Computer Vision Applications: State-space Models and Mamba (~4 presentations)
• Conversational LLMs and Vision-Language Models (~2 presentations)
• Image Generation using Diffusion Models (~5 presentations)
• Remote sensing, change detection (~2 presentations)
• Human-centric Vision (~2 presentations)
• All presenters on the same topic should work together to systematically introduce the concepts.

29
Specialized Applications of Segmentation: 3D Medical Image segmentation

UNETR: Transformers for 3D Medical Image Segmentation, WACV 2022

30
Remote Sensing Change Detection

Change Detection Methods for Remote Sensing in the Last Decade: A Comprehensive Review.
https://arxiv.org/pdf/2305.05813.pdf

34
Foundation Models in Vision

Foundational Models Defining a New Era in Vision: A Survey and Outlook

38
https://github.com/awaisrauf/Awesome-CV-Foundational-Models
Generalizable Localization Models
Segment Anything Model (SAM- https://arxiv.org/abs/2304.02643)
SAM for Synthetic Embryo Detection, Counting and Segmentation
(without training the model on target dataset or target category)

Embryo detection & counting Segmentation

Input Count=307
39
Large Language Models

40
Multi-Model LLMs
mbzuai.ac.ae
Multi-Model LLMs
Image Generation Using Diffusion Models
Diffusion Models in Vision: A Survey https://arxiv.org/pdf/2209.04747.pdf

“A diffusion model is a deep generative model that is based on two stages, a forward diffusion stage and
a reverse diffusion stage. In the forward diffusion stage, the input data is gradually perturbed over
several steps by adding Gaussian noise. In the reverse stage, a model is tasked at recovering the original
input data by learning to gradually reverse the diffusion process, step by step “

Forward

Reverse
Image Generation (i)

1. Diffusion Models 2. Multi Model LLM Meets Diffusion Models

Eg: For Person Image Synthesis, CVPR 2023

mbzuai.ac.ae
Image Generation (ii)

3. 3D-aware Image Generation 4. Image Generation for Healthcare Applications

ICCV 2023 MICCAI 2023

mbzuai.ac.ae
Human-centric Scene Understanding

Example: Pedestrian detection, Multi-camera person search, Crowd counting, Pose estimation, Activity
recognition

Pedestrian Detection Person Search Crowd Counting Human Pose Estimation

mbzuai.ac.ae
ARCHITECTURE DESIGN CHOICES FOR
REAL-WORLD VISION APPLICATIONS
• Development of Efficient network architectures
For image classification, object detection, segmentation
and human pose estimation in images and videos.

Vision Mamba

• Mamba for Medical Image Segmentation

mbzuai.ac.ae
Questions?
Survey Outcome
Expected Deep learning and CNN backgrounds

• Perceptron. • Regularization

• Multi-layer Perceptron • Dropout

• Backpropagation • Data Augmentation

• Stochastic gradient descent. • Batch normalization

• Cross entropy loss

• CNN layer
58
Summary
• Course Overview
• Introduction and Overview of Computer Vision
• Common Computer Vision tasks

mbzuai.ac.ae

Semantic Deep Learning Integrated With RGB Feature-Based Rule Optimization For Facility Surface Corrosion Detection and Evaluation
No ratings yet
Semantic Deep Learning Integrated With RGB Feature-Based Rule Optimization For Facility Surface Corrosion Detection and Evaluation
15 pages
CV #1 Course Introduction-1
No ratings yet
CV #1 Course Introduction-1
61 pages
CV_SVD_L01_P1_Intro
No ratings yet
CV_SVD_L01_P1_Intro
35 pages
CS7.505: Computer Vision: Spring 2022
No ratings yet
CS7.505: Computer Vision: Spring 2022
46 pages
LectureNotes PDF
No ratings yet
LectureNotes PDF
212 pages
Cv2021-Lec1-Introduction 1600 PDF - Gdrive.vip
No ratings yet
Cv2021-Lec1-Introduction 1600 PDF - Gdrive.vip
61 pages
Unit 1
No ratings yet
Unit 1
186 pages
Lec 1 - 2
No ratings yet
Lec 1 - 2
39 pages
Overview
No ratings yet
Overview
5 pages
Computer Vision ch1
No ratings yet
Computer Vision ch1
80 pages
intro
No ratings yet
intro
66 pages
01 - Introduction
No ratings yet
01 - Introduction
37 pages
Cv Digital Notes
No ratings yet
Cv Digital Notes
77 pages
CompVisNotes PDF
No ratings yet
CompVisNotes PDF
115 pages
Lecture 1
100% (1)
Lecture 1
21 pages
Lec 00
No ratings yet
Lec 00
76 pages
Computer Vision Class Notes
No ratings yet
Computer Vision Class Notes
4 pages
Computer Visiondk
No ratings yet
Computer Visiondk
12 pages
cxvxfv
No ratings yet
cxvxfv
12 pages
PDF Joiner
No ratings yet
PDF Joiner
38 pages
Chapter 1 - Introduction To CV
No ratings yet
Chapter 1 - Introduction To CV
49 pages
A Comprehensive Guide to Computer Vision
No ratings yet
A Comprehensive Guide to Computer Vision
6 pages
Computer Vision SM-1
No ratings yet
Computer Vision SM-1
26 pages
CV 01 Introduction
No ratings yet
CV 01 Introduction
14 pages
Computer Vision: From Recognition To Geometry
No ratings yet
Computer Vision: From Recognition To Geometry
26 pages
Lec00 Intro For Web
No ratings yet
Lec00 Intro For Web
81 pages
Format of 1st Page - Seminar
No ratings yet
Format of 1st Page - Seminar
3 pages
Group 17 Computer Vision @Lcd-1
No ratings yet
Group 17 Computer Vision @Lcd-1
25 pages
1a. Introduction
No ratings yet
1a. Introduction
32 pages
CV - Lecture 1 - Iintroduction
No ratings yet
CV - Lecture 1 - Iintroduction
24 pages
CV
No ratings yet
CV
48 pages
grp3_computerVision (4)
No ratings yet
grp3_computerVision (4)
28 pages
Lecture Notes
No ratings yet
Lecture Notes
144 pages
CV Unit 1
No ratings yet
CV Unit 1
30 pages
Cv Unit 1 Overview of Computer Vison and Application
No ratings yet
Cv Unit 1 Overview of Computer Vison and Application
51 pages
Technologies 12 00015
No ratings yet
Technologies 12 00015
40 pages
UNIT-I_Introduction to Computer Vision
No ratings yet
UNIT-I_Introduction to Computer Vision
45 pages
Raz Report Final
No ratings yet
Raz Report Final
37 pages
Computer Vision
No ratings yet
Computer Vision
14 pages
CV_UNIT_1
No ratings yet
CV_UNIT_1
17 pages
Lec 1
No ratings yet
Lec 1
51 pages
Computer Vision
No ratings yet
Computer Vision
10 pages
DL4CV_Week01_Part01
No ratings yet
DL4CV_Week01_Part01
35 pages
Introduction To CVIP
No ratings yet
Introduction To CVIP
33 pages
CV_Lecture_1-DD-Don
No ratings yet
CV_Lecture_1-DD-Don
38 pages
Unit 5 Introduction Robot Vision
No ratings yet
Unit 5 Introduction Robot Vision
60 pages
T2310 TDS3651 L01 Introduction
No ratings yet
T2310 TDS3651 L01 Introduction
73 pages
Topic 5 Computer Vision
No ratings yet
Topic 5 Computer Vision
65 pages
Week5_Computer_Vision
No ratings yet
Week5_Computer_Vision
58 pages
CPCS335 - Chapter 9-Final
No ratings yet
CPCS335 - Chapter 9-Final
24 pages
Computer Vision (1) (2)
No ratings yet
Computer Vision (1) (2)
14 pages
Computer Vision
No ratings yet
Computer Vision
8 pages
0
No ratings yet
0
8 pages
Discussion 1 - Introduction
No ratings yet
Discussion 1 - Introduction
26 pages
CV&IP Chapter 1
No ratings yet
CV&IP Chapter 1
27 pages
Computer Vision Class X
No ratings yet
Computer Vision Class X
17 pages
Week 9 Lecture Notes
No ratings yet
Week 9 Lecture Notes
27 pages
Computer Vision Lecture 1
No ratings yet
Computer Vision Lecture 1
15 pages
CS312 Module 4
No ratings yet
CS312 Module 4
21 pages
Lec01 CT Intro
No ratings yet
Lec01 CT Intro
61 pages
Object Detection: Advances, Applications, and Algorithms
From Everand
Object Detection: Advances, Applications, and Algorithms
Fouad Sabry
No ratings yet
AI in Healthcare
No ratings yet
AI in Healthcare
16 pages
PHD Thesis Manufacturing Engineering
100% (3)
PHD Thesis Manufacturing Engineering
4 pages
Master of Science-Computer Science-Syllabus
No ratings yet
Master of Science-Computer Science-Syllabus
22 pages
Handwritten Gujarati Character Recognition Based On Discrete Cosine Transform
No ratings yet
Handwritten Gujarati Character Recognition Based On Discrete Cosine Transform
4 pages
Melanoma Classification A Comprehensive Survey (3 240314 220858
No ratings yet
Melanoma Classification A Comprehensive Survey (3 240314 220858
67 pages
Superpixel-Based Fast Fuzzy C-Means Clustering For Color Image Segmentation
No ratings yet
Superpixel-Based Fast Fuzzy C-Means Clustering For Color Image Segmentation
19 pages
FIERY
No ratings yet
FIERY
16 pages
Zero To Hero
50% (2)
Zero To Hero
253 pages
RSOBIA - A New OBIA Toolbar and Toolbox in ArcMap
No ratings yet
RSOBIA - A New OBIA Toolbar and Toolbox in ArcMap
5 pages
Contour Based Tracking
No ratings yet
Contour Based Tracking
20 pages
Text Books
No ratings yet
Text Books
2 pages
Deep Stock Representation Learning From Candlestick
No ratings yet
Deep Stock Representation Learning From Candlestick
10 pages
Semantically Adversarial Scenario Generation With Explicit Knowledge Guidance
No ratings yet
Semantically Adversarial Scenario Generation With Explicit Knowledge Guidance
20 pages
A Condition-Based Dynamic Segmentation of Large Systems Using A
No ratings yet
A Condition-Based Dynamic Segmentation of Large Systems Using A
15 pages
Short Paper: A Trajectory-Based Ball Tracking Framework With Visual Enrichment For Broadcast Baseball Videos
No ratings yet
Short Paper: A Trajectory-Based Ball Tracking Framework With Visual Enrichment For Broadcast Baseball Videos
15 pages
Computer Methods and Programs in Biomedicine
No ratings yet
Computer Methods and Programs in Biomedicine
12 pages
CustomerSegmentationforaMobileTelecommunicationsCompanyBasedonServiceUsage
No ratings yet
CustomerSegmentationforaMobileTelecommunicationsCompanyBasedonServiceUsage
7 pages
CAPTCHA Security A Case Study
No ratings yet
CAPTCHA Security A Case Study
7 pages
SegNeXt Rethinking Convolutional Attention Design Segmentation
No ratings yet
SegNeXt Rethinking Convolutional Attention Design Segmentation
15 pages
The Automatic Number Plate Recognition System (Anpr)
No ratings yet
The Automatic Number Plate Recognition System (Anpr)
4 pages
Brosnan y Sun PDF
No ratings yet
Brosnan y Sun PDF
14 pages
Liver Cancer Detection Using CNN
100% (1)
Liver Cancer Detection Using CNN
16 pages
Soluble Solids Content and PH Prediction and Maturity Discrimination of Lychee Fruits Using Visible and Near Infrared Hyperspectral Imaging
No ratings yet
Soluble Solids Content and PH Prediction and Maturity Discrimination of Lychee Fruits Using Visible and Near Infrared Hyperspectral Imaging
10 pages
On Color Image Segmentation
No ratings yet
On Color Image Segmentation
17 pages
Region Growing
No ratings yet
Region Growing
4 pages
Decentralized Federated Learning for Healthcare Networks a Case Study on Tumor Segmentation (2)
No ratings yet
Decentralized Federated Learning for Healthcare Networks a Case Study on Tumor Segmentation (2)
16 pages
Satellite Image Classification
No ratings yet
Satellite Image Classification
52 pages
A Review Paper On Use of BERT Algorithm in Twitter Sentiment Analysis
No ratings yet
A Review Paper On Use of BERT Algorithm in Twitter Sentiment Analysis
5 pages
Vehicle Counting Method Based On Gaussian Mixture Models and Blob Analysis
No ratings yet
Vehicle Counting Method Based On Gaussian Mixture Models and Blob Analysis
4 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Week1_Lecture2

Uploaded by

Week1_Lecture2

Uploaded by

CV801: Advanced Computer Vision

Seems easy ……..

More than 500 hours of video uploaded every minute

Business potential Substantial Commercial Interest

List of CVPR 2024 sponsors

Autonomous Driving Security Computer vision

Biometric Access Comfort: Robot Fun: Virtual Avatar

• CVPR conference ranking (Engineering) as of 2024

List of CVPR 2022 sponsors

• Covering 8 Out of 12 top CVPR 2023 topics

• Covering ~12 topics

GRASS, CAT, TREE, SKY

DOG, DOG, CAT

Semantic Object Instance

No spatial extent No objects, just pixels Multiple Objects

UNETR: Transformers for 3D Medical Image Segmentation, WACV 2022

Foundational Models Defining a New Era in Vision: A Survey and Outlook

Embryo detection & counting Segmentation

1. Diffusion Models 2. Multi Model LLM Meets Diffusion Models

Eg: For Person Image Synthesis, CVPR 2023

3. 3D-aware Image Generation 4. Image Generation for Healthcare Applications

Pedestrian Detection Person Search Crowd Counting Human Pose Estimation

• Mamba for Medical Image Segmentation

• Multi-layer Perceptron • Dropout

• Backpropagation • Data Augmentation

• Cross entropy loss

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.