0% found this document useful (0 votes)

37 views6 pages

Computer Vision AIML Handout v1.0

The document outlines the course structure for 'Computer Vision' at the Birla Institute of Technology & Science, Pilani, detailing objectives, content, textbooks, and evaluation methods. It covers various topics including low-level and mid-level vision, object segmentation, image classification, and deep learning applications. The evaluation scheme includes quizzes, assignments, a mid-semester test, and a comprehensive exam, with specific guidelines for each component.

Uploaded by

psychosaniyan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

37 views6 pages

Computer Vision AIML Handout v1.0

Uploaded by

psychosaniyan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

BIRLA INSTITUTE OF TECHNOLOGY & SCIENCE, PILANI

WORK INTEGRATED LEARNING PROGRAMMES

Digital
Part A: Content Design
Course Title Computer Vision
Course No(s) AIML* ZG525 Computer Vision
Credit Units 4
Content Authors Ms. Seetha Parameswaran
Version 1.0
Date June 26th 2023

Course Objectives
No Course Objective
CO1 Students should understand the fundamentals of a camera producing an image,
including camera calibration, optical distortions, perspective corrections etc.
CO2 Students should be familiar with various building block algorithms in Computer
Vision, including Image processing and Deep Learning with emphasis on the
algorithm building blocks.
CO3 Students should create at least one end-user application.

Text Book(s)
T1 Szeliski, R., 2022. Computer vision: algorithms and applications. Springer Nature.
T2 Image Processing, Analysis, and Machine Vision: Milan Sonka, Vaclav Hlavac,
Roger Boyle, Fourth edition, Cengage Learning

Reference Book(s) & other resources

R1 Forsyth, D. A., & Ponce, J. (2002). Computer vision: a modern approach. Second
Edition. Prentice hall
R2 Practical Machine Learning for Computer Vision: End-to-End Machine Learning for
Images, O’Rielly, 2021
Content Structure

1 Computer Vision ( 4 hrs)

1.1 What is Computer Vision? (T1 Ch 1.1)
1.2 Why Computer Vision is hard? (T2 Ch 1.2)
1.3 Applications of Computer Vision (T1 Ch 1.1)
1.4 Image representation and image analysis tasks (T2 Ch 1.3)
1.5 Image digitization - Sampling and resolution (T2 Ch 2.2)
1.6 Digital Images (T2 Ch 2.3)
1.7 Digital Image types -Binary, Gray-scale and Color (Class Notes)
1.8 Color Images (T2 Ch 2.4)
1.9 Color spaces: RGB and HSV (T2 Ch 2.4)

2 Low-level Vision ( 3 hrs)

2.1 Histogram and Histogram equalization (T1 Ch 3.1.4)
2.2 Gray-scale transformation (T2 Ch 5.1.2)
2.3 Image Smoothing (T2 Ch 5.3.1)
2.4 Connected components in images (T1 Ch 3.3.4)
2.5 Use case: Sharpening, blur, and noise removal using Filtering (T1 Ch 3.4.4)

3 Mid-level Vision ( 4 hrs)

3.1 Edge Detection using Gradients, Sobel, Canny (T1 Ch4.2, T2 Ch 5.3.2, 5.3.5)
3.2 Line detection using Hough transforms (T1 Ch 4.3, T2 Ch 5.3.10)
3.3 Semantic information using RANSAC (T1 Ch 4.3, T2 Ch 10.3)
3.4 Image region descriptor using SIFT (T2 Ch 10.2)
3.5 Use case: Pedestrian detection Using HoG and SIFT descriptors and SVM
(T1 - Ch 14.2)

4 Object Segmentation ( 4 hrs)

4.1 Types of Segmentation: Semantic vs Instance (Class Notes)
4.2 Segmentation using Agglomerative clustering, Kmeans (R1 Ch 9.3)
4.3 Mean-shift clustering (T2 Ch 7.1)
4.4 Vision Transformer (Class Notes)
4.5 Popular DNN Architectures for Segmentation - Detectron family, SOLO,
CondInst, Segment Anything Model (SAM), InternImage (Class Notes)
4.6 Metrics for Object Segmentation (R1 Ch 9.5)
4.6.1 mean IoU
4.6.2 Pixel Accuracy,
4.6.3 Boundary Error: ABPE, BDE
4.7 Use cases for Object Segmentation - Crop classification from satellite
imagery

5 Image Classification using Deep Learning ( 3 hrs)

5.1 Pattern recognition methods in image understanding (T2 Ch 10.6), R1 Ch
15.3)
5.2 Popular DNN Architectures: MobileNet, XceptionNet (Class Notes)
5.3 Metrics for Image Classification (R1 Ch 15.1)
5.3.1 Model Accuracy Metrics
5.3.1.1 Accuracy, Confusion Matrix, TPR, FPR, FNR, Top-K accuracy
5.3.1.2 Precision, Recall, F1 Score
5.3.1.3 AUC-ROC, AUC-PR
5.3.1.4 Intersection-Over-Union (IoU)
5.3.2 Model Performance Metrics
5.3.2.1 FLOPs
5.3.2.2 Memory Footprint for @ specific precision
5.3.2.3 Inference Time on a specific hardware
5.3.3 Metrics for Image Classification.
5.3.3.1 Cross Entroy (Log Loss), Brier Score
5.3.3.2 Macro-Precision, Macro-Recall, Macro-F1
5.4 Example Use cases of Image Classification (R1 – Ch 15, 16)
5.4.1 Automated sorting of fruits based on size, shape, color
5.4.2 Apparel type classification from image
5.5 Classifying Images Of Single Objects (R1 Ch16.2)

6 Object detection and Recognition ( 3 hrs)

6.1 Object detection (T2 Ch 9.2, R1 Ch 17.1)
6.2 Mean-shift clustering (T2 Ch 9.2)
6.3 Using YOLO (Class Notes)
6.4 Metrics (class Notes)
6.4.1 Average-Precision (AP)
6.4.2 Mean-Average-Precision (mAP)
6.5 Multi label object detection and recognition (Class Notes)
6.5.1 Object Localization → Multilabel Classification
6.5.2 Difference between Multiclass vs Multilabel Classification
6.5.3 Popular Models: YOLO, SSD, Faster-RCNN
6.6 Use case: Skin detection Ref: T1 – 14.1 and R1 - Ch17, 18

7 Object tracking ( 4 hrs)

7.1 Motion detection (R1 Ch 11)
7.2 Tracking by Detection (R1 Ch 11.1)
7.3 Tracking with the Mean Shift Algorithm (R1 Ch 11.2)
7.4 Kalman Filters (R1 Ch 11.3)
7.5 DNN architectures: DeepSORT, SiamFC, GSDT, SMILEtrack, SPARSEtrack
7.6 Use case: Pedestrian tracking (add ref)

8 Visual Bag of Words and Semantic Hierarchy ( 4 hrs)

8.1 Knowledge representation (T2 Ch 9.1)
8.2 Syntactic pattern recognition (T2 Ch 9.4)
8.3 Scene labeling (T2 Ch 10.9)
8.4 Semantic image segmentation and understanding (T2 Ch 10.10)
8.5 Summarizing Images with Visual Words (R1 Ch 16.1.3)
8.6 Application: Patch Classification in image of Breast Tumors Detection

9 Edge devices for computer vision ( 2 hrs)

9.1 ESP32 Cam module, Raspberry PI, Banana Pi etc
9.2 Intel
9.2.1 Core and Atom Processors
9.2.2 NUC
9.2.3 Movidias VPUs
9.2.4 OneAPI and OpenVino Libraries
9.3 Nvidia
9.3.1 Jetson Platform - Nano, TX2, Orin
9.3.2 DeepStream Library, CUDA, CUDNN
9.4 Others
9.4.1 Google Coral

Optional Modules to be taken in Experiential Learning / Webinars / Tutorials /

Assignments

1 Face detection and Recognition

1.1 Boosting - Viola Jones algorithm (T2 Ch 10.7)
1.2 DNN architecture: MTCNN, FastFace, RetinaFace
1.3 Active appearance models (T2 Ch 10.5)
1.4 Metrics (class Notes)
1.4.1 IoU based metrics for Face Detection
1.4.2 True Acceptance Rate (TAR), False AR, False Rejection Rate, TAR
@ specific FAR, Top-K Identification Rate
1.5 Use case: Attendance system on face image (add ref)

2 Optical Character Recognition

2.1 Main challenges in OCR
2.2 Popular Approaches for OCR:
2.2.1 Edge and Contours based layout detection
2.2.2 LayoutLM, DiT
2.2.3 LSTM, YOLO
2.2.4 Tesseract, EasyOCR
2.3 Metrics for OCR
2.3.1 Accuracy: character, word, sentence level
2.3.2 String edit distance
2.3.3 mAP for text localization
2.4 Example Use cases of OCR
2.4.1 Vehicle Number Plate recognition
2.4.2 Invoice Parsing
Detailed Plan for Lab work

Module
Lab No. Lab Objective
Reference

Reading images
1 Displaying images 1
Color space conversion

Histogram equalization
2 Gray-scale transformation 2
Filtering applications like sharpening, blur, noise removal, smoothing

Edge detection using Sobel and Canny

Line detection using Hough Transform
3 RANSAC for semantic information 3
SIFT image descriptor
Predestrian detection using HoG and SIFT

Image segmentation using Kmeans

Mean-shift clustering for segmentation
4 4
Vision transformer for segmentation
Crop classification using satellite images

Fruit sorting using transfer learning

5 Apparel type classification using transfer learning 5
Comparison on metrics for evaluation (demo)

Mean shift clustering for object detection

6 Object detection using Yolo and Faster RCNN 6
Skin detection

Mean shift algorithm for object tracking

7 Kalman filtering for object tracking 7
Pedestrian tracking

8 Patch classification in images 8

Evaluation Scheme:
Legend: EC = Evaluation Component; AN = After Noon Session; FN = Fore Noon Session

No Name Type Duration Weight Day, Date, Session, Time

EC-1(a) Quizzes Online 10%

EC-1(b) Assignments Take Home 20%

EC-2 Mid-Semester Test Closed Book 30%

EC-3 Comprehensive Exam Open Book 40%

Note:
Syllabus for Mid-Semester Test (Closed Book): Topics in Session Nos. 1 to 8
Syllabus for Comprehensive Exam (Open Book): All topics (Session Nos. 1 to 16)

Important links and information:

Elearn portal: https://elearn.bits-pilani.ac.in or Canvas

Students are expected to visit the Elearn portal on a regular basis and stay up to date with
the latest announcements and deadlines.
Contact sessions: Students should attend the online lectures as per the schedule
provided on the Elearn portal.

Evaluation Guidelines:
1 EC-1 consists of two Quizzes. Students will attempt them through the course pages
on the Elearn portal. Announcements will be made on the portal, in a timely
manner.
2 EC-2 consists of either one or two Assignments. Students will attempt them
through the course pages on the Elearn portal. Announcements will be made on the
portal, in a timely manner.
3 For Closed Book tests: No books or reference material of any kind will be
permitted.
4 For Open Book exams: Use of books and any printed / written reference material
(filed or bound) is permitted. However, loose sheets of paper will not be allowed.
Use of calculators is permitted in all exams. Laptops/Mobiles of any kind are not
allowed. Exchange of any material is not allowed.
5 If a student is unable to appear for the Regular Test/Exam due to genuine
exigencies, the student should follow the procedure to apply for the Make-Up
Test/Exam which will be made available on the Elearn portal. The Make-Up
Test/Exam will be conducted only at selected exam centres on the dates to be
announced later.

It shall be the responsibility of the individual student to be regular in maintaining the self-
study schedule as given in the course hand-out, attend the online lectures, and take all the
prescribed evaluation components such as Assignment/Quiz, Mid-Semester Test and
Comprehensive Exam according to the evaluation scheme provided in the hand-out.

Computer Vision Syllabus
No ratings yet
Computer Vision Syllabus
2 pages
Fundamentals of Ict Notes
90% (41)
Fundamentals of Ict Notes
60 pages
Computer Vision and Object Recognition
No ratings yet
Computer Vision and Object Recognition
76 pages
License Plate Detection Using Yolov8X and Easy OCR: Abstract
No ratings yet
License Plate Detection Using Yolov8X and Easy OCR: Abstract
9 pages
Ocr Coursework Sample Size
100% (2)
Ocr Coursework Sample Size
6 pages
VEHICLE NUMBER PLATE DETECTION SYSTEM Ramyaa
No ratings yet
VEHICLE NUMBER PLATE DETECTION SYSTEM Ramyaa
41 pages
AD8703 BCV Unit V 2023
No ratings yet
AD8703 BCV Unit V 2023
83 pages
AI English Notes
No ratings yet
AI English Notes
18 pages
Number Plate Recognition
No ratings yet
Number Plate Recognition
5 pages
01-02 Introduction To CV and Segmentation
No ratings yet
01-02 Introduction To CV and Segmentation
85 pages
Advanced Techniques For Improved Bangladeshi Number Plate Detection and Character Recognition in Automated Parking Systems
No ratings yet
Advanced Techniques For Improved Bangladeshi Number Plate Detection and Character Recognition in Automated Parking Systems
11 pages
Project Report OCR
92% (25)
Project Report OCR
50 pages
Block 4
No ratings yet
Block 4
98 pages
Cviii 2024 Ws
No ratings yet
Cviii 2024 Ws
98 pages
Object Identify Recog. CV
No ratings yet
Object Identify Recog. CV
12 pages
Block 4 Output
No ratings yet
Block 4 Output
101 pages
Ee655: Computer Vision & Deep Learning: Koteswar Rao Jerripothula, PHD Department of Electrical Engineering Iit Kanpur
No ratings yet
Ee655: Computer Vision & Deep Learning: Koteswar Rao Jerripothula, PHD Department of Electrical Engineering Iit Kanpur
43 pages
CV SVD L02 P1 IntroImageProcColor
No ratings yet
CV SVD L02 P1 IntroImageProcColor
89 pages
6.question Bank
No ratings yet
6.question Bank
5 pages
0 Computer Vision Panikzettel
No ratings yet
0 Computer Vision Panikzettel
28 pages
Computer Vision Engineer Interview Preparation Guide
No ratings yet
Computer Vision Engineer Interview Preparation Guide
20 pages
Computer Vision
No ratings yet
Computer Vision
5 pages
Lecture 2 PDF
No ratings yet
Lecture 2 PDF
62 pages
Isx QH FDL U99 BG Z3
No ratings yet
Isx QH FDL U99 BG Z3
2 pages
Crowd Counting
No ratings yet
Crowd Counting
11 pages
Snap Cart Case Study
No ratings yet
Snap Cart Case Study
7 pages
Boğaç Ergene, Atabey Kaygun - Semantic Mapping of An Ottoman Fetva Collection (2021)
No ratings yet
Boğaç Ergene, Atabey Kaygun - Semantic Mapping of An Ottoman Fetva Collection (2021)
54 pages
Cviii 2024 Ws
No ratings yet
Cviii 2024 Ws
45 pages
QA-CAD Software Installation Beginners Guide
No ratings yet
QA-CAD Software Installation Beginners Guide
42 pages
Automatic Mail Sorting Machine
No ratings yet
Automatic Mail Sorting Machine
74 pages
Prerequisites: What Is Computer Vision? Vision For Measurement
No ratings yet
Prerequisites: What Is Computer Vision? Vision For Measurement
8 pages
Transcription and Mortgage Act
No ratings yet
Transcription and Mortgage Act
36 pages
1 Introduction
No ratings yet
1 Introduction
67 pages
Unregistered Vehicle Reorganization On Number Plate Using Raspberry Pi Kit
No ratings yet
Unregistered Vehicle Reorganization On Number Plate Using Raspberry Pi Kit
6 pages
Your Interactive Guide To The Digital World
No ratings yet
Your Interactive Guide To The Digital World
53 pages
Syllabus T.Y.B.Sc. Data Science
No ratings yet
Syllabus T.Y.B.Sc. Data Science
52 pages
CV SVD L04 P1 ImageTrasformations 1
No ratings yet
CV SVD L04 P1 ImageTrasformations 1
45 pages
Computer Vision
No ratings yet
Computer Vision
33 pages
2 Input Device
No ratings yet
2 Input Device
3 pages
CMAC Neural Networks
No ratings yet
CMAC Neural Networks
6 pages
Syllabus Udacity Default en Us
No ratings yet
Syllabus Udacity Default en Us
4 pages
Lecture 1
No ratings yet
Lecture 1
84 pages
Microsoft Research Plan
100% (1)
Microsoft Research Plan
20 pages
Lecture 01
No ratings yet
Lecture 01
79 pages
Connector For OneDrive For Business URG
No ratings yet
Connector For OneDrive For Business URG
16 pages
Deep Learning Lab Manual
No ratings yet
Deep Learning Lab Manual
69 pages
AD8703 BCV Unit II 2023
No ratings yet
AD8703 BCV Unit II 2023
67 pages
Syllabus-Topics in Computer Vision
100% (1)
Syllabus-Topics in Computer Vision
5 pages
Azhar Lecture Wise CV New
No ratings yet
Azhar Lecture Wise CV New
5 pages
CS7.505: Computer Vision: Spring 2022
No ratings yet
CS7.505: Computer Vision: Spring 2022
46 pages
CEN454 - Computer Vision and Machine Learning (Current)
No ratings yet
CEN454 - Computer Vision and Machine Learning (Current)
6 pages
Manual Paper Port 14
100% (1)
Manual Paper Port 14
62 pages
CV Lecture 1
No ratings yet
CV Lecture 1
65 pages
5 BCA - Electives Syllabus
No ratings yet
5 BCA - Electives Syllabus
10 pages
Ai and ML
No ratings yet
Ai and ML
6 pages
Iva Syb With Lab
No ratings yet
Iva Syb With Lab
3 pages
Lecture 1 AI Summary
No ratings yet
Lecture 1 AI Summary
31 pages
INT345 Computer Vision
No ratings yet
INT345 Computer Vision
31 pages
Seminar
No ratings yet
Seminar
23 pages
Sample Project Report
No ratings yet
Sample Project Report
26 pages
CV SVD L01 P1 Intro
No ratings yet
CV SVD L01 P1 Intro
35 pages
CV Unit 1
No ratings yet
CV Unit 1
17 pages
Week 9 Lecture Notes
No ratings yet
Week 9 Lecture Notes
27 pages
Cv2021-Lec1-Introduction 1600 PDF - Gdrive.vip
No ratings yet
Cv2021-Lec1-Introduction 1600 PDF - Gdrive.vip
61 pages
00 - Course Info - MSC
No ratings yet
00 - Course Info - MSC
12 pages
AD8703 BCV Unit I 2023
No ratings yet
AD8703 BCV Unit I 2023
65 pages
CV 2 Marks
No ratings yet
CV 2 Marks
11 pages
CV Digital Notes
No ratings yet
CV Digital Notes
77 pages
Camscanner User Manual LX
No ratings yet
Camscanner User Manual LX
15 pages
Expert PDF 15 - Official Site - Create, Modify, Convert & Protect Your PDFs
No ratings yet
Expert PDF 15 - Official Site - Create, Modify, Convert & Protect Your PDFs
10 pages
Skill Enhancement Course (SEC) Artificial Intelligence
No ratings yet
Skill Enhancement Course (SEC) Artificial Intelligence
54 pages
RVP Syllabus
No ratings yet
RVP Syllabus
4 pages
Computer Science Notes OBJECTIVE Chapter #1 Class XI
90% (49)
Computer Science Notes OBJECTIVE Chapter #1 Class XI
25 pages
Objectdetection
No ratings yet
Objectdetection
7 pages
RMK Group 21cs905 CV Unit 2
No ratings yet
RMK Group 21cs905 CV Unit 2
76 pages
Ali CV Updated
No ratings yet
Ali CV Updated
5 pages
Hardware Devices Form 3
No ratings yet
Hardware Devices Form 3
13 pages
Computer Vision I
No ratings yet
Computer Vision I
61 pages
ECT386 - Ktu Qbank
No ratings yet
ECT386 - Ktu Qbank
10 pages
CV #1 Course Introduction-1
No ratings yet
CV #1 Course Introduction-1
61 pages
RMK Group 21cs905 CV Unit 5
No ratings yet
RMK Group 21cs905 CV Unit 5
101 pages
Computer Vision ch1
No ratings yet
Computer Vision ch1
80 pages
Realtime Visual Recognition in Deep Convolutional Neural Networks
No ratings yet
Realtime Visual Recognition in Deep Convolutional Neural Networks
13 pages
3 2c735de418 Syllabus Computer Vision Modified
No ratings yet
3 2c735de418 Syllabus Computer Vision Modified
5 pages
CO Machine Vision
No ratings yet
CO Machine Vision
3 pages
Desktop Publishing
No ratings yet
Desktop Publishing
11 pages
Ocr On A Grid Infrastructure: Project Synopsis
No ratings yet
Ocr On A Grid Infrastructure: Project Synopsis
9 pages
Object Detection Using Deep Learning
No ratings yet
Object Detection Using Deep Learning
45 pages
Computer Vision 3-0-0-3 2016 Prerequisite: EC301 Digital Signal Processing Course Objectives
No ratings yet
Computer Vision 3-0-0-3 2016 Prerequisite: EC301 Digital Signal Processing Course Objectives
2 pages
Mastering OpenGL: From Basics to Advanced Rendering Techniques: OpenGL
From Everand
Mastering OpenGL: From Basics to Advanced Rendering Techniques: OpenGL
Kameron Hussain
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Computer Vision AIML Handout v1.0

Uploaded by

Computer Vision AIML Handout v1.0

Uploaded by

BIRLA INSTITUTE OF TECHNOLOGY & SCIENCE, PILANI

WORK INTEGRATED LEARNING PROGRAMMES

Reference Book(s) & other resources

1 Computer Vision ( 4 hrs)

2 Low-level Vision ( 3 hrs)

3 Mid-level Vision ( 4 hrs)

4 Object Segmentation ( 4 hrs)

5 Image Classification using Deep Learning ( 3 hrs)

6 Object detection and Recognition ( 3 hrs)

7 Object tracking ( 4 hrs)

8 Visual Bag of Words and Semantic Hierarchy ( 4 hrs)

9 Edge devices for computer vision ( 2 hrs)

Optional Modules to be taken in Experiential Learning / Webinars / Tutorials /

1 Face detection and Recognition

2 Optical Character Recognition

Edge detection using Sobel and Canny

Image segmentation using Kmeans

Fruit sorting using transfer learning

Mean shift clustering for object detection

Mean shift algorithm for object tracking

8 Patch classification in images 8

No Name Type Duration Weight Day, Date, Session, Time

EC-1(a) Quizzes Online 10%

EC-1(b) Assignments Take Home 20%

EC-2 Mid-Semester Test Closed Book 30%

EC-3 Comprehensive Exam Open Book 40%

Important links and information:

Elearn portal: https://elearn.bits-pilani.ac.in or Canvas

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.