Convolutional Neural Network

Why CNNs?
Topics
General and biological motivation

Hand-coded to learnt filters

Understanding Convolution Operation

CNNs over Feed Forward Neural Networks

Different layers in a CNN (convolution, pooling, ReLU, etc.)

CNNs for Regression

CNN for Classification meets CNN for Regression


Biological motivation - Mammalian vision system.

Hubel and Wiesel (1959) Experimental setup


1981 Nobel Prize

Suggested a ‘hierarchy’ of feature detectors in the mammalian visual cortex.


Biological motivation - Mammalian vision system.

Simple cells:
1. Activity characterized by a linear function of the image.
2. Operate in a spatially localized (SL) receptive field.
3. Each set responds to edges of a different orientation.

Complex cells:
1. Operate in a large SL receptive field.
2. Receive input from lower-level simple cells.

Hyper-complex cells:
1. Larger receptive field.
2. Receive input from lower-level complex cells.
Biological motivation - Grandmother cell
The grandmother cell, proposed by cognitive scientist Jerry Lettvin in 1969, is a hypothetical neuron that represents a complex but specific concept or object.
Biological motivation - CNN.
Back-propagation [Lang and Hinton, 1988], and modern CNN [LeCun et al., 1989]

CNN proposed by LeCun et al. for document recognition.


CNN for document recognition [LeCun et al., 1989].

All images are 28x28 grayscale.

60k training examples.

10k test examples.

Output value is an integer from 0 to 9.

[Figure: network architecture showing Input, Layer 1, Layer 3, and Layer 5]
CNN for document recognition [LeCun et al., 1989].

Invariances demonstrated: translation, rotation, scale, squeeze, stroke-width, and noise.


Then why didn't DL take off in the '90s?

LeCun, Y., Boser, B., Denker, J. S., Henderson, D., Howard, R. E., Hubbard, W., et al. "Backpropagation applied to handwritten zip code recognition." Neural Computation 1(4), 541-551, 1989.

LeCun, Y., Boser, B., Denker, J. S., Henderson, D., Howard, R. E., Hubbard, W., et al. "Handwritten digit recognition with a back-propagation network." Advances in Neural Information Processing Systems 2 (NIPS 1989), 396-404.
Then why didn't DL take off in the '90s?

1. Limited big data availability.

2. Limited computational power to crunch the data.
Why is DL trending now?

Big data availability:
1. One trillion images.
2. 350 million images uploaded per day.
3. 100 hours of video uploaded per minute.
4. 2.5 petabytes of data every minute.

Computational power to crunch data:
1. Parallel processing units - GPUs.


When/how was deep learning reclaimed?
Traditional ML
Topics
General and biological motivation

Hand-coded to learnt filters

Understanding Convolution Operation

CNNs over Feed Forward Neural Networks

Different layers in a CNN (convolution, pooling, ReLU, etc.)

CNNs for Regression

CNN for Classification meets CNN for Regression


Traditional machine learning

Pipeline: Raw data → Feature extraction → Classifier/detector (clustering, shallow neural network, etc.) → Result (detection, speaker ID, speech translation, machine translation, etc.)
Features: Classical

1. Filter banks
2. Edges and corners: Sobel, LoG, and Canny
3. PCA/subspaces
4. Histograms of responses
5. Different transforms (Fourier/wavelet)
Deep learning: Training phase

A labelled dataset (usually millions of examples) and its labels are fed through a network with unoptimized weights; errors are backpropagated to optimize the weights.

Deep learning: Deployment

The network (with trained weights) is applied to the task, e.g., pedestrian detection (for automatic braking).

Traditional ML vs Deep learning: Face detection
Traditional machine learning
Traditional ML vs Deep learning: Face detection
Deep learning

Face: p = 0.94

Input → low-level features → mid-level features → high-level features → output node


Deep learning benefits over traditional ML
Robust
1. No need to design the features ahead of time – features are automatically learned to be optimal for the task at hand.
2. Robustness to natural variations in the data is automatically learned.

Generalizable
1. The same neural net approach can be used for many different applications and data types (e.g., hand-crafted face features cannot be used for pedestrian detection, whereas the same CNN architecture can be used for both).

Scalable
1. Performance improves with more data, and can be leveraged by massive parallelization on GPUs.
Topics
General and biological motivation

Hand-coded to learnt filters

Understanding Convolution Operation

CNNs over Feed Forward Neural Networks

Different layers in a CNN (convolution, pooling, ReLU, etc.)

CNNs for Regression

CNN for Classification meets CNN for Regression


What is a Convolution operation?
Image representation
Convolution operation detecting edges
Convolution operations: Examples
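As a concrete illustration of the convolution operation detecting edges, here is a minimal NumPy sketch (not from the slides; the toy image and the hand-coded Sobel-style kernel are illustrative assumptions):

```python
import numpy as np

def conv2d(image, kernel):
    """Valid 2D convolution (implemented as cross-correlation, as in most CNN libraries)."""
    kh, kw = kernel.shape
    oh, ow = image.shape[0] - kh + 1, image.shape[1] - kw + 1
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

# Toy 6x6 image: dark left half, bright right half (a vertical edge).
image = np.zeros((6, 6))
image[:, 3:] = 1.0

# Hand-coded Sobel-style kernel that responds to vertical edges.
sobel_x = np.array([[-1, 0, 1],
                    [-2, 0, 2],
                    [-1, 0, 1]], dtype=float)

print(conv2d(image, sobel_x))  # large responses only in the columns around the edge
```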
Topics
General and biological motivation

Hand-coded to learnt filters

Understanding Convolution Operation

CNNs over Feed Forward Neural Networks

Different layers in a CNN (convolution, pooling, ReLU, etc.)

CNNs for Regression

CNNs for Classification meets CNNs for Regression


CNNs over Feed Forward Neural Networks

Multi-layer neural network

CNNs are multi-layer neural networks with two constraints:

1. Local connectivity
2. Parameter sharing
Intuition behind CNN (over MLP)
CNNs are multi-layer neural networks with two constraints:
1. Local connectivity:
a. Can extract elementary features such as edges, end-points, and corners.
b. These features are combined by subsequent layers to detect higher-order features.

2. Parameter sharing:
a. Elementary feature detectors useful in one part of an image may be useful in other parts as well.
CNN: Local connectivity (LC)
Hidden layer (3 nodes)

Input layer (7 nodes)

MLNN (7 x 3 = 21 parameters) vs. MLNN-LC (3 x 3 = 9 parameters)

MLNN-LC is 2.3x more runtime- and storage-efficient.

In general, for a level with m input nodes, n output nodes, and local connectivity of k nodes (k < m):

MLNN:
1. m x n parameters to store.
2. O(m x n) runtime.

MLNN-LC:
1. k x n parameters to store.
2. O(k x n) runtime.
CNN: Parameter sharing (PS)

MLNN (21 parameters) vs. MLNN-LC (3 x 3 = 9 parameters) vs. MLNN-LC-PS (3 parameters)

MLNN-LC is 2.3x more runtime- and storage-efficient; MLNN-LC-PS is 2.3x faster and 7x more storage-efficient.

In general, for a level with m input nodes, n output nodes, and local connectivity of k nodes (k < m):

MLNN:
1. m x n parameters to store.
2. O(m x n) runtime.

MLNN-LC:
1. k x n parameters to store.
2. O(k x n) runtime.

MLNN-LC-PS:
1. k parameters to store.
2. O(k x n) runtime.
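A small sketch (mine, not from the slides) that makes the parameter counts above concrete for the 7-input / 3-output example with a local window of k = 3:

```python
# Parameter counts for a layer with m inputs, n outputs, and local window k.
def param_counts(m, n, k):
    fully_connected = m * n          # MLNN: every input connects to every output
    locally_connected = k * n        # MLNN-LC: each output sees only k inputs
    shared = k                       # MLNN-LC-PS: one shared filter of size k
    return fully_connected, locally_connected, shared

print(param_counts(m=7, n=3, k=3))   # (21, 9, 3), matching the slide example
```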
CNN with multiple input channels

[Figure: single input channel vs. two input channels]

CNN with multiple output maps

[Figure: single input map vs. two output maps]

A generic level of CNN

[Figure: a generic CNN level combines local connectivity and parameter sharing across multiple input channels and output maps]
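To make the channel/map bookkeeping of a generic CNN level concrete, here is a hedged NumPy sketch; the channel count, map count, filter size, and spatial size are my own illustrative choices, not values from the slides:

```python
import numpy as np

D1, K, F = 3, 8, 3          # input channels, output maps, filter size (illustrative)
H, W = 32, 32               # spatial size of the input (illustrative)

x = np.random.randn(D1, H, W)           # one input volume
filters = np.random.randn(K, D1, F, F)  # K filters, each spanning all D1 channels
biases = np.zeros(K)

# Valid convolution: each output map sums contributions from every input channel.
Ho, Wo = H - F + 1, W - F + 1
out = np.zeros((K, Ho, Wo))
for k in range(K):
    for i in range(Ho):
        for j in range(Wo):
            out[k, i, j] = np.sum(x[:, i:i + F, j:j + F] * filters[k]) + biases[k]

print(out.shape)  # (8, 30, 30): K output maps, each spatially shrunk by F - 1
```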


Topics
General and biological motivation

Hand-coded to learnt filters

Understanding Convolution Operation

CNNs over Feed Forward Neural Networks

Different layers in a CNN (convolution, pooling, ReLU, etc.)

CNNs for Regression

CNN for Classification meets CNN for Regression


Different layers of CNN architecture
CNN: Convolutional layer

1. To reduce the number of weights (through local connectivity).


2. To provide spatial invariance (through parameter sharing).
A closer look at CNN filters.
Hyperparameters for the convolutional layer.
1. Zero padding (to control the spatial size of the output).

Without padding (i.e., [0,0]) vs. with padding [2,2]


Hyperparameters for the convolutional layer.
2. Stride (to produce smaller output volumes spatially).

Without stride (i.e., [1,1]) vs. with stride [2,2]


Hyperparameters for the convolutional layer.
Both padding and stride

Without padding and stride vs. with padding [1,1] & stride [2,2]
CONVOLUTIONAL LAYER
1. Accepts a volume of size W1 X H1 X D1.
2. Requires four hyperparameters:
a. Number of filters K
b. their spatial extent F
c. their stride S
d. the amount of zero padding P
3. Produces an output volume of size W2 X H2 X D2 where:
W2=(W1−F+2P)/S+1, H2=(H1−F+2P)/S+1, D2=K
4. With parameter sharing, it introduces F⋅F⋅D1 weights per filter, for a total of
(F⋅F⋅D1)⋅K weights and K biases.
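A quick sanity check of the output-size and parameter formulas above, as a small sketch (the example numbers are mine, chosen only for illustration):

```python
# Output size of a conv layer: W2 = (W1 - F + 2P)/S + 1, depth D2 = K.
def conv_output_size(w1, h1, d1, k, f, s, p):
    w2 = (w1 - f + 2 * p) // s + 1
    h2 = (h1 - f + 2 * p) // s + 1
    d2 = k                        # depth equals the number of filters
    weights = (f * f * d1) * k    # shared weights across spatial positions
    biases = k
    return (w2, h2, d2), weights, biases

# Example: a 227x227x3 input, 96 filters of size 11, stride 4, no padding
# (AlexNet-like numbers, used here purely as an illustration).
print(conv_output_size(227, 227, 3, k=96, f=11, s=4, p=0))
# ((55, 55, 96), 34848, 96)
```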
Hyperparameters for the convolutional layer.
Dilation

Vanilla convolution vs. with dilation
CONVOLUTIONAL LAYER
1. Accepts a volume of size W1 X H1 X D1.
2. Requires four hyperparameters:
a. Number of filters K
b. their spatial extent F
c. their stride S
d. the amount of zero padding P
3. Produces an output volume of size W2 X H2 X D2 where:
W2=(W1−F+2P)/S+1, H2=(H1−F+2P)/S+1, D2=K — Exercise
4. With parameter sharing, it introduces F⋅F⋅D1 weights per filter, for a total of
(F⋅F⋅D1)⋅K weights and K biases.
Different layers of CNN architecture
CNN: Pooling layer

1. To reduce the spatial size of the representation, which reduces the number of parameters and the amount of computation in the network.
2. Average pooling or L2 pooling can also be used, but they are not as popular as max pooling.
POOLING LAYER
1. Accepts a volume of size W1 X H1 X D1.
2. Requires two hyperparameters:
a. their spatial extent F
b. their stride S
(zero padding is uncommon for pooling; commonly P = 0)
3. Produces an output volume of size W2 X H2 X D2 where:
W2=(W1−F+2P)/S+1, H2=(H1−F+2P)/S+1, D2=D1
4. Introduces zero parameters since it computes a fixed function of the input.
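A minimal max-pooling sketch (mine, not from the slides), assuming the common 2x2 window with stride 2:

```python
import numpy as np

def max_pool2d(x, f=2, s=2):
    """Max pooling over each channel independently; x has shape (D, H, W)."""
    d, h, w = x.shape
    ho, wo = (h - f) // s + 1, (w - f) // s + 1
    out = np.zeros((d, ho, wo))
    for i in range(ho):
        for j in range(wo):
            out[:, i, j] = x[:, i * s:i * s + f, j * s:j * s + f].max(axis=(1, 2))
    return out

x = np.random.randn(3, 8, 8)
print(max_pool2d(x).shape)  # (3, 4, 4): depth unchanged, spatial size halved
```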
Different layers of CNN architecture
Recap: Gradient descent
Recap: Backpropagation
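The recap slides carry only their titles here; as a minimal, hedged sketch of the update rule that backpropagated gradients feed into, here is one-variable gradient descent on an illustrative quadratic loss (the loss, starting point, and learning rate are all assumed values):

```python
# Gradient descent on an illustrative loss L(w) = (w - 3)^2.
def grad(w):
    return 2.0 * (w - 3.0)   # dL/dw

w, lr = 0.0, 0.1             # initial weight and learning rate (assumed)
for _ in range(100):
    w -= lr * grad(w)        # w <- w - lr * dL/dw

print(round(w, 4))           # ~3.0, the minimizer of the loss
```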
Activation functions: Sigmoidal function

Drawback 1: Sigmoids saturate and kill gradients (when the neuron's activation saturates at either tail, 0 or 1).
Gradient ≈ 0 ⇒ weights fail to update during back-propagation.
Activation functions: Rectified Linear Unit (very popular).

[Figure: ~6x improvement in convergence for ReLU vs. tanh]

Advantage 1: Eliminates saturation and killing of gradients (in one direction).

Advantage 2: Sigmoid neurons involve expensive operations (exponentials, etc.), whereas ReLU can be implemented by simply thresholding activations at zero.
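For reference, a small sketch (mine) of the two activations discussed above, showing why saturated sigmoids kill gradients while ReLU does not for positive inputs:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sigmoid_grad(x):
    s = sigmoid(x)
    return s * (1.0 - s)          # ~0 when |x| is large: the gradient is "killed"

def relu(x):
    return np.maximum(0.0, x)     # simple thresholding at zero

def relu_grad(x):
    return (x > 0).astype(float)  # 1 for positive inputs, 0 otherwise

x = np.array([-10.0, 0.0, 10.0])
print(sigmoid_grad(x))  # [~0, 0.25, ~0] -> saturates at both tails
print(relu_grad(x))     # [0, 0, 1]      -> no saturation for positive inputs
```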
Different layers of CNN architecture
Flattening, fully connected (FC) layer and softmax
Flattening
1. Vectorization (converting an M x N x D tensor to an MND x 1 vector).

FC layer
1. Multilayer perceptron.
2. Generally used in the final layers to classify the object.
3. Plays the role of a classifier.

Softmax layer
1. Normalizes the output into discrete class probabilities.
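A hedged sketch (the tensor shapes and class count are illustrative, not from the slides) of flattening, a fully connected layer, and softmax chained together:

```python
import numpy as np

def softmax(z):
    z = z - z.max()               # numerical stability
    e = np.exp(z)
    return e / e.sum()            # discrete class probabilities summing to 1

M, N, D, num_classes = 4, 4, 16, 10          # illustrative sizes
feature_maps = np.random.randn(M, N, D)      # output of the last conv/pool layer

x = feature_maps.reshape(-1)                 # flatten: M*N*D vector
W = np.random.randn(num_classes, x.size) * 0.01
b = np.zeros(num_classes)

logits = W @ x + b                           # fully connected layer
probs = softmax(logits)
print(round(probs.sum(), 6), probs.argmax()) # 1.0 and the predicted class index
```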
Cross-entropy Loss

What we want?
Cross-entropy Loss?
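Only the slide titles survive here, so as a hedged sketch (mine), the cross-entropy loss for a one-hot target and softmax probabilities:

```python
import numpy as np

def cross_entropy(probs, label):
    """probs: softmax output; label: integer index of the true class (one-hot target)."""
    return -np.log(probs[label] + 1e-12)   # small epsilon guards against log(0)

probs = np.array([0.1, 0.7, 0.2])          # illustrative predicted probabilities
print(cross_entropy(probs, label=1))       # ~0.357: correct class favored -> low loss
print(cross_entropy(probs, label=2))       # ~1.609: wrong class favored -> high loss
```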
A Real Life Application
Different layers of CNN architecture: A Review
Training very deep networks: ResNet
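The ResNet slides carry only their titles here; as a hedged sketch of the core idea (a residual/skip connection that makes very deep networks trainable), here is a minimal residual block. The layer sizes and the identity shortcut are my own illustrative choices.

```python
import numpy as np

def relu(x):
    return np.maximum(0.0, x)

def residual_block(x, W1, W2):
    """y = relu(F(x) + x): the block learns a residual F(x) on top of the identity."""
    out = relu(W1 @ x)        # first transformation
    out = W2 @ out            # second transformation (no activation yet)
    return relu(out + x)      # skip connection: add the input back, then activate

d = 16                         # feature dimension (illustrative)
x = np.random.randn(d)
W1 = np.random.randn(d, d) * 0.1
W2 = np.random.randn(d, d) * 0.1
print(residual_block(x, W1, W2).shape)   # (16,): same shape as the input
```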
