Convolutional Neural Networks (CNN) : Convolutions
applying filters to the input data; essentially, the filters replace the weights, with the different kernel convolution operations being responsible for the filter effect
the same weights are used for different parts of the image; intuitively, if a feature of one image is interesting, it will probably also be interesting in another image
Convolutions
convolve (German: falten, "to fold") = applying a filter to a function; filter in the sense of a matrix/grid of values that alters the output of a given function
sliding the filter kernel from left to right, multiplying and summing up all overlapping fields
applying the same filter to all pixels of an image is the idea of weight sharing
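As a minimal sketch of this sliding-window operation (illustrative NumPy code, not from the lecture; the averaging kernel is just an example):

```python
import numpy as np

def conv2d(image, kernel):
    # Slide the kernel over the image; at every position, multiply the
    # overlapping fields elementwise and sum them up (valid mode, stride 1).
    # As usual in deep learning, the kernel is not flipped.
    H, W = image.shape
    F, _ = kernel.shape
    out = np.zeros((H - F + 1, W - F + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(image[i:i+F, j:j+F] * kernel)
    return out

image = np.arange(25, dtype=float).reshape(5, 5)  # 5x5 input
kernel = np.ones((3, 3)) / 9.0                    # 3x3 averaging filter
print(conv2d(image, kernel).shape)                # (3, 3)
```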
Convolution on Images
image of 5x5 with a convolutional filter of size 3x3 generating an output of size 3x3
same procedure as before: slide filter over image and apply filter through dot product at every position, resulting in z_i = w^T x_i + b
where the weights w represent the (flattened) filter and x_i the flattened image patch, both of dimension (5·5·3)×1 for e.g. a 5x5x3 filter on a 3-channel image; note that each output z_i is a scalar (dimension 1)
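A sketch of this dot-product view, assuming a 5x5x3 filter sliding over a 32x32x3 image (variable names are illustrative):

```python
import numpy as np

image = np.random.rand(32, 32, 3)   # H x W x C input
w = np.random.rand(5 * 5 * 3)       # 5x5x3 filter flattened to (5*5*3)x1
b = 0.1

z = np.zeros((28, 28))              # 32 - 5 + 1 = 28 positions per axis
for i in range(28):
    for j in range(28):
        x_i = image[i:i+5, j:j+5, :].ravel()  # flattened image patch
        z[i, j] = w @ x_i + b                 # z_i = w^T x_i + b, a scalar
```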
Convolution Layer
def.: applying different filters to the same image; every filter we apply to the image creates a new activation map, e.g. applying 2 filters (of size 5x5) to a 32 x 32 x 3 image results in a 28 x 28 x 2 output volume
layer defined by filter width & height; the depth is implicitly given by the input depth (the dot product runs over all input channels)
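The same example as a short PyTorch sketch (the 5x5 filter size is an assumption implied by 32 → 28):

```python
import torch
import torch.nn as nn

x = torch.randn(1, 3, 32, 32)    # one 32x32x3 image in NCHW layout
conv = nn.Conv2d(in_channels=3,  # depth implicitly given by the input
                 out_channels=2, # 2 filters -> 2 activation maps
                 kernel_size=5)  # 5x5 filters, no padding
print(conv(x).shape)             # torch.Size([1, 2, 28, 28])
```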
without padding the outputs shrink with every layer, which is not a good idea
padding assures that corner pixels are considered as well and that image sizes don't shrink as quickly as they otherwise would → most common padding: zero-padding, leading to output size:
((N + 2·P − F) / S + 1) × ((N + 2·P − F) / S + 1)
N: width of the image
F: width of the filter
P: amount of padding; with stride 1, P should usually be set to P = (F − 1)/2 to preserve the input size
S: stride
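A quick sanity check of the output-size formula (hypothetical helper, not from the lecture):

```python
def conv_output_size(N, F, P, S):
    # (N + 2*P - F) / S + 1 per spatial dimension
    return (N + 2 * P - F) // S + 1

print(conv_output_size(N=32, F=5, P=0, S=1))  # 28 -> no padding shrinks the output
print(conv_output_size(N=32, F=5, P=2, S=1))  # 32 -> P = (F-1)/2 preserves the size
```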
Pooling
another operator heavily used in CNNs
using padding assures that the images don't shrink as we apply the filters; pooling allows us to shrink the feature maps anyway, but only when required → reducing feature map size
Different ways:
Max Pooling: define equally sized regions within the input and then create a new pooled output consisting of the highest number from each corresponding input region
if more than one highest number exists within a region, just take either one
Average Pooling: averaging all values of a region instead of taking the max value
conv layer = feature extraction, computing a feature in a given region; pooling layer = feature selection, picking the strongest activation in a region
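Both pooling variants as a short PyTorch sketch (region size 2x2 is just an example):

```python
import torch
import torch.nn as nn

x = torch.randn(1, 2, 28, 28)                    # feature maps from a conv layer
max_pool = nn.MaxPool2d(kernel_size=2, stride=2) # strongest activation per region
avg_pool = nn.AvgPool2d(kernel_size=2, stride=2) # average value per region
print(max_pool(x).shape)                         # torch.Size([1, 2, 14, 14])
print(avg_pool(x).shape)                         # torch.Size([1, 2, 14, 14])
```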
Other properties
CNN prototype; an FC layer applies brute force, connecting everything with everything, not using shared weights and thus not applying the convolutional inductive bias
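To make the brute-force point concrete, a comparison of parameter counts for the 32x32x3 example from above (layer sizes are illustrative):

```python
import torch.nn as nn

fc = nn.Linear(32 * 32 * 3, 28 * 28 * 2)  # everything connected with everything
conv = nn.Conv2d(3, 2, kernel_size=5)     # shared 5x5 weights
print(sum(p.numel() for p in fc.parameters()))    # 4818464 (~4.8M)
print(sum(p.numel() for p in conv.parameters()))  # 152
```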
Receptive Field
describes the region of input pixels from which a pixel of a feature map has been computed (through the stacked dot products)
the deeper one goes into a network, the bigger the receptive field becomes
preferably, use more layers with smaller filters (e.g. 3 layers with filter size 3x3) as this also injects more non-linearity (with every additional layer) and uses fewer weights → less overfitting
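A small check of how the receptive field grows when stacking 3x3, stride-1 convolutions (hypothetical helper):

```python
def receptive_field(num_layers, kernel_size=3):
    # stride 1 throughout: every extra layer adds (kernel_size - 1) pixels
    return 1 + num_layers * (kernel_size - 1)

print(receptive_field(3))  # 7 -> three 3x3 layers see as much as one 7x7 filter,
                           # with 3*9 = 27 weights per channel pair instead of 49
```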
Classic Architectures
LeNet
top-1 score: checking if the sample's top class, i.e. the one with the highest predicted probability, is the same as the target label
AlexNet
1000 outputs for 1000 classes: in order to get from the spatial data of size 6x6x256 to the class scores, we use fully connected layers converting the data into 9216 values, then 4096, again 4096, and finally 1000
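The fully connected head described above as a PyTorch sketch (the ReLUs between the linear layers are an assumption):

```python
import torch.nn as nn

head = nn.Sequential(
    nn.Flatten(),               # 6x6x256 spatial data -> 9216 values
    nn.Linear(9216, 4096), nn.ReLU(),
    nn.Linear(4096, 4096), nn.ReLU(),
    nn.Linear(4096, 1000),      # 1000 outputs for 1000 classes
)
```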
VGGNet simplifies AlexNet by fixing CONV = 3x3 filters with stride 1 & MAXPOOL = 2x2 filters with stride 2
again switching between CONV & POOL in 16 layers; again width & height decrease and the number of filters increases as we go deeper, resulting in 138 million parameters
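The parameter count can be verified with torchvision's reference implementation:

```python
import torchvision.models as models

vgg16 = models.vgg16()
print(sum(p.numel() for p in vgg16.parameters()))  # 138357544 -> ~138 million
```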
Residual Block - how can we train very deep nets (i.e. more layers) while
keeping training stable?
ResNet Block
ResNets come with a set of good network design choices - mostly used in computer vision networks to classify images
if the weights of the residual branch become 0, the output of layer L+1 will be equal to the input from layer L-1, so nothing changes; without such skip connections, vanishing gradients are the reason why we can't have an unlimited number of main layers
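A minimal residual block sketch in PyTorch (simplified; the real ResNet block also uses batch normalization):

```python
import torch
import torch.nn as nn

class ResidualBlock(nn.Module):
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1)
        self.relu = nn.ReLU()

    def forward(self, x):
        residual = self.conv2(self.relu(self.conv1(x)))
        # if the residual branch outputs 0, the block reduces to the identity
        return self.relu(x + residual)

print(ResidualBlock(64)(torch.randn(1, 64, 8, 8)).shape)  # shape unchanged
```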
1x1 Convolutions
on a single channel it simply scales the input by a constant while keeping its dimensions; across many channels it computes a per-position dot product over the depth, so with fewer filters than input channels it reduces the depth of the volume
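For example, scaling down the depth of a volume (channel counts are illustrative):

```python
import torch
import torch.nn as nn

x = torch.randn(1, 256, 28, 28)             # deep input volume
reduce = nn.Conv2d(256, 64, kernel_size=1)  # 1x1 convolution over the channels
print(reduce(x).shape)                      # torch.Size([1, 64, 28, 28])
```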
Inception Layer
core idea: too many feature maps result in huge computational costs; reduce the number of channels with 1x1 convolutions
finding the perfect number of filters → choose them all: same convolutions with different sizes + 3x3 max pooling with stride 1, concatenated (see the sketch below)
GoogLeNet uses inception blocks with an extra max pool layer added to reduce dimensionality
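A simplified inception block sketch (branch widths are illustrative; the real GoogLeNet block additionally places 1x1 reductions before the larger filters):

```python
import torch
import torch.nn as nn

class InceptionBlock(nn.Module):
    def __init__(self, in_ch):
        super().__init__()
        # "choose them all": parallel same convolutions of different sizes
        self.b1 = nn.Conv2d(in_ch, 16, kernel_size=1)
        self.b2 = nn.Conv2d(in_ch, 16, kernel_size=3, padding=1)
        self.b3 = nn.Conv2d(in_ch, 16, kernel_size=5, padding=2)
        self.b4 = nn.MaxPool2d(kernel_size=3, stride=1, padding=1)

    def forward(self, x):
        # concatenate all branches along the channel dimension
        return torch.cat([self.b1(x), self.b2(x), self.b3(x), self.b4(x)], dim=1)

x = torch.randn(1, 32, 28, 28)
print(InceptionBlock(32)(x).shape)  # torch.Size([1, 80, 28, 28]) -> 16+16+16+32
```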
a fully convolutional network takes the activation/feature maps in the last few layers and turns that information into a classification result, performing unpooling to recover spatial resolution
U-Net
from left (contraction path, i.e. encoder) to right (expansion path, i.e. decoder): performing a series of convolutions (feature extraction) and pooling operations (feature selection) → during encoding we lose spatial detail, therefore the encoder results are copied to the decoder such that it also has the previous information
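A heavily simplified encoder/decoder sketch with one skip connection (real U-Nets stack several such levels):

```python
import torch
import torch.nn as nn

class TinyUNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.enc = nn.Conv2d(1, 16, 3, padding=1)          # feature extraction
        self.pool = nn.MaxPool2d(2)                        # feature selection, loses detail
        self.up = nn.ConvTranspose2d(16, 16, 2, stride=2)  # expansion path
        self.dec = nn.Conv2d(32, 1, 3, padding=1)          # 32 = upsampled + copied features

    def forward(self, x):
        skip = self.enc(x)                # encoder result, copied to the decoder
        y = self.up(self.pool(skip))
        y = torch.cat([y, skip], dim=1)   # hand the lost spatial detail back
        return self.dec(y)

print(TinyUNet()(torch.randn(1, 1, 64, 64)).shape)  # torch.Size([1, 1, 64, 64])
```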