
UNIT-V

Introduction to Deep learning


UNIT-V
Introduction to Deep learning
• Analyze the key computations underlying deep learning.

• Convolutional Neural Network, Building blocks of CNN:
Convolutional layers, Pooling layers, Dense layers.

• Case study using Jetson Nano board.

• Reference:
• Deep Learning, Ian Goodfellow, Yoshua Bengio, Aaron Courville, The MIT Press, 2016.
Convolutional Networks
• Convolutional networks are also known as convolutional neural networks or CNNs.

• CNNs are a specialized kind of neural network for processing data that has a known, grid-like topology.
• Examples:
• time-series data, which can be thought of as a 1D grid taking samples at regular time intervals
• image data, which can be thought of as a 2D grid of pixels.

• CNNs use a convolution operation.
• Convolution is a specialized kind of linear operation.
• CNNs use convolution in place of general matrix multiplication in at least one of their layers.

• The operation used in a CNN does not correspond precisely to the definition of convolution as used in other fields such as engineering or pure mathematics.
Convolution Operation
• Suppose we are tracking the location of a spaceship with a laser sensor.
• Our laser sensor provides a single output x(t), the position of the spaceship at time t.
• Both x and t are real-valued, i.e., we can get a different reading from the laser sensor at any instant in time.

• To obtain a less noisy estimate of the spaceship’s position, we would like to average together several measurements.
• Because more recent measurements are more relevant, we will want this to be a weighted average that gives more weight to recent measurements.
• We can do this with a weighting function w(a), where ‘a’ is the age of a measurement.

• If we apply such a weighted average operation at every moment, we obtain a new function ‘s’ providing a smoothed estimate of the position of the spaceship:
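• In symbols (following the reference, Goodfellow et al., 2016), the smoothed estimate is

s(t) = ∫ x(a) w(t − a) da

where w should be a valid probability density function that is zero for all negative arguments, so that only current and past measurements are averaged.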

• This operation is called convolution.


• The convolution operation is typically denoted with an asterisk:
• the function ‘x’ is referred to as the input
• the function ‘w’ is referred to as the kernel
• the output is referred to as the feature map

• If ‘x’ and ‘w’ are defined only on integer time index ‘t’, we can define the discrete convolution:
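• In the asterisk notation, s(t) = (x ∗ w)(t); for integer ‘t’ the discrete convolution (again following the reference) is

s(t) = (x ∗ w)(t) = Σ_a x(a) w(t − a), with the sum running over all integer values of a.

• A minimal NumPy sketch of this smoothing; the position readings and the 3-tap weighting kernel are made-up values chosen only to illustrate the weighted average:

import numpy as np

# Hypothetical noisy position readings x(t) and a 3-tap weighting kernel w
# that puts the most weight on the most recent measurement (illustrative values).
x = np.array([2.0, 2.1, 2.3, 2.2, 2.5, 2.4, 2.6])
w = np.array([0.5, 0.3, 0.2])            # weights for ages 0, 1, 2; they sum to 1

# np.convolve flips w relative to x, so this is the textbook convolution (x * w).
s = np.convolve(x, w, mode='valid')      # smoothed position estimates
print(s)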
• In practice, we often use convolutions over more than one axis at a time.
• For example, if we use a two-dimensional image ‘I’ as our input, we probably also want to use a two-dimensional kernel ‘K’:
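• Written out (following the reference), the two-dimensional convolution is

S(i, j) = (I ∗ K)(i, j) = Σ_m Σ_n I(m, n) K(i − m, j − n)

and, because convolution is commutative, this is equivalent to the flipped-kernel form

S(i, j) = (K ∗ I)(i, j) = Σ_m Σ_n I(i − m, j − n) K(m, n),

which is usually the more straightforward one to implement.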

• The commutative property of convolution arises because we have flipped the kernel relative to the input.

• The only reason to flip the kernel is to obtain the commutative property.

• The commutative property is useful for writing proofs, but it is NOT an important property of a neural network implementation.
• Instead, many neural network libraries implement a related function called cross-correlation, which is the same as convolution but without flipping the kernel:

• Many machine learning libraries implement cross-correlation but call it convolution.

• We will follow this convention of calling both operations convolution, and specify whether we mean to flip the kernel or not in contexts where kernel flipping is relevant.
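• In two dimensions, the cross-correlation used by these libraries is S(i, j) = Σ_m Σ_n I(i + m, j + n) K(m, n), i.e., the kernel is not flipped.

• A small NumPy/SciPy sketch of the difference; the image and kernel values are arbitrary, and scipy.signal is assumed to be available:

import numpy as np
from scipy.signal import convolve2d, correlate2d

I = np.arange(16, dtype=float).reshape(4, 4)   # toy 4x4 "image"
K = np.array([[1.0, 0.0],
              [0.0, -1.0]])                    # toy 2x2 kernel

conv = convolve2d(I, K, mode='valid')          # true convolution (kernel flipped)
xcorr = correlate2d(I, K, mode='valid')        # cross-correlation (no flip)

# Convolution equals cross-correlation with a 180-degree flipped kernel.
print(np.allclose(conv, correlate2d(I, K[::-1, ::-1], mode='valid')))  # True

• Because the kernel values are learned during training, it makes little practical difference which of the two operations a library implements.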
• Figure below presents an example of convolution (without kernel flipping) applied
to a 2-D tensor.
• Discrete convolution can be viewed as multiplication by a matrix.
• For example, for univariate discrete convolution,
• each row of the matrix is constrained to be equal to the row above shifted by
one element.
• This is known as a Toeplitz matrix.

• In two dimensions, a doubly block circulant matrix corresponds to convolution.
• In addition to these constraints that several elements be equal to each
other, convolution usually corresponds to a very sparse matrix (a matrix
whose entries are mostly equal to zero).
• This is because the kernel is usually much smaller than the input image.

• Convolution works with inputs of variable size.

• Any neural network algorithm that works with matrix multiplication and
does not depend on specific properties of the matrix structure should
work with convolution, without requiring any further changes to the
neural network.
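• A small sketch of this matrix view for univariate ‘valid’ convolution (the input and kernel values are arbitrary): each row of the matrix C below is the row above shifted by one element, most entries are zero, and multiplying by C reproduces np.convolve.

import numpy as np

x = np.array([1.0, 2.0, 0.0, -1.0, 3.0])   # input of length m = 5
w = np.array([0.5, 0.3, 0.2])              # kernel of length k = 3

# Build the (m - k + 1) x m convolution matrix: Toeplitz structure, mostly zeros.
rows = len(x) - len(w) + 1
C = np.zeros((rows, len(x)))
for i in range(rows):
    C[i, i:i + len(w)] = w[::-1]           # flipped kernel, for true convolution

print(np.allclose(C @ x, np.convolve(x, w, mode='valid')))  # True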
• Convolution leverages three important ideas that can help improve a machine
learning system:
• sparse interactions
• parameter sharing
• equivariant representations

• Sparse interactions (sparse connectivity or sparse weights)


• Traditional Neural Networks:
• Use dense matrix multiplication.
• Every output unit interacts with every input unit.
• High memory and computational costs:
• Parameters: m×n (‘m’ inputs and ‘n’ outputs)
• Runtime (per example): O(m×n)

• Convolutional Networks:
• Use sparse interactions (sparse connectivity/weights).
• Small kernels scan local regions (e.g., edges in images).
• Advantages:
• Fewer parameters to store.
• Reduced memory and computational requirements.
• Improved statistical efficiency.
• Efficiency:
• Parameters: k×n (where k≪m)
• Runtime: O(k×n)
• For graphical demonstrations of sparse connectivity, see the figures below:
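• A back-of-the-envelope comparison of the counts listed above, with hypothetical sizes (a one-megapixel input and output and a 3x3 kernel):

m, n, k = 1_000_000, 1_000_000, 9        # hypothetical: 1 MP input, 1 MP output, 3x3 kernel

dense_params  = m * n                    # fully connected layer: every output sees every input
sparse_params = k * n                    # sparse connectivity: each output sees only k inputs
shared_params = k                        # with parameter sharing (next slide), one kernel is stored

print(dense_params, sparse_params, shared_params)   # 10**12 vs 9 * 10**6 vs 9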
• Parameter sharing
• refers to using the same parameter for more than one
function in a model.

• In a traditional neural net,


• each element of the weight matrix is used exactly once when
computing the output of a layer.
• It is multiplied by one element of the input and then never revisited.

• In a convolutional neural net,


• each member of the kernel is used at every position of the input.
• The parameter sharing used by the convolution operation means that
rather than learning a separate set of parameters for every location,
we learn only one set.
• This does NOT affect the runtime of forward propagation—
it is still O(k x n)—but it does further reduce the storage requirements of
the model to k parameters.
• As an example of both of these first two principles in action, the figure shows how sparse
connectivity and parameter sharing can dramatically improve the efficiency of a linear function
for detecting edges in an image.
• Equivariant representations

• To say a function is equivariant means that if the input changes, the output changes in the same way.

• A function f(x) is equivariant to a function g if f(g(x)) = g(f(x)).

• In the case of convolution, if we let g be any function that translates the input, i.e., shifts it, then the convolution function is equivariant to g.

• For example, let I be a function giving image brightness at integer coordinates.
• Let g be a function mapping one image function to another image function, such that I’=g(I) is the image function with I’(x,y)=I(x−1,y).

• This shifts every pixel of I one unit to the right.

• If we apply this transformation to I and then apply convolution, the result will be the same as if we applied convolution to I and then applied the transformation ‘g’ to the output.
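• A quick NumPy check of this equivariance; the image and kernel values are random, np.roll plays the role of the shift g, and circular ‘wrap’ boundaries are used so that the equality is exact at the borders:

import numpy as np
from scipy.signal import correlate2d

I = np.random.rand(6, 6)                 # toy image
K = np.random.rand(3, 3)                 # toy kernel
g = lambda img: np.roll(img, 1, axis=1)  # shift every pixel one unit to the right

# Convolving the shifted image gives the same result as shifting the convolved image.
a = correlate2d(g(I), K, mode='same', boundary='wrap')
b = g(correlate2d(I, K, mode='same', boundary='wrap'))
print(np.allclose(a, b))                 # True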
• When processing time series data,
• the convolution produces a sort of timeline that shows when different features appear in
the input.
• If we move an event later in time in the input, the exact same representation of it will
appear in the output, just later in time.

• With images,
• convolution creates a 2-D map of where certain features appear in the input.
• If we move the object in the input, its representation will move the same amount in the
output.

• This is useful when we know that some function of a small number of neighboring pixels is useful when applied to multiple input locations.
• For example, when processing images, it is useful to detect edges in the first layer of a
convolutional network.
• The same edges appear more or less everywhere in the image, so it is practical to share
parameters across the entire image.

• In some cases, we may NOT wish to share parameters across the entire image.
• For example, if we are processing images that are cropped to be centered on an
individual’s face, we probably want to extract different features at different locations—
the part of the network processing the top of the face needs to look for eyebrows, while
the part of the network processing the bottom of the face needs to look for a chin.
Pooling
• A typical layer of a convolutional network consists of three stages (Figure below ).
• In the first stage, the layer performs several convolutions in parallel to produce a set of linear activations.
• In the second stage (detector stage), each linear activation is run through a nonlinear activation function
(ex: ReLU).
• In the third stage, we use a pooling function to modify the output of the layer further.
• A pooling function replaces the output of the net at a certain location with a summary statistic of the nearby
outputs.
• For example, the max pooling operation reports the maximum output within a rectangular neighborhood.

• Pooling helps to make the representation approximately invariant to small translations of the input.
• Invariance to translation means that if we translate the input by a small amount, the values of most of the
pooled outputs do NOT change.
• See the figure above for an example of how this works.
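• A 1-D sketch of this behaviour; the detector-stage values are invented. Shifting the input moves every detector output, yet several of the max-pooled values stay the same:

import numpy as np

def max_pool_1d(a, width=3):
    # Max over a sliding window of the given width, stride 1.
    return np.array([a[i:i + width].max() for i in range(len(a) - width + 1)])

detector = np.array([0.1, 1.0, 0.2, 0.1, 0.0, 0.3])   # toy detector-stage outputs
shifted = np.roll(detector, 1)                        # circular shift stands in for a small translation

print(max_pool_1d(detector))   # [1.  1.  0.2 0.3]
print(max_pool_1d(shifted))    # [1.  1.  1.  0.2] -- the first two pooled values are unchanged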

• Invariance to local translation can be a very useful property if we care more about whether some feature is present than exactly where it is.
• For example,
• when determining whether an image contains a face, we need NOT know the location of the eyes with pixel-perfect accuracy, we just need to know that there is an eye on the left side of the face and an eye on the right side of the face.
• In other contexts, it is more important to preserve the location of a feature. For example,
• if we want to find a corner defined by two edges meeting at a specific orientation, we need to preserve the location of the edges well enough to test whether they meet.
• The use of pooling can be viewed as adding an infinitely strong prior that the function the layer learns must be invariant to small translations.
• When this assumption is correct, it can greatly improve the statistical efficiency of the network.
• Pooling over spatial regions produces invariance to translation, but if we pool over the
outputs of separately parametrized convolutions, the features can learn which
transformations to become invariant to (see figure below).
• Because pooling summarizes the responses over a whole neighborhood, it is possible to use
fewer pooling units than detector units, by reporting summary statistics for pooling regions
spaced k pixels apart rather than 1 pixel apart.

• Example (Figure below).


• This improves the computational efficiency of the network because the next layer has roughly k times
fewer inputs to process.
• When the number of parameters in the next layer is a function of its input size (such as when the next
layer is fully connected and based on matrix multiplication) this reduction in the input size can also
result in improved statistical efficiency and reduced memory requirements for storing the
parameters.
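• A 1-D sketch of pooling regions spaced k = 2 apart (the feature-map values are invented); the pooled output has roughly k times fewer values for the next layer to process:

import numpy as np

def max_pool_strided(a, width=2, stride=2):
    # One summary statistic per pooling region, with regions spaced `stride` apart.
    return np.array([a[i:i + width].max()
                     for i in range(0, len(a) - width + 1, stride)])

feature_map = np.array([0.1, 1.0, 0.2, 0.1, 0.0, 0.3, 0.4, 0.2])
pooled = max_pool_strided(feature_map)
print(len(feature_map), '->', len(pooled), pooled)   # 8 -> 4 [1.  0.2 0.3 0.4]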
• Some examples of complete convolutional network architectures for classification using convolution and pooling are shown in the figure below:
The importance of CNNs
• CNNs are distinguished from classic machine learning
algorithms such as SVMs and decision trees by their ability to
autonomously extract features at a large scale, bypassing the
need for manual feature engineering and thereby enhancing
efficiency.

• The convolutional layers grant CNNs their translation-invariant characteristics, empowering them to identify and extract patterns and features from data irrespective of variations in position, orientation, scale, or translation.

• A variety of pre-trained CNN architectures, including VGG-16, ResNet50, Inceptionv3, and EfficientNet, have demonstrated top-tier performance. These models can be adapted to new tasks with relatively little data through a process known as fine-tuning.

• Beyond image classification tasks, CNNs can be applied to other domains, such as natural language processing and time series.
Source: An Introduction to Convolutional Neural Networks: A Comprehensive Guide to CNNs in Deep Learning | DataCamp
Key Components of a CNN

• Convolutional layers
• Activation layer (Rectified Linear Unit)
• Pooling layers
• Fully connected layers (Dense layers)
Example: Architecture of a CNN applied to digit recognition
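• A minimal sketch of such a digit-recognition architecture, written with the Keras API purely for illustration; the filter counts, kernel sizes, and the 28x28 grayscale input are assumptions, not values taken from the figure:

import tensorflow as tf
from tensorflow.keras import layers

# Convolution -> ReLU -> pooling blocks, then flatten and dense layers with a softmax output.
model = tf.keras.Sequential([
    layers.Conv2D(32, (3, 3), activation='relu', input_shape=(28, 28, 1)),
    layers.MaxPooling2D((2, 2)),
    layers.Conv2D(64, (3, 3), activation='relu'),
    layers.MaxPooling2D((2, 2)),
    layers.Flatten(),
    layers.Dense(64, activation='relu'),
    layers.Dense(10, activation='softmax'),   # one probability per digit class 0-9
])
model.summary()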
Convolution layers

• Convolution is the application of a sliding window function to a matrix of pixels representing an image.

• The sliding function applied to the matrix is called a kernel or filter.

• Several filters of equal size are applied, and each filter is used to recognize a specific pattern from the image (e.g., curving of the digits, edges, whole shape of the digits).

• For example,
• one filter might be good at finding straight lines,
• another might find curves, and so on.
• Let’s consider a 32x32 grayscale image (0 = black to 255 = white) of a handwritten digit.
• The convolution operation is performed by applying the dot product, and works as follows (a code sketch of these steps is given after the list):

1. Apply the kernel matrix from the top-left corner to the right.
2. Perform element-wise multiplication.
3. Sum the values of the products.
4. The resulting value corresponds to the first value (top-left corner) in the convoluted matrix.
5. Move the kernel down with respect to the size of the sliding window.
6. Repeat steps 1 to 5 until the image matrix is fully covered.
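• A direct sketch of steps 1 to 6 in NumPy (stride 1, no padding; like most libraries, it does not flip the kernel):

import numpy as np

def convolve2d_valid(image, kernel):
    kh, kw = kernel.shape
    out_h = image.shape[0] - kh + 1          # the output shrinks by (kernel size - 1)
    out_w = image.shape[1] - kw + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):                   # slide the kernel down the image...
        for j in range(out_w):               # ...and across each row
            window = image[i:i + kh, j:j + kw]
            out[i, j] = np.sum(window * kernel)   # element-wise multiply, then sum
    return out

image = np.random.randint(0, 256, size=(32, 32)).astype(float)   # toy 32x32 grayscale digit
kernel = np.random.rand(3, 3)                                     # toy 3x3 filter
print(convolve2d_valid(image, kernel).shape)                      # (30, 30)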
• The dimension of the convoluted matrix depends on the size of the sliding window.
• The larger the sliding window, the smaller the dimension.

• Another name associated with the kernel in the literature is the feature detector, because its weights can be fine-tuned to detect specific features in the input image.
• For instance:
• A kernel that averages neighboring pixels can be used to blur the input image.
• A kernel that subtracts neighboring pixels is used to perform edge detection.

• The more convolution layers the network has, the better it is at detecting more abstract features.
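• A sketch of the two kernels mentioned above, applied with SciPy; the 3x3 kernel values and the random image are illustrative choices:

import numpy as np
from scipy.signal import correlate2d

blur_kernel = np.ones((3, 3)) / 9.0           # average the neighbouring pixels -> blur
edge_kernel = np.array([[-1.0, -1.0, -1.0],   # subtract neighbours from the centre pixel
                        [-1.0,  8.0, -1.0],   # -> responds strongly at edges
                        [-1.0, -1.0, -1.0]])

image = np.random.rand(8, 8) * 255.0                        # toy grayscale image
blurred = correlate2d(image, blur_kernel, mode='valid')     # smoothed image
edges = correlate2d(image, edge_kernel, mode='valid')       # edge map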
• Activation function

• A Rectified Linear Unit (ReLU) activation function is applied after each convolution operation.

• This function helps the network learn non-linear relationships between the features in the image, hence making the network more robust for identifying different patterns.

• It also helps to mitigate the vanishing gradient problem.
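• ReLU itself is just f(x) = max(0, x); a one-line NumPy sketch:

import numpy as np

relu = lambda z: np.maximum(0.0, z)              # negative activations become 0, positives pass through
print(relu(np.array([-2.0, -0.5, 0.0, 1.5])))    # [0.  0.  0.  1.5]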
• Pooling layer

• The goal of the pooling layer is to pull the most significant features from the convoluted matrix.
• This is done by applying aggregation operations, which reduce the dimension of the feature map (convoluted matrix), hence reducing the memory used while training the network.
• Pooling is also relevant for mitigating overfitting.

• The most common aggregation functions that can be applied are:
• Max pooling, which takes the maximum value of each region of the feature map
• Sum pooling, which corresponds to the sum of all the values of the region
• Average pooling, which is the average of all the values.

• The last pooling layer flattens its feature map so that it can be processed by the fully connected layer.
• Fully connected layers

• These are the last layers of the convolutional neural network, and their inputs correspond to the flattened one-dimensional matrix generated by the last pooling layer.

• ReLU activation functions are applied to them for non-linearity.

• Finally, a softmax prediction layer is used to generate probability values for each of the possible output labels, and the final label predicted is the one with the highest probability score.
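• A short sketch of the final softmax step; the logit values are invented:

import numpy as np

logits = np.array([2.0, 1.0, 0.1])             # hypothetical outputs of the last dense layer
probs = np.exp(logits) / np.exp(logits).sum()  # softmax: non-negative values that sum to 1
print(probs, '-> predicted label:', probs.argmax())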
Example
https://youtu.be/Y1qxI-Df4Lk?si=mPbNJvO5iglUvJ4z
• If Stride = 2 and Padding = 1:
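• With padding p and stride s, an n x n input convolved with an f x f kernel gives an output of size ⌊(n + 2p − f) / s⌋ + 1 per side. For instance, with a hypothetical 5x5 input, a 3x3 kernel, stride 2 and padding 1: ⌊(5 + 2·1 − 3) / 2⌋ + 1 = 3, i.e., a 3x3 output.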
