Shallow Networks Versus Deep Networks
When we hear the name Neural Network, we tend to imagine a model with many hidden
layers, but there is also a type of neural network with only a few hidden layers. Shallow
neural networks consist of only 1 or 2 hidden layers. Understanding a shallow neural network
gives us insight into what exactly is going on inside a deep neural network. The figure
below shows a shallow neural network with 1 input layer, 1 hidden layer and 1 output layer.
Shallow vs. deep is a matter of degree. Logistic regression is a very shallow model, as it has
only one layer (remember, we don't count the input as a layer):
A deeper neural network simply has more hidden layers:
These are some of the notations which we will be using in the upcoming sections. Keep them in
mind as we proceed, or just quickly hop back here in case you miss something.
We can vectorize these steps for ‘m’ training examples as shown below:
Z[l] = W[l] A[l-1] + B[l]
A[l] = g[l](Z[l])
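As a concrete illustration, here is a minimal NumPy sketch of this vectorized forward step for a single layer. The function names and the choice of ReLU as the activation g[l] are assumptions made for illustration, not part of the original notation.

    import numpy as np

    def relu(Z):
        # element-wise ReLU activation
        return np.maximum(0, Z)

    def layer_forward(A_prev, W, b):
        # A_prev: activations of the previous layer, shape (n[l-1], m)
        # W:      weights of this layer,             shape (n[l], n[l-1])
        # b:      biases of this layer,              shape (n[l], 1)
        Z = np.dot(W, A_prev) + b   # Z[l] = W[l] A[l-1] + B[l], bias broadcast over the m examples
        A = relu(Z)                 # A[l] = g[l](Z[l])
        return Z, A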
The outputs of one layer act as the inputs to the next layer. We can't compute the forward
propagation for all the layers of a neural network without a for loop, so it's fine to have a for loop
here. Before moving further, let's look at the dimensions of the various matrices, which will help
us understand these steps better.
Can you figure out the number of layers (L) in this neural network? You are correct if you
guessed 5. There are 4 hidden layers and 1 output layer. The units in each layer are:
where ‘m’ is the number of training examples. These are some of the generalized matrix
dimensions, which will help you run your code smoothly.
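To make these dimensions concrete, here is a small sketch that initializes parameters with the generalized shapes W[l]: (n[l], n[l-1]) and b[l]: (n[l], 1), then runs the layer-by-layer for loop and checks that Z[l] and A[l] come out with shape (n[l], m). The specific layer sizes are assumed purely for illustration.

    import numpy as np

    # assumed layer sizes: n[0] inputs, 4 hidden layers, 1 output unit (L = 5)
    layer_dims = [8, 5, 5, 4, 3, 1]
    m = 10                                       # number of training examples

    params = {}
    for l in range(1, len(layer_dims)):
        params["W" + str(l)] = np.random.randn(layer_dims[l], layer_dims[l - 1]) * 0.01
        params["b" + str(l)] = np.zeros((layer_dims[l], 1))

    A = np.random.randn(layer_dims[0], m)        # A[0] = X, shape (n[0], m)
    for l in range(1, len(layer_dims)):
        Z = np.dot(params["W" + str(l)], A) + params["b" + str(l)]
        A = np.maximum(0, Z)                     # ReLU used here just as an example activation
        assert Z.shape == A.shape == (layer_dims[l], m)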
We have seen some of the basics of deep neural networks up to this point. But why do we need
deep representations?
Deep neural networks find relations within the data, from simpler to more complex relations. The
first hidden layer might be trying to find simple functions, like identifying the edges in the
above image. As we go deeper into the network, these simple functions combine to form more
complex functions, like identifying a face. Some common examples of leveraging a deep neural
network are:
Face recognition
o Image ==> Edges ==> Face parts ==> Faces ==> Desired face
Audio recognition
o Audio ==> Low-level sound features (like "sss", "bb") ==> Phonemes ==> Words ==> Sentences
Input: a[l-1]
Output: a[l]
This layer first calculates z[l], on which the activation is applied. This z[l] is saved as the cache.
For the backward propagation step, the block takes da[l], i.e., the derivative of the activation at
layer l, and uses it (together with the cache) to calculate dz[l], the derivative of the weights dw[l],
db[l], and finally da[l-1]. Let's visualize these steps to reduce the complexity:
This is how each block (layer) of a deep neural network works. Next, we will see how to
implement all of these blocks.
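Below is a minimal sketch of one such block, assuming a ReLU activation. The forward function saves the quantities needed later (including z[l]) as the cache, and the backward function consumes da[l] together with that cache to produce da[l-1], dw[l] and db[l]. The function names are assumptions made for illustration.

    import numpy as np

    def block_forward(A_prev, W, b):
        # forward pass of one layer: input a[l-1], output a[l]
        Z = np.dot(W, A_prev) + b
        A = np.maximum(0, Z)                          # ReLU activation
        cache = (A_prev, W, Z)                        # saved for the backward pass
        return A, cache

    def block_backward(dA, cache):
        # backward pass of one layer: input da[l], outputs da[l-1], dw[l], db[l]
        A_prev, W, Z = cache
        m = A_prev.shape[1]
        dZ = dA * (Z > 0)                             # dz[l]: ReLU derivative is 1 where Z > 0
        dW = np.dot(dZ, A_prev.T) / m                 # dw[l]
        db = np.sum(dZ, axis=1, keepdims=True) / m    # db[l]
        dA_prev = np.dot(W.T, dZ)                     # da[l-1]
        return dA_prev, dW, db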
For the forward pass, we will calculate Z and A for each layer of the network:
Z[l] = W[l] A[l-1] + B[l]
A[l] = g[l](Z[l])
After calculating the activations, the next step is backward propagation, where we update the
weights using the derivatives. The input for backward propagation is da[l] and the outputs are
da[l-1], dW[l] and db[l]. Let's look at the vectorized equations for backward propagation:
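In the same notation as the forward pass (with * denoting element-wise multiplication and g[l]' the derivative of the activation at layer l), the standard vectorized equations are:
dZ[l] = dA[l] * g[l]'(Z[l])
dW[l] = (1/m) dZ[l] A[l-1]^T
db[l] = (1/m) (sum of the columns of dZ[l])
dA[l-1] = W[l]^T dZ[l]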
Deep Neural Networks perform surprisingly well (maybe not so surprising if you’ve used them
before!). Running only a few lines of code gives us satisfactory results. This is because we are
feeding a large amount of data to the network and it is learning from that data using the hidden
layers.
Choosing the right hyperparameters helps us make our model more efficient. The parameters of a
deep neural network are W and b, which the model updates during the backward propagation step.
On the other hand, there are a lot of hyperparameters for a deep NN, including the following (a
small sketch of collecting them in one place appears after the list):
Learning rate – α
Number of iterations
Number of hidden layers
Units in each hidden layer
Choice of activation function
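As a small illustration, these hyperparameters are often collected in one place and passed to whatever training routine you use. The specific values and the train_model name below are assumptions for illustration only, not recommendations.

    # hypothetical example of gathering the hyperparameters listed above
    hyperparameters = {
        "learning_rate": 0.01,               # alpha
        "num_iterations": 2500,              # number of gradient-descent iterations
        "layer_dims": [8, 5, 5, 4, 3, 1],    # number of hidden layers and units in each
        "hidden_activation": "relu",         # choice of activation function
    }

    # parameters = train_model(X_train, Y_train, **hyperparameters)   # hypothetical call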