ANN Presentation
Department of Information and Communication Engineering (ICE), BAUET
9/23/2022
Overfitting
•Overfitting occurs when a model becomes very good at classifying or
predicting on data that was included in the training set, but not so good at
classifying test data.
•An overfitted model is unable to generalize well: it has learned the features
of the training set extremely well, but if we give the model any data that
deviates even slightly from the exact data used during training, it is unable
to accurately predict the output.
Overfitting
• During training we get metrics for validation accuracy and loss as well as
training accuracy and loss. If the validation metrics are considerably worse
than the training metrics, that is an indication that our model is overfitting.
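The check described above can be sketched in a few lines. This is a minimal illustration with made-up per-epoch numbers; the `history` dictionary and the 0.1 threshold are assumptions, not values from the slides.

```python
# Hypothetical training history: per-epoch accuracy for training and validation.
# An overfitting model shows training accuracy climbing while validation stalls.
history = {
    "train_acc": [0.70, 0.85, 0.93, 0.97, 0.99],
    "val_acc":   [0.68, 0.78, 0.80, 0.79, 0.78],
}

def overfitting_gap(history, threshold=0.1):
    """Flag overfitting when final training accuracy exceeds final
    validation accuracy by more than `threshold`."""
    gap = history["train_acc"][-1] - history["val_acc"][-1]
    return gap, gap > threshold

gap, is_overfit = overfitting_gap(history)
print(f"train/val gap = {gap:.2f}, overfitting: {is_overfit}")
```

In practice one would plot both curves over all epochs rather than compare only the final values.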
Underfitting
•Underfitting is a scenario in data science where a model is unable to
capture the relationship between the input and output variables accurately,
generating a high error rate on both the training set and unseen data.
Underfitting
•An underfitted model does not have enough parameters to capture the trends
in the underlying system. Imagine, for example, that we have data that is
parabolic in nature, but we try to fit it with a linear function with just
one parameter.
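The parabolic example above can be demonstrated directly: a degree-1 fit to quadratic data leaves a large error, while a degree-2 fit captures it. A small numpy sketch (the data and sample size are arbitrary choices for illustration):

```python
import numpy as np

# Parabolic data: y = x^2
x = np.linspace(-3, 3, 50)
y = x ** 2

# Underfit: a degree-1 (linear) model cannot capture the curvature.
lin_coeffs = np.polyfit(x, y, deg=1)
lin_err = np.mean((np.polyval(lin_coeffs, x) - y) ** 2)

# A degree-2 model has enough parameters to fit the trend.
quad_coeffs = np.polyfit(x, y, deg=2)
quad_err = np.mean((np.polyval(quad_coeffs, x) - y) ** 2)

print(f"linear MSE: {lin_err:.3f}, quadratic MSE: {quad_err:.2e}")
```

The linear model's error stays high no matter how long we "train", which is the signature of underfitting.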
Simple Techniques to Prevent Overfitting
• Add more data: the model will have more to learn from the training set, and
with more data we hope to add more diversity to the training set.
• Data augmentation: modify existing samples by cropping, rotating, flipping,
or zooming; we'll cover more on the concept of data augmentation later.
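A minimal sketch of the augmentation idea, using simple numpy flips and rotations on a toy 2-D "image" (real pipelines would use a library with crops, zooms, and arbitrary rotations; the function name `augment` and the specific transforms chosen here are illustrative assumptions):

```python
import numpy as np

def augment(image, rng):
    """Return a randomly transformed copy of a 2-D image array:
    horizontal flip, vertical flip, or 90-degree rotation."""
    choice = rng.integers(3)
    if choice == 0:
        return np.fliplr(image)   # horizontal flip
    if choice == 1:
        return np.flipud(image)   # vertical flip
    return np.rot90(image)        # 90-degree rotation

rng = np.random.default_rng(0)
image = np.arange(16).reshape(4, 4)
# Each call yields a different view of the same underlying sample,
# adding diversity to the training set without collecting new data.
augmented = [augment(image, rng) for _ in range(4)]
```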
Simple Techniques to Prevent Overfitting
•Another technique for reducing overfitting is called dropout. The general
idea behind dropout is that, when added to a model, it will randomly ignore
some subset of nodes in a given layer during training.
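The "randomly ignore a subset of nodes" idea can be sketched as inverted dropout in numpy (the scaling by 1/(1-rate) is the standard trick so that the expected activation is unchanged; the rate of 0.5 is an arbitrary example value):

```python
import numpy as np

def dropout(activations, rate, rng, training=True):
    """Inverted dropout: during training, zero each activation with
    probability `rate` and scale survivors by 1/(1 - rate) so the
    expected activation stays the same. At inference, do nothing."""
    if not training or rate == 0.0:
        return activations
    keep = rng.random(activations.shape) >= rate   # random subset of nodes ignored
    return activations * keep / (1.0 - rate)

rng = np.random.default_rng(42)
layer_out = np.ones(10)
dropped = dropout(layer_out, rate=0.5, rng=rng)
print(dropped)  # each entry is either 0.0 (dropped) or 2.0 (kept and rescaled)
```

Because a different subset is dropped on every forward pass, no single node can become overly specialized to the training data.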
Backpropagation
•Feedforward pass
•Loss calculation
•Backpropagation
•Weight update
Backpropagation
Each node in the model receives its input from the previous layer, and this
input is a weighted sum: the weight at each connection multiplied by the
corresponding output from the previous layer.[1]
Backpropagation
The weighted sum is passed to an activation function, and the result from
this activation function is the output for that particular node; it is then
passed as part of the input for the nodes in the next layer. This happens
for each layer in the network until we reach the output layer, and this
process is called forward propagation.
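The weighted-sum-then-activation loop described above can be written compactly. This is a minimal sketch with an assumed 3-2-1 architecture, ReLU activations, and randomly chosen weights purely for illustration:

```python
import numpy as np

def relu(z):
    return np.maximum(0.0, z)

def forward(x, layers):
    """Forward propagation: at each layer the input is the weighted sum
    of the previous layer's output (plus a bias), passed through an
    activation function whose result feeds the next layer."""
    a = x
    for W, b in layers:
        z = W @ a + b        # weighted sum of the previous layer's output
        a = relu(z)          # activation output for this layer's nodes
    return a

rng = np.random.default_rng(0)
# Hypothetical 3-2-1 network: weights and biases are random placeholders.
layers = [(rng.standard_normal((2, 3)), np.zeros(2)),
          (rng.standard_normal((1, 2)), np.zeros(1))]
output = forward(np.array([1.0, 2.0, 3.0]), layers)
```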
Backpropagation
Backpropagation is the tool that gradient
descent uses to calculate the gradient of
the loss function.
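To make "calculate the gradient of the loss function" concrete, here is the smallest possible case: one weight, one data point, squared-error loss. The chain rule gives the gradient analytically, and a finite-difference check confirms it (the specific numbers are arbitrary examples):

```python
# A tiny one-weight model: prediction = w * x, loss = (w*x - y)^2.
# Backpropagation applies the chain rule: dL/dw = 2 * (w*x - y) * x.
def loss(w, x, y):
    return (w * x - y) ** 2

def grad_backprop(w, x, y):
    return 2 * (w * x - y) * x

w, x, y = 0.5, 2.0, 3.0
analytic = grad_backprop(w, x, y)   # 2 * (1.0 - 3.0) * 2.0 = -8.0

# Check against a numerical gradient (central finite differences).
eps = 1e-6
numeric = (loss(w + eps, x, y) - loss(w - eps, x, y)) / (2 * eps)
print(analytic, numeric)  # the two should agree closely
```

In a multi-layer network, backpropagation is exactly this chain rule applied layer by layer, from the output back toward the input.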
Backpropagation
In addition to updating weights to move in the desired direction (positive or
negative), backpropagation also works to update the weights in a manner that
reduces the loss function most efficiently.[2]
Gradient descent
• To find the derivative of the loss with respect to a weight:
• If all n data points are considered, it is (batch) gradient descent (computationally expensive, higher memory).
• One data point -> stochastic gradient descent.
• k < n data points -> mini-batch gradient descent.
(Figure: loss curves for gradient descent, mini-batch gradient descent, and stochastic gradient descent.)
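The three variants differ only in how many points feed each gradient step, which a single parameter can express. A sketch on a noiseless linear-regression toy problem (the learning rate, step count, and data are all illustrative assumptions):

```python
import numpy as np

def gradient(w, X, y):
    """Gradient of mean squared error for a linear model y_hat = X @ w."""
    return 2 * X.T @ (X @ w - y) / len(y)

def train(X, y, batch_size, steps=300, lr=0.05, seed=0):
    """batch_size = n      -> (batch) gradient descent
       batch_size = 1      -> stochastic gradient descent
       1 < batch_size < n  -> mini-batch gradient descent"""
    rng = np.random.default_rng(seed)
    w = np.zeros(X.shape[1])
    for _ in range(steps):
        idx = rng.choice(len(y), size=batch_size, replace=False)
        w -= lr * gradient(w, X[idx], y[idx])
    return w

rng = np.random.default_rng(1)
X = rng.standard_normal((100, 2))
true_w = np.array([2.0, -1.0])
y = X @ true_w

w_gd = train(X, y, batch_size=100)   # all n points per step
w_sgd = train(X, y, batch_size=1)    # one point per step
w_mini = train(X, y, batch_size=16)  # k < n points per step
```

All three recover the true weights here; they trade off compute and memory per step against the noisiness of each update, as the figure on this slide illustrates.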
Vanishing & Exploding Gradient
•Vanishing and exploding gradients: a problem resulting from backpropagation,
also called the unstable gradient problem.
•"Gradient" here means the gradient of the loss function with respect to the
weights.
•The problem mainly involves the earlier layers of the neural network.
•SGD updates each weight using the gradient of the loss with respect to that
weight.
•The gradient is calculated using backpropagation.
Vanishing & Exploding Gradient
•The gradient in the earlier layers of the network becomes very small,
vanishingly small: hence "vanishing gradient".
•The model uses the gradient value to update the weight; the weight gets
updated in a way that is proportional to the gradient. If the gradient is
vanishingly small, then this update is, in turn, going to be vanishingly
small as well.
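Why the early layers in particular? By the chain rule, the gradient for an early-layer weight is a product of per-layer terms, and sigmoid's derivative never exceeds 0.25, so the product shrinks exponentially with depth. A quick numerical illustration (10 layers is an arbitrary example depth):

```python
import numpy as np

def sigmoid_derivative(z):
    s = 1.0 / (1.0 + np.exp(-z))
    return s * (1.0 - s)

n_layers = 10
per_layer = sigmoid_derivative(0.0)             # 0.25, the maximum possible value
early_layer_gradient_factor = per_layer ** n_layers
print(early_layer_gradient_factor)              # on the order of 1e-6: the update barely moves the weight
```

Even in this best case (derivative at its maximum everywhere), ten layers shrink the gradient by a factor of about a million.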
Vanishing Gradient
• Therefore, if the newly updated value of the weight has barely moved from
its original value, the update is not really doing much for the network.
• As a result, this weight becomes stuck, never updating enough to get close
to its optimal value, which has implications for the remainder of the
network downstream of this weight and impairs the ability of the network
to learn well.
Exploding Gradient
Here the gradient does not vanish; rather, it explodes. Consider calculating
the gradient with respect to the same weight, but instead of really small
terms in the chain-rule product, suppose they were large: by large, we mean
greater than one.[2]
Exploding Gradient
Instead of barely moving the weight with this update, we greatly move it. So
much so that the optimal value for this weight is never achieved, because the
proportion by which the weight is updated at each epoch is just too large,
and the weight moves further and further away from its optimal value.
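The overshooting behaviour described above can be reproduced on the simplest possible loss, L(w) = w^2, whose optimum is w = 0. With an oversized effective step (the 1.1 learning rate is a deliberately bad choice for illustration), each update flips the sign of the weight and increases its magnitude:

```python
# With an exploding gradient, each update overshoots the optimum and the
# weight moves further away instead of converging.
w = 1.0
lr = 1.1              # effective step too large relative to the curvature
for _ in range(5):
    grad = 2 * w      # dL/dw for L(w) = w^2
    w -= lr * grad    # w <- w - 1.1 * 2w = -1.2 * w
print(w)              # magnitude grows every epoch: the weight diverges
```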
Ways to solve
• Change the activation function:
Certain activation functions, like the sigmoid function, squish a large input
space into a small output space between 0 and 1. Therefore, a large change in
the input of the sigmoid function causes only a small change in the output,
so the derivative becomes small.
Use ReLU instead: its derivative is 1 for positive inputs, so it does not
shrink the gradient.
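The contrast between the two derivatives is easy to see numerically (the sample points in `z` are arbitrary):

```python
import numpy as np

def sigmoid_grad(z):
    s = 1.0 / (1.0 + np.exp(-z))
    return s * (1.0 - s)

def relu_grad(z):
    return (z > 0).astype(float)

z = np.array([-4.0, -1.0, 1.0, 4.0])
print(sigmoid_grad(z))  # always <= 0.25, and tiny for large |z|
print(relu_grad(z))     # exactly 1 for positive inputs: no shrinkage
```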
Ways to solve
• Weight initialization: careful initialization is another way to reduce the
vanishing (and exploding) gradient problem.
Bias
• The values assigned to biases are learnable, just like the weights. Just as
stochastic gradient descent learns and updates the weights via
backpropagation during training, it also learns and updates the biases.
•Think of the bias at each neuron as having a role similar to that of a
threshold: the bias value helps determine whether or not the activation
output from a neuron is going to be propagated forward through the network.
•In other words, the bias determines whether, or by how much, a neuron will
fire.[3]
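The threshold role of the bias is visible in a single ReLU neuron: shifting the bias decides whether the weighted sum clears zero and propagates forward (the inputs, weights, and bias values here are arbitrary examples):

```python
import numpy as np

def neuron(x, w, bias):
    """A single ReLU neuron: the bias shifts the threshold at which the
    weighted sum produces a nonzero activation."""
    z = np.dot(w, x) + bias
    return max(0.0, z)

x = np.array([1.0, 2.0])
w = np.array([0.5, 0.5])        # weighted sum of inputs = 1.5

print(neuron(x, w, bias=0.0))   # 1.5 -> the neuron fires
print(neuron(x, w, bias=-2.0))  # -0.5 -> 0.0, the neuron does not fire
```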
Link
•To get a clear concept, this video playlist will be helpful:
•https://www.youtube.com/playlist?list=PLZbbT5o_s2xq7LwI2y8_QtvuXZedL6tQU
References
1. https://elitedatascience.com/overfitting-in-machine-learning
2. https://deeplizard.com/learn/video/qO_NLVjD6zE
3. https://deeplizard.com
Thank you