
DL IT324a 3

The document discusses regularization techniques for neural networks to combat overfitting, which occurs when a model performs well on training data but poorly on unseen data. It highlights two common methods: L2 regularization, which penalizes large weights to simplify the model, and dropout, which randomly removes nodes to prevent reliance on specific features. The document also notes the limitations of deep learning, including slow training times and complex model parameters.


Regularization of Neural Networks

Dinesh K. Vishwakarma, Ph.D.
Professor, Department of Information Technology
Delhi Technological University, Delhi.

Webpage: http://www.dtu.ac.in/Web/Departments/InformationTechnology/faculty/dkvishwakarma.php
Regularization
 Regularization is applied to improve the performance of an NN.
 An NN may perform incredibly well on the training set, but not nearly as well on the test set.
 Such an NN has very high variance and cannot generalize well to data it has not been trained on.
 These are signs of overfitting.
Solution of Overfitting
 Get more data
 Use regularization
Getting more data is sometimes impossible,
and other times very expensive.
Therefore, regularization is a common method
to reduce overfitting and consequently improve
the model’s performance.



Solution of Overfitting…
 The two most common approaches used to regularize an NN are:
  L2 regularization
  Dropout



L2 regularization
 The (unregularized) cost function can be defined as

J(w^{[1]}, b^{[1]}, \ldots, w^{[L]}, b^{[L]}) = \frac{1}{m} \sum_{i=1}^{m} \mathcal{L}(\hat{y}_i, y_i)

 where 𝓛 is a loss function such as the cross-entropy loss.
 In L2 regularization, a component is added to this cost that penalizes large weights.



L2 regularization…
 Lambda (λ) is the regularization parameter; the added penalty term uses the Frobenius norm of the weight matrices, denoted by the subscript F.
 λ is a hyperparameter that can be tuned:
Larger weight values are penalized more heavily when λ is large.
Similarly, for a smaller value of λ, the regularization effect is smaller.
 This makes sense, because the cost function must be minimized.
 By adding the squared norm of each weight matrix, multiplied by the regularization parameter, large weights are driven down in order to minimize the cost function.
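As an illustrative sketch (not taken from the slides; the function names and the list-of-matrices representation of the weights are assumptions for the example), the penalty and its effect on the gradient can be computed as follows:

import numpy as np

def l2_penalty(weights, lam, m):
    # Sum of squared Frobenius norms of all weight matrices, scaled by lambda/(2m).
    return (lam / (2 * m)) * sum(np.sum(W ** 2) for W in weights)

def regularized_cost(cross_entropy_cost, weights, lam, m):
    # Unregularized cost plus the L2 penalty.
    return cross_entropy_cost + l2_penalty(weights, lam, m)

# During backpropagation, each weight gradient gains an extra (lambda/m) * W term,
# which is what drives large weights toward smaller values ("weight decay"):
# dW = dW_from_backprop + (lam / m) * W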
How Regularization Works?
 Adding the regularization component drives the values of the weight matrix down. This effectively de-correlates the NN.
 Recall that we feed the activation function the following weighted sum: z = w^T x + b.
 By reducing the values in the weight matrix, z is also reduced, which in turn decreases the effect of the activation function.
 Therefore, a less complex function is fit to the data, effectively reducing overfitting.
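A small sketch of this effect (illustrative only; the input values and scaling factors are assumed), showing that scaling the weights down shrinks z and keeps a tanh activation in its near-linear region, i.e. a simpler effective function:

import numpy as np

np.random.seed(0)
x = np.random.randn(4)            # example input (assumed values)
w_large = np.random.randn(4) * 3  # large weights
w_small = w_large * 0.1           # the same weights after regularization drives them down
b = 0.5

z_large = w_large @ x + b
z_small = w_small @ x + b

# With a small z, tanh(z) is close to z itself (its near-linear region),
# so each unit behaves almost linearly and the overall network is less complex.
print(z_large, np.tanh(z_large))
print(z_small, np.tanh(z_small))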



Dropout Regularization
 Dropout involves going over all the layers in a neural network and setting, for each layer, a probability of keeping each node.
 The input layer and the output layer are kept the same.
 Whether an individual node is kept is decided at random; only the threshold is fixed: the keep probability that determines the chance a node is retained.
 For example, if you set the threshold to 0.8, then there is a 20% probability that a node will be removed from the network.
 Therefore, this results in a much smaller and simpler neural network (a minimal sketch follows).
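As a minimal sketch of how this can be implemented, using inverted dropout with assumed activation shapes and a keep probability of 0.8 (not taken from the slides):

import numpy as np

def dropout_forward(a, keep_prob=0.8, training=True):
    # Apply (inverted) dropout to the activations of a hidden layer.
    # Each unit is kept with probability keep_prob; kept activations are scaled
    # by 1/keep_prob so the expected value of the layer's output is unchanged.
    if not training:
        return a                      # no dropout at test time
    mask = (np.random.rand(*a.shape) < keep_prob)
    return (a * mask) / keep_prob

# Example: hidden-layer activations for a batch of 5 examples, 10 units each.
a_hidden = np.random.randn(5, 10)
a_dropped = dropout_forward(a_hidden, keep_prob=0.8)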



Dropout Regularization…

 Dropout means that the NN cannot rely on any single input node, since each node has a random probability of being removed. Therefore, the NN will be reluctant to give high weights to certain features, because they might disappear.
 Consequently, the weights are spread across all features, making them smaller. This effectively shrinks the model and regularizes it.
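In practice, most deep learning frameworks provide dropout as a layer. A short illustrative usage example with tf.keras (the layer sizes are assumptions, not from the slides; note that the Keras rate argument is the fraction of units dropped, not kept):

import tensorflow as tf

# rate=0.2 drops 20% of the units, i.e. a keep probability of 0.8.
model = tf.keras.Sequential([
    tf.keras.layers.Dense(128, activation="relu", input_shape=(784,)),
    tf.keras.layers.Dropout(0.2),
    tf.keras.layers.Dense(10, activation="softmax"),
])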



 https://towardsdatascience.com/how-to-improve-a-neural-network-with-regularization-8a18ecda9fe3



When to use Deep Learning?
 Data size is large
 High-end infrastructure is available
 Lack of domain understanding
 Complex problems such as image classification, speech recognition, etc.

[Figure: performance vs. amount of data, comparing deep learning with traditional machine learning.]

"Fuel of deep learning is the big data." — Andrew Ng
Limitations of Deep Learning
 Very slow to train
 Models are very complex, with many parameters to optimize:
Initialization of weights
Layer-wise training algorithm
Neural architecture (a hedged sketch follows this list)
• Number of layers
• Size of layers
• Type – regular, pooling, max pooling, softmax
Fine-tuning of weights using back propagation
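To make those architecture choices concrete, here is a minimal tf.keras sketch (illustrative only; the input shape, layer sizes, and layer counts are assumptions) that fixes the number of layers, their sizes, and their types (regular/fully connected, convolution, max pooling, softmax):

import tensorflow as tf

# Assumed toy architecture for 28x28 grayscale images and 10 classes.
model = tf.keras.Sequential([
    tf.keras.layers.Conv2D(32, (3, 3), activation="relu", input_shape=(28, 28, 1)),
    tf.keras.layers.MaxPooling2D((2, 2)),           # max pooling layer
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(64, activation="relu"),   # "regular" fully connected layer
    tf.keras.layers.Dense(10, activation="softmax"),
])

# Weights are initialized automatically (Glorot initialization by default)
# and fine-tuned with backpropagation when model.fit(...) is called.
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")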



Thank you!
dinesh@dtu.ac.in


