Neural Networks and Fuzzy Systems: Multi-Layer Feed Forward Networks
3
Why do we need Multi-Layer Feed-Forward Networks (MLFF)?
[Figure: example decision regions (e.g., Region 2) that a single straight decision boundary cannot separate]
4
Why do we need Multi-Layer Feed-Forward Networks (MLFF)?
• Second suggestion:
• In some cases we need a curved decision boundary, or we are trying to solve more complicated classification and regression problems.
• So, we need to (see the sketch after this list):
• Add more layers.
• Increase the number of neurons in each layer.
• Use a non-linear activation function in the hidden layers.
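As an illustration (a minimal sketch, not taken from the slides): a tiny network with one non-linear hidden layer can represent the XOR function, which no single-layer perceptron can. All weight values below are hand-picked assumptions for illustration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Hand-picked (illustrative) weights for a 2-2-1 network that approximates XOR.
W1 = np.array([[ 20.0,  20.0],    # hidden unit 1: roughly an OR of the inputs
               [-20.0, -20.0]])   # hidden unit 2: roughly a NAND of the inputs
b1 = np.array([-10.0, 30.0])
W2 = np.array([[20.0, 20.0]])     # output unit: roughly an AND of the two hidden units
b2 = np.array([-30.0])

def forward(x):
    a1 = sigmoid(W1 @ x + b1)     # hidden layer (non-linear activation)
    y  = sigmoid(W2 @ a1 + b2)    # output layer
    return y

for x in [np.array([0., 0.]), np.array([0., 1.]),
          np.array([1., 0.]), np.array([1., 1.])]:
    print(x, np.round(forward(x), 3))   # outputs close to 0, 1, 1, 0 (XOR)
```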
5
Notation for Multi-Layer Networks
• Dealing with multi-layer networks is easy if a sensible notation is adopted.
• We simply need another label (n) to tell us which layer in the network we are dealing with.
• Each unit j in layer n receives activations from the previous layer of
processing units and sends activations to the next layer of units.
6
ANN Representation
(1 input layer + 1 hidden layer + 1 output layer)
[Figure: a fully connected network; inputs x_i = a_i^(0) in layer (0), hidden units (z_j^(1) | a_j^(1)) in layer (1), and output units (z_k^(2) | a_k^(2)) with y_k = a_k^(2) in layer (2), connected by weights w^(1) and w^(2)]
• Notation, consistent with the figure (w_ij^(n) connects unit i in layer n−1 to unit j in layer n):
  x_i = a_i^(0)
  z_j^(1) = Σ_i w_ij^(1) · a_i^(0) + b_j^(1),   a_j^(1) = σ(z_j^(1))
  z_k^(2) = Σ_j w_jk^(2) · a_j^(1) + b_k^(2),   y_k = a_k^(2) = σ(z_k^(2))
• Example:
  y_1 = σ( Σ_j w_j1^(2) · σ( Σ_i w_ij^(1) · x_i + b_j^(1) ) + b_1^(2) )
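A minimal NumPy sketch of this forward pass; the layer sizes (2 inputs, 3 hidden units, 2 outputs), the random weights, and the input values are assumptions for illustration, not values taken from the figure.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Illustrative 2-3-2 network.
# W1[i, j] is the weight from input i to hidden unit j  (w_ij^(1)).
W1 = rng.normal(size=(2, 3))
b1 = np.zeros(3)
# W2[j, k] is the weight from hidden unit j to output k (w_jk^(2)).
W2 = rng.normal(size=(3, 2))
b2 = np.zeros(2)

x = np.array([0.5, -1.0])           # a^(0) = x

z1 = x @ W1 + b1                    # z_j^(1) = sum_i w_ij^(1) a_i^(0) + b_j^(1)
a1 = sigmoid(z1)                    # a_j^(1) = sigma(z_j^(1))

z2 = a1 @ W2 + b2                   # z_k^(2) = sum_j w_jk^(2) a_j^(1) + b_k^(2)
y = sigmoid(z2)                     # y_k = a_k^(2)

print("hidden activations:", a1)
print("network outputs  :", y)
```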
7
Gradient Descent
and Backpropagation
Error Function
● How can we evaluate the performance of a neuron?
● We can use an Error function (also called a cost function or loss function) to measure how far off we are from the expected value.
● Choosing an appropriate error function helps the learning algorithm reach the best values for the weights and biases.
● We'll use the following variables:
○ D to represent the true (desired) value
○ y to represent the neuron's prediction
9
Error Functions
(Cost Function or Loss Function)
• There are many formulas for error functions.
• In this course, we will deal with two error function formulas.
1. Sum Squared Error (SSE): for a single perceptron
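For reference, a standard form of the SSE, written with the variables above (D the desired value, y the prediction, summed over the training patterns p), is:

```latex
E_{\text{SSE}} = \frac{1}{2} \sum_{p} \left( D_p - y_p \right)^2
```

The factor 1/2 is a common convention chosen so that the derivative of E comes out without an extra constant; some texts omit it.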
10
Why does the error in an ANN occur?
• Each weight and bias in the network contributes to the occurrence of the error.
11
Gradient Descent (in 1 dimension)
• Assume we have an error function E and we need to use it to update one weight w.
• The figure shows the error function in terms of w.
• Our target is to learn the value of w that produces the minimum value of E.
How?
[Figure: the error E plotted against w, with the minimum marked]
12
Gradient Descent (in 1 dimension)
• In the Gradient Descent algorithm, we use the following equation (called the Delta rule) to get a better value of w:

  w_new = w_old + Δw,   where  Δw = −η · dE/dw    (3)

Where:
• η is the learning rate
• Δw can be computed mathematically using the derivative of E with respect to w (dE/dw)
[Figure: the error E plotted against w, showing the update stepping toward the minimum]
13
Local Minima problem
14
Choosing the learning rate
15
Gradient Descent (multi-dimension)
• In an ANN with many layers and many neurons in each layer, the error function will be a multi-variable function.
• So, the derivative in equation (3) should be a partial derivative:

  Δw_i = −η · ∂E/∂w_i    (4)
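A minimal sketch of this rule in code; the toy error function, starting weights, and learning rate are assumptions for illustration. Gradient descent repeatedly nudges each weight against its partial derivative.

```python
import numpy as np

def error(w):
    # Toy multi-variable error function (illustrative only).
    return (w[0] - 3.0) ** 2 + (w[1] + 1.0) ** 2

def gradient(w):
    # Partial derivatives dE/dw_i of the toy error function.
    return np.array([2.0 * (w[0] - 3.0), 2.0 * (w[1] + 1.0)])

eta = 0.1                          # learning rate
w = np.array([0.0, 0.0])           # initial weights

for step in range(50):
    delta_w = -eta * gradient(w)   # Delta rule: dw_i = -eta * dE/dw_i
    w = w + delta_w                # w_new = w_old + dw

print("learned weights:", w)       # approaches (3, -1), the minimum of E
print("final error    :", error(w))
```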
16
Derivative of activation functions
• Sigmoid: f(z) = 1 / (1 + e^(−z)), and its derivative is f′(z) = f(z) · (1 − f(z))
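A quick numerical check of this identity (a sketch; the test point is arbitrary):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def sigmoid_derivative(z):
    s = sigmoid(z)
    return s * (1.0 - s)            # f'(z) = f(z) * (1 - f(z))

z = 0.7
h = 1e-6
numeric = (sigmoid(z + h) - sigmoid(z - h)) / (2 * h)   # central difference
print(sigmoid_derivative(z), numeric)                    # the two values agree closely
```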
17
Learning Rule in the output layer
• Using SSE as the error function and sigmoid as the activation function:

  δ_k = (D_k − y_k) * y_k * (1 − y_k)

Where:
• (D_k − y_k) is the output error coming from the SSE, and y_k(1 − y_k) is the sigmoid derivative.
• So (How?), by the chain rule, ∂E/∂w_jk = −δ_k · a_j, since w_jk affects E only through z_k = Σ_j w_jk a_j + b_k.
• Then:

  Δw_jk = −η · ∂E/∂w_jk = η · δ_k · a_j
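A worked numeric example of this rule; all values (desired output, prediction, hidden activation, learning rate) are assumptions chosen for illustration.

```python
# Output-layer weight update with SSE + sigmoid (illustrative numbers).
D_k  = 1.0      # desired output
y_k  = 0.73     # neuron's prediction (sigmoid output)
a_j  = 0.42     # activation of hidden unit j feeding this weight
eta  = 0.5      # learning rate

delta_k = (D_k - y_k) * y_k * (1.0 - y_k)   # delta_k = (D - y) * y * (1 - y)
dw_jk   = eta * delta_k * a_j               # Delta w_jk = eta * delta_k * a_j

print("delta_k =", round(delta_k, 5))       # 0.27 * 0.73 * 0.27 ≈ 0.05322
print("dw_jk   =", round(dw_jk, 5))         # ≈ 0.01118
```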
19
Learning Rule in the Hidden layer
• Now we have to determine the appropriate
weight change for an input to hidden weight.
• This is more complicated because it depends on
the error at all of the nodes this weighted
connection can lead to.
• The mathematical proof is out our scope.
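For reference, the standard backpropagation result for a sigmoid hidden unit j, stated here without proof and written in the same index convention (w_ij connects unit i to unit j) used above, is:

```latex
\delta_j \;=\; a_j\,(1 - a_j)\,\sum_{k}\delta_k\, w^{(2)}_{jk},
\qquad
\Delta w^{(1)}_{ij} \;=\; \eta\,\delta_j\, x_i
```

That is, a hidden unit's error signal is the weighted sum of the error signals of all the units it feeds, scaled by its own sigmoid derivative.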
20
Gradient Descent (Notes)
Note 1:
• The neuron activation function (f) must be a defined and differentiable function.
Note 2:
• The previous calculation has to be repeated for each weight and for each bias in the ANN.
• So, we need big computational power (what about deeper networks?).
Note 3:
• Calculating Δw for the hidden layers will be more difficult (Why?).
21
Gradient Descent (Notes)
• Δw represents the change in the value of w needed to get a better output.
• The equation for Δw depends on the choice of error (cost) function and activation function.
• The Gradient Descent algorithm helps in calculating the new values of the weights and biases.
• Question: is one iteration (one trial) enough to get the best values for the weights and biases?
• Answer: No, we need an extended version:
Backpropagation
22
How Does Backpropagation Work?
[Figure: forward propagation passes the inputs through layer 0 → layer 1 → layer 2 to produce y; back propagation sends the output error back through the same weights w^(1) and w^(2) to compute the weight updates]
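A minimal sketch of one forward and one backward pass for a small 2-3-1 network with SSE and sigmoid activations; the layer sizes, input pattern, and target are assumptions for illustration, and the update rules are the ones stated on the previous slides.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(1)
W1, b1 = rng.normal(size=(2, 3)), np.zeros(3)   # input -> hidden  (w_ij^(1))
W2, b2 = rng.normal(size=(3, 1)), np.zeros(1)   # hidden -> output (w_jk^(2))
eta = 0.5

x = np.array([1.0, 0.0])    # input pattern (illustrative)
D = np.array([1.0])         # desired output (illustrative)

# Forward propagation
a1 = sigmoid(x @ W1 + b1)
y  = sigmoid(a1 @ W2 + b2)

# Back propagation of the error
delta2 = (D - y) * y * (1 - y)              # output-layer deltas
delta1 = a1 * (1 - a1) * (W2 @ delta2)      # hidden-layer deltas

# Weight and bias updates (Delta rule)
W2 += eta * np.outer(a1, delta2); b2 += eta * delta2
W1 += eta * np.outer(x, delta1);  b1 += eta * delta1

print("output before update:", y)
```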
23
Online Learning vs. Offline Learning
• Online: pattern-by-pattern learning
  • Error calculated for each pattern
  • Weights updated after each individual pattern
• Offline: batch learning
  • Error calculated for all patterns
  • Weights updated once at the end of each epoch
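A sketch of the difference in code; the one-weight model, data, and gradient function are placeholders assumed for illustration. Only the position of the update step differs between the two schedules.

```python
# Placeholder pieces, assumed for illustration.
patterns = [(0.0, 0.0), (1.0, 2.0), (2.0, 4.0)]   # (input, desired) pairs
w, eta = 0.0, 0.1

def grad(w, x, d):
    # dE/dw for a single pattern with E = 0.5 * (d - w*x)^2
    return -(d - w * x) * x

# Online (pattern-by-pattern): update after every pattern.
for epoch in range(10):
    for x, d in patterns:
        w += -eta * grad(w, x, d)

# Offline (batch): accumulate over all patterns, update once per epoch.
w_batch = 0.0
for epoch in range(10):
    total_grad = sum(grad(w_batch, x, d) for x, d in patterns)
    w_batch += -eta * total_grad

print("online w:", round(w, 3), " batch w:", round(w_batch, 3))  # both near 2.0
```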
24
Choosing Appropriate Activation and Cost
Functions
• From our earlier consideration of single-layer networks, we already know what output
activation and cost functions should be used for particular problem types.
• We have also seen that non-linear hidden unit activations are needed,
such as sigmoids.
• So we can summarize the required network properties:
• Regression/ Function Approximation Problems
• SSE cost function, linear output activations, sigmoid hidden activations
• Classification Problems (2 classes, 1 output)
• Cross-Entropy (CE) cost function, sigmoid output and hidden activations
• Classification Problems (multiple-classes, 1 output per class)
• CE cost function, softmax outputs, sigmoid hidden activations
• In each case, application of the gradient descent learning algorithm (by
computing the partial derivatives) leads to appropriate back-propagation
weight update equations.
25
Overall picture: the learning process in an ANN
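Putting the pieces together, a sketch of the whole learning process: forward pass, error, backpropagation, weight update, repeated over many epochs. The XOR data, layer sizes, and hyperparameters are assumptions for illustration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Illustrative task: XOR with a 2-3-1 network, SSE error, sigmoid activations.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
D = np.array([[0], [1], [1], [0]], dtype=float)

rng = np.random.default_rng(42)
W1, b1 = rng.normal(size=(2, 3)), np.zeros(3)
W2, b2 = rng.normal(size=(3, 1)), np.zeros(1)
eta = 0.5

for epoch in range(20000):               # repeat until the error becomes small
    for x, d in zip(X, D):                # online (pattern-by-pattern) learning
        # 1) forward propagation
        a1 = sigmoid(x @ W1 + b1)
        y  = sigmoid(a1 @ W2 + b2)
        # 2) error signals (backpropagation)
        delta2 = (d - y) * y * (1 - y)
        delta1 = a1 * (1 - a1) * (W2 @ delta2)
        # 3) weight and bias updates (gradient descent)
        W2 += eta * np.outer(a1, delta2); b2 += eta * delta2
        W1 += eta * np.outer(x, delta1);  b1 += eta * delta1

# Outputs should approach the XOR targets; with an unlucky initialization,
# training can stall in a local minimum (see the Local Minima slide).
for x in X:
    print(x, np.round(sigmoid(sigmoid(x @ W1 + b1) @ W2 + b2), 2))
```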
26
Neural network simulator
• Search the internet to find a simulator and report on it.
For example:
• https://www.mladdict.com/neural-network-simulator
• http://playground.tensorflow.org/
27