
Neural Networks: Introduction and Overview

Nikhil Sardana
October 2017

1 Introduction
Neural networks are fundamental to modern machine learning. In order to
understand Convolutional Neural Networks (CNNs), Recurrent Neural Networks
(RNNs), and Generative Adversarial Networks (GANs), it is essential to
understand both the theory and the mathematics behind standard neural
networks. To ensure complete understanding, we build our network using only
numpy, removing any reliance on a higher-level machine learning library.

2 The Perceptron

2.1 Definition
A perceptron is the fundamental unit of a neural network (which is even
called a Multi-Layer Perceptron for this reason). Refer to the diagram above.
A perceptron contains two or more inputs, a weight for each input, a bias, an
activation function (here, the step function), and an output. For the
perceptron above with 2 inputs, the intermediate value f(x) is as follows:

f(x) = w1 x1 + w2 x2 + b

The final output y is just the step function:


y = 0 if f(x) < 0
y = 1 if f(x) > 0
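
As a quick illustration (this numpy sketch is my own, not part of the original
notes, and the function name is arbitrary), the perceptron above can be
written in a few lines:

    import numpy as np

    def perceptron(x, w, b):
        # Intermediate value f(x) = w1*x1 + w2*x2 + b
        f = np.dot(w, x) + b
        # Step activation: output 1 if f(x) > 0, otherwise 0
        return 1 if f > 0 else 0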

2.2 Visualization
The purpose of a perceptron is to classify data. Consider the function AND.
x1 x2 out
0 0 0
0 1 0
1 0 0
1 1 1
Let’s graph this data.

The line y = −x + 1.5 splits this data the best. Let’s rearrange this to get
x + y − 1.5 = 0. Going back to the perceptron formula

f(x) = w1 x1 + w2 x2 + b

we can see that for the optimal perceptron, w1 = 1 and w2 = 1 are the
coefficients of x and y, and b = −1.5. If f(x) > 0, then x + y − 1.5 > 0. We
can see through this example that a perceptron is nothing more than a linear
function: points above the line are classified as 1, and points below the
line are classified as 0.
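
Using the perceptron sketch from Section 2.1 (again, an illustrative check
rather than part of the original notes), we can verify that w1 = 1, w2 = 1,
b = −1.5 reproduces the AND table:

    w = np.array([1.0, 1.0])
    b = -1.5
    for x1 in (0, 1):
        for x2 in (0, 1):
            print(x1, x2, perceptron(np.array([x1, x2]), w, b))
    # Prints 1 only for the input (1, 1), matching AND.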

2.3 Learning
How do perceptrons "learn" the best possible linear function to split the data?
Perceptrons adjust the weights and bias to iteratively approach a solution.
Let’s consider this data:

[Figure: the data points, with the dashed line y + x − 1.5 = 0.]

The perceptron that represents the dashed line y + x − 1.5 = 0 has two
inputs, x1 , x2 , with corresponding weights w1 = 1, w2 = 1, and bias b = −1.5.
Let y represent the output of this perceptron. In the data above, the point (1, 0)
is the only misclassified point. The perceptron outputs 0 because it is below the
line, but it should output a 1.
For some data point (input) i with coordinates (i1 , i2 ), the perceptron adjusts
its weights and bias according to this formula:

w1 = w1 + α(d − y)(i1)
w2 = w2 + α(d − y)(i2)
b = b + α(d − y)
where d is the desired output and α is the learning rate, a constant usually
between 0 and 1. Notice that the equations degenerate to w1 = w1, w2 = w2,
and b = b when the desired output equals the perceptron output. In other
words, the perceptron only learns from misclassified points.
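
A minimal numpy sketch of this update rule (it reuses the perceptron helper
from Section 2.1; the function name and argument order are my own):

    def train_step(w, b, i, d, alpha):
        # Current perceptron output for the input point i = (i1, i2)
        y = perceptron(i, w, b)
        # Updates are zero when d == y, so only misclassified points change w and b
        w = w + alpha * (d - y) * i
        b = b + alpha * (d - y)
        return w, b
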
In the case of the above data, the perceptron only learns from the point
(1, 0). Let’s set α = 0.2 and compute the learning steps:

w1 = 1 + 0.2(1 − 0)(1) = 1.2

w2 = 1 + 0.2(1 − 0)(0) = 1
b = −1.5 + 0.2(1 − 0) = −1.3
After 1 iteration, the perceptron now represents the function y +1.2x−1.3 =
0, which is shown below:

[Figure: the data points, with the updated line y + 1.2x − 1.3 = 0.]

The next iteration follows:

w1 = 1.2 + 0.2(1 − 0)(1) = 1.4

w2 = 1 + 0.2(1 − 0)(0) = 1
b = −1.3 + 0.2(1 − 0) = −1.1

All the points are now correctly classified. The perceptron has learned!
Notice how it has not learned the best possible line, only the first one that
zeroes the difference between expected and actual output.
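
The two iterations above can be reproduced with the train_step sketch,
starting from w1 = 1, w2 = 1, b = −1.5 and repeatedly presenting the
misclassified point (1, 0) with desired output d = 1 and α = 0.2:

    w = np.array([1.0, 1.0])
    b = -1.5
    for _ in range(2):
        w, b = train_step(w, b, np.array([1.0, 0.0]), 1, 0.2)
        print(w, b)
    # First iteration:  w = [1.2, 1.0], b = -1.3
    # Second iteration: w = [1.4, 1.0], b = -1.1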

2.4 Non-Linearly Separable Data


Consider the function XOR:

x1 x2 out
0 0 0
0 1 1
1 0 1
1 1 0

Let’s graph this data.

We need two lines to separate this data! A single perceptron can never
classify it correctly, because no one line splits the two classes. However,
multiple perceptrons can learn multiple lines, which can be combined to
classify non-linearly separable data.

3 Multi-Layer Perceptron
A neural network (NN) or Multi-Layer Perceptron (MLP) is a bunch of these
perceptrons glued together, and can be used to classify multi-dimensional,
non-linearly separable data. Let us again consider XOR. How do we arrange
perceptrons to represent the two lines?
Clearly, we need two perceptrons, one for each line. The outputs of these two
perceptrons can then be used as the inputs to a third perceptron, which will
give us our output. Refer to the diagram below.

Let perceptron 1 model y + x − 1.5 = 0 (the upper line), and perceptron 2
model y + x − 0.5 = 0 (the lower line). Because the weights are the coefficients
of these functions, w1 = 1, w2 = 1, w3 = 1, w4 = 1 and the biases b1 = −1.5 and
b2 = −0.5.
The output of Perceptron 1 will be a 1 for points above the upper line, and
a 0 for points below the upper line. The output of Perceptron 2 will be a 1
for points above the lower line, and a 0 for points below the lower line. If
we simply summed these outputs with equal weights, we would get 2 above both
lines, 1 in between the lines, and 0 below both lines, and no single
threshold would separate the points between the lines from the points
outside them.
Instead, we would like the inputs to Perceptron 3 to cancel for points
outside the lines, and to reach a maximum for points between the lines.
Thus, we let w5 = −1 and w6 = 1. This gives a weighted sum into Perceptron 3
of 1 for points between the lines, and 0 for points outside the lines.
Setting the bias b3 = −0.5 then makes Perceptron 3 output a 1 exactly for the
points between the lines.
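
Putting the three perceptrons together (a sketch that reuses the perceptron
helper from Section 2.1; the weight and bias names follow the construction
above):

    def xor_mlp(x1, x2):
        x = np.array([x1, x2])
        # Perceptron 1 models the upper line y + x - 1.5 = 0: w1 = w2 = 1, b1 = -1.5
        o1 = perceptron(x, np.array([1.0, 1.0]), -1.5)
        # Perceptron 2 models the lower line y + x - 0.5 = 0: w3 = w4 = 1, b2 = -0.5
        o2 = perceptron(x, np.array([1.0, 1.0]), -0.5)
        # Perceptron 3 combines them: w5 = -1, w6 = 1, b3 = -0.5
        return perceptron(np.array([o1, o2]), np.array([-1.0, 1.0]), -0.5)

    for x1 in (0, 1):
        for x2 in (0, 1):
            print(x1, x2, xor_mlp(x1, x2))
    # Outputs 1 only for (0, 1) and (1, 0), i.e. XOR.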

4 Problems
1. Write out the weights, biases, and structure of the Perceptron that classi-
fies the function OR.
2. Write out the weights, biases, and structure of the Multi-Layer perceptron
that classifies the function XNOR.
3. Write an implementation of the XOR Multi-Layer Perceptron in Python.
