Lecture 06
Arpit Rana
13th January 2025
Introduction
Neural Networks are loosely inspired by what we know about our brains:
● Networks of neurons.
● However, they are not models of our brains.
○ E.g. there is no evidence that the brain uses the learning algorithm that is used by
neural networks.
Biological Neuron
[Figure: a biological neuron. Source: https://commons.wikimedia.org/wiki/File:Neuron.svg]
Artificial Neuron
[Figure: an artificial neuron with inputs x1, x2, ..., xn, weights w1, w2, ..., wn, and output hw(x).]
Artificial Neuron
● The neuron computes the weighted sum of its inputs and adds the bias b:
z = w1x1 + w2x2 + ... + wnxn + b
It then applies an activation function g to this sum to produce its output (activation):
a = g(z)
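To make this concrete, here is a minimal sketch of a single neuron's forward computation in Python with NumPy (the function and variable names are illustrative, not from the slides), using a sigmoid activation:

```python
import numpy as np

def sigmoid(z):
    # Sigmoid activation: squashes z into the range (0, 1).
    return 1.0 / (1.0 + np.exp(-z))

def neuron_forward(x, w, b):
    # Weighted sum of the inputs plus the bias ...
    z = np.dot(w, x) + b
    # ... passed through the activation function.
    return sigmoid(z)

# Example with n = 3 inputs.
x = np.array([0.5, -1.0, 2.0])
w = np.array([0.1, 0.4, -0.2])
b = 0.3
print(neuron_forward(x, w, b))
```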
Artificial Neuron
Commonly used activation functions include the sigmoid, tanh and ReLU functions. Apart from the linear (identity) activation function, these activation functions are non-linear, which is important to the power of neural networks.
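As an illustration (a sketch, not taken from the slides), these activation functions can be written directly in NumPy:

```python
import numpy as np

def linear(z):
    # Identity / linear activation: no non-linearity.
    return z

def sigmoid(z):
    # Maps any real z into (0, 1).
    return 1.0 / (1.0 + np.exp(-z))

def tanh(z):
    # Maps any real z into (-1, 1).
    return np.tanh(z)

def relu(z):
    # Zero for negative inputs, identity for positive inputs.
    return np.maximum(0.0, z)
```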
Artificial Neuron
● A single artificial neuron that uses the linear activation function gives us the same linear
models that we had in Linear Regression.
○ If we find the values of the weights and bias using MSE as our loss function, then
we will be doing OLS regression.
● A single artificial neuron that uses the sigmoid activation function gives us the same
models that we had when using Logistic Regression for binary classification.
○ We can set the weights using the binary cross-entropy function as our loss
function.
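For example, a single sigmoid neuron's prediction and the binary cross-entropy loss can be sketched as follows (a minimal NumPy illustration; the names are mine, not from the slides):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def predict_proba(x, w, b):
    # A single sigmoid neuron: the same model as logistic regression.
    return sigmoid(np.dot(w, x) + b)

def binary_cross_entropy(y_true, y_pred, eps=1e-12):
    # The loss we would minimise to set the weights for binary classification.
    y_pred = np.clip(y_pred, eps, 1 - eps)
    return -(y_true * np.log(y_pred) + (1 - y_true) * np.log(1 - y_pred))

x = np.array([1.2, -0.7])
w = np.array([0.5, 0.8])
b = -0.1
p = predict_proba(x, w, b)
print(p, binary_cross_entropy(1, p))
```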
Layers of Neurons
We don't usually have just one neuron. We have a layer, containing several neurons.
● For now, let's consider what is called a dense layer (also called a fully-connected layer): every
input is connected to every neuron in the layer.
● So now we have more than one output, one per neuron, each calculated as before.
Suppose there are m inputs and p neurons in a layer. We can put all the weights into an m × p
matrix W (for example, m = 2 and p = 3 gives a 2 × 3 matrix) and the p biases into a vector b.
Treating the input x as a row vector, the layer computes the weighted sums
z = xW + b
and then applies the activation function element-wise to give the layer's output
a = g(z)
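A minimal sketch of this layer computation in NumPy, with m = 2 inputs and p = 3 neurons (the names and sizes are illustrative):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def dense_layer(x, W, b, g=sigmoid):
    # x: (m,) input vector, W: (m, p) weight matrix, b: (p,) bias vector.
    z = x @ W + b   # one weighted sum per neuron
    return g(z)     # activation applied element-wise

m, p = 2, 3
rng = np.random.default_rng(0)
x = np.array([1.0, -2.0])
W = rng.normal(size=(m, p))
b = np.zeros(p)
print(dense_layer(x, W, b))   # 3 outputs, one per neuron
```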
Multilayer Neural Network
Let's assume we have multiple layers and they are also dense layers. These neural networks
contain an input layer, one or more hidden layers and an output layer, with each layer having its
own weight matrix and bias vector, e.g. W(0) and b(0) for the first layer, W(1) and b(1) for the
second.
The output of the first layer becomes the input to the second layer. Similarly, we can obtain the
output for the second layer, and so on.
● When we make predictions for unseen examples, we often want predictions, not for a
single object 𝒙, but for a set of objects X.
○ This is also true during training, in the case of Batch Gradient Descent and
Mini-Batch Gradient Descent.
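Here is a sketch of a full forward pass for a set of examples X, stacking one example per row, so that all predictions are computed by the same two matrix multiplications (illustrative NumPy code; the layer sizes and activation choices are my own assumptions):

```python
import numpy as np

def relu(z):
    return np.maximum(0.0, z)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward(X, W0, b0, W1, b1):
    # X: (n_examples, m). Each row is one input vector.
    A0 = relu(X @ W0 + b0)      # first (hidden) layer, applied to every row at once
    A1 = sigmoid(A0 @ W1 + b1)  # second (output) layer
    return A1

rng = np.random.default_rng(0)
X = rng.normal(size=(4, 2))                    # a batch of 4 examples, 2 features each
W0, b0 = rng.normal(size=(2, 3)), np.zeros(3)  # first layer: 2 inputs -> 3 neurons
W1, b1 = rng.normal(size=(3, 1)), np.zeros(1)  # second layer: 3 inputs -> 1 neuron
print(forward(X, W0, b0, W1, b1))              # 4 predictions, one per example
```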
Matrix Multiplication Again
● This is all that neural networks consist of! They are just collections of:
○ matrix multiplications; and
○ element-wise activation functions.
● Looking at neural networks in this way also helps us realise that a neural network simply
defines a function as a composite of other functions.
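For instance, the two-layer network above defines the composite function
h(x) = g(1)( g(0)( x W(0) + b(0) ) W(1) + b(1) ),
i.e. an affine transformation followed by an element-wise non-linearity, repeated once per layer.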
● With linear models, there are problems we cannot solve, e.g., we cannot build a classifier
that correctly classifies exclusive-or:
𝒙1 𝒙2 𝒙1 ⊕ 𝒙2
0 0 0
0 1 1
1 0 1
1 1 0
Note: A recent paper in Science Magazine claims that a single layer of biological neurons can compute exclusive-or. If true, this confirms
what we said earlier: artificial neural networks are inspired by the human brain, but they are not a model of the human brain.
Why Do We Need More Layers?
[Figure: a two-layer network that computes exclusive-or. All connections have a weight equal to 1, except the four connections where the weight is shown.]
𝒙1 𝒙2 𝒙1 ⊕ 𝒙2
0 0 0
0 1 1
1 0 1
1 1 0
So, with multiple layers of neurons and the non-linearities of their activation functions, we
can overcome these limitations.
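The exact weights from the figure are not reproduced here, but one well-known construction uses a hidden OR-like unit and a hidden AND-like unit whose difference gives exclusive-or. A hypothetical sketch with step activations (the weights and thresholds below are my own, not those shown on the slide):

```python
import numpy as np

def step(z):
    # Threshold activation: 1 if z > 0, else 0.
    return (z > 0).astype(float)

def xor_network(x1, x2):
    x = np.array([x1, x2], dtype=float)
    # Hidden layer: first unit fires for OR, second unit fires for AND.
    h = step(x @ np.array([[1.0, 1.0],
                           [1.0, 1.0]]) + np.array([-0.5, -1.5]))
    # Output: OR minus AND is exclusive-or.
    return step(h @ np.array([1.0, -1.0]) - 0.5)

for a in (0, 1):
    for b in (0, 1):
        print(a, b, int(xor_network(a, b)))   # prints the truth table of XOR
```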
Why Do We Need More Layers?
● Other things being equal, each extra hidden layer enlarges the set of hypotheses that the
network can represent, i.e. it increases the complexity of the functions the network can express.
● In fact, the universal approximation theorem states that a feed-forward network with a
single hidden layer containing a finite (but arbitrarily large) number of neurons can approximate
any continuous function on a bounded input domain (to any desired precision), under mild
assumptions on the activation function.
Training a Neural Network
Neural networks learn by modifying the values of the weights and biases.
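As a preview, here is a minimal sketch of one way the weights and biases can be modified, assuming plain gradient descent on the MSE loss mentioned earlier for a single linear neuron (this is an illustration, not the exact procedure from the slides):

```python
import numpy as np

def gradient_descent_step(X, y, w, b, lr=0.1):
    # Forward pass: a single linear neuron, so predictions are X w + b.
    y_pred = X @ w + b
    error = y_pred - y
    n = len(y)
    # Gradients of the MSE loss with respect to the weights and the bias.
    grad_w = (2.0 / n) * X.T @ error
    grad_b = (2.0 / n) * np.sum(error)
    # Move the parameters a small step against the gradient.
    return w - lr * grad_w, b - lr * grad_b

rng = np.random.default_rng(0)
X = rng.normal(size=(8, 2))
y = X @ np.array([2.0, -1.0]) + 0.5   # synthetic targets from known weights
w, b = np.zeros(2), 0.0
for _ in range(200):
    w, b = gradient_descent_step(X, y, w, b)
print(w, b)   # should approach [2, -1] and 0.5
```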