Unit II
A shallow neural network has only one (or just a few) hidden layers between the input and
output layers. The input layer receives the data, the hidden layer(s) process it, and the final
layer produces the output.
Shallow neural networks are simpler, more easily trained, and have greater computational
efficiency than deep neural networks, which may have thousands of hidden units in dozens
of layers. Shallow networks are typically used for simpler tasks such as linear regression,
binary classification, or low-dimensional feature extraction.
Logistic Regression
At the heart of logistic regression lies the logistic function, f(x) = 1 / (1 + e^(-x)), which
has a sigmoidal shape and returns a value between 0 and 1 for all inputs x.
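As a quick illustration, the logistic function can be written in a few lines of Python (a minimal sketch; the example weights w, bias b, and the convention of scoring an input with w·x + b are illustrative assumptions):

import math

def logistic(x):
    # f(x) = 1 / (1 + e^(-x)); the output always lies strictly between 0 and 1
    return 1.0 / (1.0 + math.exp(-x))

def predict_proba(x, w, b):
    # Logistic regression scores an example with w.x + b and squashes the
    # score through the logistic function to obtain a probability.
    score = sum(wi * xi for wi, xi in zip(w, x)) + b
    return logistic(score)

print(logistic(0.0))                                     # 0.5
print(predict_proba([1.0, 2.0], w=[0.3, -0.1], b=0.05))  # about 0.54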
Random Forest
The Random Forest is a machine learning technique for classification and prediction
of data. The building block of the Random Forest is the Decision Tree.
Cluster Analysis
Unlike the supervised machine learning techniques above, Cluster Analysis
is unsupervised. Its goal is to subdivide large data sets into clusters: groups of objects whose
properties or features are more similar to each other than to those of objects in other groups.
1. Input Layer: It's the layer in which we give input to our model. The number of
neurons in this layer is equal to the total number of features in our data (number
of pixels in the case of an image).
2. Hidden Layer: The input from the Input layer is then fed into the hidden layer.
There can be many hidden layers, depending on our model and data size. Each
hidden layer can have a different number of neurons, which is generally greater
than the number of features. The output of each layer is computed by multiplying
the output of the previous layer with that layer's learnable weights (a matrix
multiplication), adding the learnable biases, and then applying an activation
function, which makes the network nonlinear (this computation is illustrated in
the sketch after this list).
3. Output Layer: The output from the hidden layer is then fed into a logistic
function such as sigmoid or softmax, which converts the raw output for each class
into that class's probability score.
(Figure: Simple CNN architecture)
4. Hyperparameter tuning is the process of selecting the optimal values for a machine
learning model’s hyperparameters. Hyperparameters are settings that control the
learning process of the model, such as the learning rate, the number of neurons in a
neural network, or the kernel size in a support vector machine. The goal of
hyperparameter tuning is to find the values that lead to the best performance on a
given task.
5. Batch Normalization (BN) is a popular technique used in deep learning to improve
the training of neural networks by normalizing the inputs of each layer, which
stabilizes the learning process and accelerates convergence (a sketch of the
computation follows after this list).
6. The XOR problem is a classic problem in artificial intelligence and machine
learning. XOR, which stands for exclusive OR, is a logical operation that takes
two binary inputs and returns true if exactly one of the inputs is true. The XOR
gate follows a specific truth table, where the output is true only when the inputs
differ. This problem is particularly interesting because a single-layer perceptron,
the simplest form of a neural network, cannot solve it.
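To make items 2 and 6 concrete, here is a minimal NumPy sketch (layer sizes, learning rate, and initialization are illustrative assumptions, not a prescribed architecture) of a network with one hidden layer learning XOR, something a single-layer perceptron cannot do. Each layer performs exactly the computation described in item 2: a matrix multiplication, a bias addition, and a nonlinear activation.

import numpy as np

rng = np.random.default_rng(0)

# XOR truth table: the output is 1 only when the two inputs differ.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
T = np.array([[0], [1], [1], [0]], dtype=float)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# One hidden layer; 4 neurons train more reliably than the minimum of 2.
W1, b1 = rng.normal(size=(2, 4)), np.zeros((1, 4))
W2, b2 = rng.normal(size=(4, 1)), np.zeros((1, 1))
lr = 0.5  # assumed learning rate

for _ in range(10000):
    # Forward pass: matrix multiplication, bias addition, activation (item 2).
    H = sigmoid(X @ W1 + b1)
    Y = sigmoid(H @ W2 + b2)
    # Backward pass: gradients of the squared error with respect to each layer.
    dY = (Y - T) * Y * (1 - Y)
    dH = (dY @ W2.T) * H * (1 - H)
    W2 -= lr * H.T @ dY
    b2 -= lr * dY.sum(axis=0, keepdims=True)
    W1 -= lr * X.T @ dH
    b1 -= lr * dH.sum(axis=0, keepdims=True)

print(Y.round(2))  # should be close to [[0], [1], [1], [0]] after training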
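And for the Batch Normalization mentioned in item 5, the core operation is simply: subtract the batch mean, divide by the batch standard deviation, then scale and shift. Below is a minimal sketch of the training-time computation (gamma, beta, and eps are the usual learnable scale, learnable shift, and numerical-stability constant; the running statistics used at inference time are omitted):

import numpy as np

def batch_norm(x, gamma, beta, eps=1e-5):
    # x has shape (batch_size, features): the inputs entering a layer.
    mu = x.mean(axis=0)                      # per-feature mean over the batch
    var = x.var(axis=0)                      # per-feature variance over the batch
    x_hat = (x - mu) / np.sqrt(var + eps)    # normalized to ~zero mean, unit variance
    return gamma * x_hat + beta              # learnable scale and shift

x = np.array([[1.0, 50.0], [2.0, 60.0], [3.0, 70.0]])
out = batch_norm(x, gamma=np.ones(2), beta=np.zeros(2))
print(out.mean(axis=0), out.std(axis=0))     # approximately [0, 0] and [1, 1]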
7. Backpropagation
Backpropagation is an iterative, recursive, and efficient method for calculating the updated
weights that improve the network until it is able to perform the task for which it is being
trained. The derivatives of the activation function must be known at network design time for
Backpropagation to be used.
Now, how is the error function used in Backpropagation, and how does Backpropagation work?
Let us start with an example and work through it mathematically to understand exactly how
Backpropagation updates the weights.
Input values
X1=0.05
X2=0.10
Initial Weights
w1=0.15 w5=0.40
w2=0.20 w6=0.45
w3=0.25 w7=0.50
w4=0.30 w8=0.55
Bias Values
b1=0.35 b2=0.60
Target Values
T1=0.01
T2=0.99
Forward Pass
To find the value of H1, we multiply the input values by their weights and add
the bias b1:
H1=x1×w1+x2×w2+b1
H1=0.05×0.15+0.10×0.20+0.35
H1=0.3775
Similarly, H2=x1×w3+x2×w4+b1=0.05×0.25+0.10×0.30+0.35=0.3925
H1 and H2 are then passed through the sigmoid activation function:
out_H1 = 1 / (1 + e^(-0.3775)) = 0.593269992
out_H2 = 1 / (1 + e^(-0.3925)) = 0.596884378
To find the value of y1, we multiply these hidden-layer outputs by their weights
and add the bias b2:
y1=out_H1×w5+out_H2×w6+b2
y1=0.593269992×0.40+0.596884378×0.45+0.60
y1=1.10590597
Our target values are 0.01 and 0.99. The computed y1 and y2 do not match the
target values T1 and T2, so the error is propagated backwards and the weights are updated.
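The arithmetic above is easy to check with a few lines of Python (a minimal sketch that simply reproduces the forward-pass numbers from this example):

import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

x1, x2 = 0.05, 0.10
w1, w2, w3, w4 = 0.15, 0.20, 0.25, 0.30
w5, w6, w7, w8 = 0.40, 0.45, 0.50, 0.55
b1, b2 = 0.35, 0.60

# Hidden layer: weighted sum plus bias, then the sigmoid activation.
h1 = x1 * w1 + x2 * w2 + b1                 # 0.3775
h2 = x1 * w3 + x2 * w4 + b1                 # 0.3925
out_h1, out_h2 = sigmoid(h1), sigmoid(h2)   # 0.593269992, 0.596884378

# Output layer: weighted sum of the hidden outputs plus bias.
y1 = out_h1 * w5 + out_h2 * w6 + b2         # 1.10590597
y2 = out_h1 * w7 + out_h2 * w8 + b2
print(h1, h2, out_h1, out_h2, y1, y2)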
8. Activation Function
Activation functions add a nonlinear property to the neural network. This allows the
network to model more complex data. ReLU should generally be used as an activation
function in the hidden layers. In the output layer, the expected value range of the
predictions must always be considered.
No matter how complex the input data is, a network without activation functions can only
apply a linear transformation to it.
Common activation functions:
o Linear Function
o Sigmoid Function
Non-linear in nature. For x values between roughly -2 and 2, the curve is very steep, so small
changes in x cause significant shifts in the value of Y. The output spans from 0 to 1.
Tanh Function
The activation that consistently outperforms the sigmoid function is the tangent hyperbolic
(tanh) function. It is actually a mathematically shifted and rescaled version of the sigmoid;
the two are closely related and can be derived from one another.
Equation: tanh(x) = (e^x - e^(-x)) / (e^x + e^(-x)) = 2 × sigmoid(2x) - 1, with outputs
ranging from -1 to 1.
o ReLU Function
Currently, ReLU (the Rectified Linear Unit, f(x) = max(0, x)) is the most widely used
activation function, since practically all convolutional neural networks and deep learning
systems employ it.
o Softmax Function
Although it is a generalization of the sigmoid function, the softmax function comes in handy
when dealing with multiclass classification problems.
It is used frequently when handling multiple classes, and is typically found in the output
nodes of image classification problems. The softmax function divides each output by the sum
of all the outputs, squeezing the output for each class into the range 0 to 1 so that the
values sum to 1.
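For reference, the functions discussed above can be sketched in a few lines of NumPy (the max-subtraction inside softmax is a common numerical-stability convention, not part of the definition):

import numpy as np

def sigmoid(x):
    # Squashes any real input into the range (0, 1).
    return 1.0 / (1.0 + np.exp(-x))

def tanh(x):
    # A shifted and rescaled sigmoid; outputs lie in (-1, 1).
    return np.tanh(x)

def relu(x):
    # max(0, x): passes positive values through and zeroes out negatives.
    return np.maximum(0.0, x)

def softmax(x):
    # Converts a vector of scores into probabilities that sum to 1.
    e = np.exp(x - np.max(x))   # subtract the max for numerical stability
    return e / e.sum()

scores = np.array([2.0, 1.0, 0.1])
print(softmax(scores))          # roughly [0.66, 0.24, 0.10]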
Gradient Descent in Machine Learning
Gradient Descent is one of the most commonly used optimization algorithms for training
machine learning models by minimizing the error between actual and predicted results.
Gradient descent is also used to train Neural Networks.
Using gradient descent, the local minimum or local maximum of a function can be found as
follows:
o If we move in the direction of the negative gradient (away from the gradient) of the
function at the current point, we reach the local minimum of that function.
o If we move in the direction of the positive gradient (towards the gradient) of the
function at the current point, we reach the local maximum of that function.
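In practice, "moving against the gradient" means repeatedly applying the standard update rule
(assuming a cost function J(θ) with parameter θ and learning rate α):
θ_new = θ_old - α × ∂J/∂θ
The learning rate α controls the size of each step: too large a value can overshoot the
minimum, while too small a value makes convergence slow.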
Based on how many training examples are used to compute the error for each update, the
Gradient Descent learning algorithm can be divided into batch gradient descent, stochastic
gradient descent, and mini-batch gradient descent. Let's understand these different types of
gradient descent:
Batch gradient descent (BGD) computes the error for each point in the training set and
updates the model only after all training examples have been evaluated. One full pass over
the training set is known as a training epoch. In simple words, we sum the gradients over all
examples for each single update.
Stochastic gradient descent (SGD) is a type of gradient descent that processes one training
example per iteration, updating the model after each example.
Mini-batch gradient descent is a combination of batch gradient descent and stochastic
gradient descent. It divides the training dataset into small batches and then performs an
update on each of those batches separately.
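The difference between the three variants is simply how many examples contribute to each update. Below is a minimal NumPy sketch for a linear model trained with squared-error loss (the synthetic data, learning rate, epoch count, and batch size of 16 are illustrative assumptions):

import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))                  # 100 examples, 3 features
true_w = np.array([2.0, -1.0, 0.5])
y = X @ true_w + 0.1 * rng.normal(size=100)    # noisy linear targets

def gradient(w, Xb, yb):
    # Gradient of the mean squared error for the linear model y_hat = Xb @ w.
    return 2.0 * Xb.T @ (Xb @ w - yb) / len(yb)

def train(batch_size, lr=0.05, epochs=100):
    w = np.zeros(3)
    for _ in range(epochs):
        idx = rng.permutation(len(X))          # shuffle each epoch
        for start in range(0, len(X), batch_size):
            b = idx[start:start + batch_size]
            w -= lr * gradient(w, X[b], y[b])  # one update per batch
    return w

print(train(batch_size=len(X)))  # batch GD: one update per epoch
print(train(batch_size=1))       # stochastic GD: one example per update
print(train(batch_size=16))      # mini-batch GD: small batches per update
# Each run should recover weights close to true_w.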