NNML Full
Biological neurons form the basis of the human brain and are connected through synapses.
They consist of dendrites (input), a cell body (processing), and an axon (output). Artificial
Neural Networks (ANNs) mimic this structure.
In ANNs, inputs represent dendrites, the node represents the cell body, weights act as
synapses, and output resembles the axon.
The main goal of ANNs is to simulate the way the human brain learns and makes decisions,
using interconnected artificial neurons that transmit data and adjust through learning.
McCulloch-Pitts Perceptron
Perceptron is a supervised learning algorithm used for binary classification. It takes input
values, multiplies them by weights, adds a bias, and passes the result through an activation
function to determine output.
Single-layer Perceptron: This is the simplest form with one input layer and one output
node. It can only solve linearly separable problems. If the weighted sum exceeds a
threshold, the output is 1; otherwise, it is 0.
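To make the weighted-sum-and-threshold rule concrete, here is a minimal NumPy sketch of a single-layer perceptron; the weights and bias (chosen to realize a logical AND) are illustrative assumptions, not values from these notes.

```python
import numpy as np

def perceptron(x, w, b):
    """Single-layer perceptron: weighted sum + step activation."""
    weighted_sum = np.dot(w, x) + b
    return 1 if weighted_sum > 0 else 0  # threshold at 0

# Illustrative weights/bias realizing a logical AND (linearly separable)
w = np.array([1.0, 1.0])
b = -1.5
for x in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(x, "->", perceptron(np.array(x), w, b))
```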
Multi-layer Perceptron (MLP): An MLP consists of an input layer, one or more hidden
layers, and an output layer. It uses activation functions like Sigmoid, Tanh, or ReLU. The
MLP learns using the backpropagation algorithm, which involves forward propagation
to compute output and backward propagation to adjust weights by minimizing error. It can
model complex, non-linear problems and perform classification and regression tasks.
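As a hedged illustration of forward and backward propagation, this NumPy sketch trains a tiny MLP (one hidden layer, sigmoid activations) on XOR, a classic problem a single-layer perceptron cannot solve; the layer size, learning rate, and iteration count are arbitrary assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)  # XOR targets

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# One hidden layer with 4 units (an arbitrary choice)
W1, b1 = rng.normal(size=(2, 4)), np.zeros(4)
W2, b2 = rng.normal(size=(4, 1)), np.zeros(1)
lr = 1.0

for _ in range(5000):
    # Forward propagation: compute the output
    h = sigmoid(X @ W1 + b1)
    out = sigmoid(h @ W2 + b2)
    # Backward propagation: error gradients, layer by layer
    d_out = (out - y) * out * (1 - out)
    d_h = (d_out @ W2.T) * h * (1 - h)
    # Adjust weights to minimize the error
    W2 -= lr * h.T @ d_out; b2 -= lr * d_out.sum(0)
    W1 -= lr * X.T @ d_h;   b1 -= lr * d_h.sum(0)

print(out.round(2))  # should approach [[0], [1], [1], [0]]
```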
Activation Functions
Activation functions are mathematical equations that determine the output of a neural network model.
They decide whether a neuron should be activated or not by introducing non-linearity into the
model. Without activation functions, neural networks would behave like a linear regression model,
regardless of the number of layers.
They play a critical role in helping neural networks learn and make complex decisions by enabling
them to approximate non-linear functions.
Sigmoid
Definition:
The sigmoid function maps any input value into the range of 0 to 1 using the formula: σ(x) = 1 / (1 + e^(-x))
Properties:
Range: (0, 1)
Shape: S-shaped (sigmoid curve)
Differentiable: Yes, which is necessary for backpropagation.
Advantages:
Outputs values between 0 and 1, making it suitable for binary classification and
probability predictions.
Provides a smooth gradient, preventing abrupt changes in output values.
Disadvantages:
Saturates for large input values, leading to vanishing gradients and slow learning.
Outputs are not zero-centered, which can make optimization more challenging.
Use Case:
Tanh
Definition:
Tanh is similar to the sigmoid function but outputs values in a different range: tanh(x) = (e^x - e^(-x)) / (e^x + e^(-x))
Properties:
Range: (-1, 1)
Shape: S-shaped, like sigmoid, but zero-centered.
Advantages:
Disadvantages:
Use Case:
ReLU
Definition:
ReLU is the most widely used activation function in modern neural networks: f(x) = max(0, x)
Properties:
Range: [0, ∞)
Advantages:
Simple computation and fast convergence.
Disadvantages:
Dying ReLU Problem: If neurons only output 0, they may stop learning.
Not zero-centered.
Use Case:
Definition:
Properties:
Advantages:
Disadvantages:
Use Case:
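A small NumPy sketch of the activation functions discussed above, together with the derivatives that backpropagation needs; purely illustrative.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))   # range (0, 1)

def sigmoid_grad(x):
    s = sigmoid(x)
    return s * (1 - s)                # saturates for large |x| (vanishing gradient)

def tanh(x):
    return np.tanh(x)                 # range (-1, 1), zero-centered

def relu(x):
    return np.maximum(0.0, x)         # range [0, inf)

def relu_grad(x):
    return (x > 0).astype(float)      # 0 for negative inputs ("dying ReLU")

x = np.array([-2.0, 0.0, 2.0])
print(sigmoid(x), tanh(x), relu(x))
```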
1. Feedforward Neural Networks (FNNs)
Structure:
Input Layer: Accepts input features. Each neuron corresponds to one feature.
Hidden Layers: Perform computations using weighted inputs and biases. Non-linear
activation functions (e.g., ReLU, sigmoid, tanh) are applied here.
Output Layer: Produces the final output of the network, such as a classification label or a
numeric value.
Working Mechanism:
Features:
Applications:
2. Convolutional Neural Networks (CNNs)
A Convolutional Neural Network (CNN) is a type of neural network designed to process grid-like data such as images.
Architecture:
1. Convolutional Layer: Uses filters (kernels) that slide over the image to extract local
features like edges, textures, or colors. It captures spatial hierarchies.
2. Activation Layer: Applies a non-linear activation function (commonly ReLU) to
introduce non-linearity.
3. Pooling Layer: Reduces spatial dimensions (e.g., MaxPooling) to make computation
efficient and reduce overfitting.
4. Fully Connected Layer (Dense Layer): Flattens the feature map into a vector for final
classification.
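The following PyTorch sketch stacks the four layer types in the order just listed; the input size (one 28×28 grayscale channel), filter count, and 10 output classes are assumptions for illustration.

```python
import torch
import torch.nn as nn

# Conv -> ReLU -> Pool -> Flatten -> Fully connected (assumed 28x28x1 input, 10 classes)
model = nn.Sequential(
    nn.Conv2d(in_channels=1, out_channels=8, kernel_size=3, padding=1),  # convolutional layer
    nn.ReLU(),                                                           # activation layer
    nn.MaxPool2d(kernel_size=2),                                         # pooling: 28x28 -> 14x14
    nn.Flatten(),                                                        # feature map -> vector
    nn.Linear(8 * 14 * 14, 10),                                          # fully connected layer
)

x = torch.randn(1, 1, 28, 28)   # one dummy grayscale image
print(model(x).shape)           # torch.Size([1, 10]) class scores
```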
Applications:
Image Classification: Classify an image into categories (e.g., dog, cat, etc.).
Object Detection: Identify and locate objects in an image (e.g., YOLO, SSD).
Facial Recognition: Used in security systems and photo tagging.
Medical Imaging: Detect anomalies in X-rays, MRIs, and CT scans.
Self-driving Cars: Lane detection, obstacle recognition, and traffic sign identification.
3. Recurrent Neural Networks (RNNs)
A Recurrent Neural Network (RNN) is a type of neural network designed to handle sequential
data. It has loops in its architecture that allow it to store past information in a hidden state and
use it for future computations.
Architecture:
Each neuron not only receives input from the current time step but also receives input
from the hidden state of the previous time step.
The network shares weights across time steps.
Uses Backpropagation Through Time (BPTT) for training.
Hidden State:
Challenges:
Vanishing Gradient Problem: When gradients become too small, it’s hard to learn
long-term dependencies.
Exploding Gradients: When gradients grow too large.
These are mitigated using improved architectures like:
o LSTM (Long Short-Term Memory): Uses gates to regulate memory flow.
o GRU (Gated Recurrent Unit): Simplified LSTM with similar performance.
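A minimal NumPy sketch of the recurrence itself: at each time step the hidden state is computed from the current input and the previous hidden state, with the same weights reused across steps. The sizes and random sequence are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
input_size, hidden_size, seq_len = 3, 4, 5

# Weights shared across all time steps
W_x = rng.normal(scale=0.1, size=(hidden_size, input_size))
W_h = rng.normal(scale=0.1, size=(hidden_size, hidden_size))
b = np.zeros(hidden_size)

h = np.zeros(hidden_size)                  # initial hidden state
for t in range(seq_len):
    x_t = rng.normal(size=input_size)      # input at the current time step
    h = np.tanh(W_x @ x_t + W_h @ h + b)   # h_t depends on x_t and h_{t-1}
    print(f"t={t}, h={h.round(3)}")
```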
Applications:
Machine Learning (ML) is a subset of Artificial Intelligence (AI) that enables computer systems
to learn patterns and make decisions or predictions from data without being explicitly
programmed. Instead of following strictly coded instructions, ML models learn from past
experiences (data) and improve their performance over time.
How it Works:
ML involves training algorithms on datasets so that the model can learn underlying patterns. Once
trained, the model can be used to make predictions or decisions on new, unseen data.
Applications of Machine Learning:
1. Healthcare:
o Disease prediction and diagnosis (e.g., cancer detection)
o Drug discovery and personalized treatment
2. Finance:
o Credit scoring and risk assessment
o Fraud detection and stock market prediction
3. Retail and Marketing:
o Recommendation systems (e.g., Amazon, Netflix)
o Customer segmentation and demand forecasting
4. Transportation:
o Self-driving cars and traffic prediction
o Route optimization in logistics
5. Natural Language Processing:
o Chatbots and virtual assistants
o Language translation and speech recognition
A. Supervised Learning
In supervised learning, the algorithm learns from labeled training data, mapping inputs to
known outputs.
The goal is to predict the output for new inputs based on what it learned.
Examples:
Algorithms:
Linear Regression
Logistic Regression
Decision Trees
Support Vector Machines (SVM)
k-Nearest Neighbors (KNN)
B. Unsupervised Learning
In unsupervised learning, the algorithm finds hidden patterns, groupings, or structure in unlabeled data, without known outputs to learn from.
Examples:
Algorithms:
K-Means Clustering
Hierarchical Clustering
Principal Component Analysis (PCA)
Association Rule Mining
C. Reinforcement Learning
In reinforcement learning, an agent learns by interacting with an environment, receiving rewards or penalties for its actions and adjusting its behavior to maximize cumulative reward.
Key Components:
Agent: The learner or decision maker.
Environment: The world the agent interacts with.
State, Action, Reward: The situation the agent observes, the move it makes, and the feedback it receives.
1. AutoML:
o Automates model selection, tuning, and deployment.
o Makes ML accessible to non-experts.
2. Explainable AI (XAI):
o Focuses on making ML models more transparent and understandable.
3. Federated Learning:
o Allows models to be trained across decentralized devices while preserving user
privacy.
4. Edge ML:
o Enables running ML algorithms on devices like smartphones and IoT sensors.
5. Introduction to Data Preprocessing
Before feeding data into a machine learning model, it must be preprocessed to ensure quality,
consistency, and relevance. Data preprocessing significantly affects model accuracy and
performance.
A. Data Cleaning: Handle missing values (imputation or removal), remove duplicates, and correct inconsistent or erroneous entries.
B. Data Transformation: Scale or normalize numerical features (e.g., min-max scaling, standardization) and encode categorical variables (e.g., one-hot encoding).
C. Feature Engineering:
Feature Creation: Create new features from raw data (e.g., extract "hour" from
timestamp).
Feature Selection: Remove redundant or irrelevant features using correlation, mutual
information, or wrapper methods.
Dimensionality Reduction: PCA, LDA to reduce features while retaining variance.
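A short pandas/scikit-learn sketch of the three ideas above; the timestamp values and random features are made up for illustration.

```python
import numpy as np
import pandas as pd
from sklearn.decomposition import PCA

# Feature creation: extract "hour" from a timestamp column (hypothetical data)
df = pd.DataFrame({"timestamp": pd.to_datetime(["2024-01-01 08:30", "2024-01-01 17:45"])})
df["hour"] = df["timestamp"].dt.hour

# Feature selection via correlation: spot a nearly redundant column
X = pd.DataFrame(np.random.default_rng(0).normal(size=(100, 3)), columns=["a", "b", "c"])
X["b"] = X["a"] * 0.95 + 0.05 * X["b"]   # make "b" almost a copy of "a"
print(X.corr().abs())                    # high |corr| flags candidates to drop

# Dimensionality reduction: PCA keeps the directions of maximum variance
X_reduced = PCA(n_components=2).fit_transform(X)
print(X_reduced.shape)  # (100, 2)
```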
I. Regression:
Regression algorithms predict a continuous numerical value. Two common types are:
🔷 1. Linear Regression
➤ Goal:
Predict a continuous output y as a linear function of the input x.
➤ Equation: y = mx + c
➤ Working:
➤ Use Cases:
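A minimal NumPy sketch that fits y = mx + c by least squares on synthetic data; the true slope, intercept, and noise level are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(0, 10, size=50)
y = 2.0 * x + 1.0 + rng.normal(scale=0.5, size=50)  # true m=2, c=1 plus noise

# Closed-form least-squares fit of y = mx + c
m, c = np.polyfit(x, y, deg=1)
print(f"m ≈ {m:.2f}, c ≈ {c:.2f}")  # should be close to 2 and 1

print(m * 12.0 + c)                 # predict for a new input x = 12
```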
🔷 2. Logistic Regression
➤ Goal:
Used for classification tasks (binary or multi-class), despite the name "regression".
➤ Equation: Sigmoid function: σ(z) = 1 / (1 + e^(-z)), where z = mx + c
➤ Working:
➤ Use Cases:
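A short scikit-learn sketch of binary classification with logistic regression; the synthetic dataset is an assumption for illustration.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=200, n_features=4, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

clf = LogisticRegression().fit(X_train, y_train)
print(clf.score(X_test, y_test))      # accuracy on held-out data
print(clf.predict_proba(X_test[:1]))  # sigmoid outputs: class probabilities
```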
II. Classification
Classification algorithms predict discrete categories or class labels.
🔷 3. K-Nearest Neighbors (KNN)
➤ Goal:
Classify a data point based on the majority label of its k-nearest neighbors.
➤ Working:
Calculate Euclidean (or other) distances between the new point and all training points.
Select the k closest points.
Assign the class with the most votes among neighbors.
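A compact NumPy sketch of the three steps just listed (distances, k closest, majority vote); the toy points and k = 3 are assumptions.

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_new, k=3):
    # 1. Euclidean distance from x_new to every training point
    dists = np.linalg.norm(X_train - x_new, axis=1)
    # 2. Indices of the k closest points
    nearest = np.argsort(dists)[:k]
    # 3. Majority vote among their labels
    return Counter(y_train[nearest]).most_common(1)[0][0]

X_train = np.array([[1, 1], [1, 2], [5, 5], [6, 5]])
y_train = np.array([0, 0, 1, 1])
print(knn_predict(X_train, y_train, np.array([1.5, 1.5])))  # -> 0
```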
➤ Use Cases:
Recommender systems
Image classification
➤ Advantages:
🔷 4. Decision Tree
A Decision Tree is a supervised learning algorithm used for classification and regression. It
splits the data based on feature values by asking questions at each node. Each branch shows an
outcome, and each leaf gives a prediction. The goal is to keep dividing the data until each group
is similar or belongs to one class. It's simple, easy to understand, and widely used.
➤ Goal:
➤ Working:
➤ Use Cases:
➤ Advantages:
Easy to interpret
Handles both numerical and categorical data
➤Disadvantages:
Prone to overfitting
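A minimal scikit-learn sketch on the built-in Iris dataset; limiting max_depth is one common guard against the overfitting noted above, and the value 3 is an arbitrary choice.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Shallow tree: each split asks a question about one feature
tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X_train, y_train)
print(tree.score(X_test, y_test))
```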
🔷 5. Random Forest
➤ Goal:
An ensemble learning method that builds multiple decision trees and merges them for better
accuracy.
➤ Working:
Builds many decision trees on bootstrap samples of the data, each split considering a random subset of features, then aggregates their predictions by majority vote (classification) or averaging (regression).
➤ Use Cases:
Fraud detection
Stock price prediction
➤ Advantages:
High accuracy
Reduces overfitting
➤ Disadvantages:
Less interpretable than a single decision tree
Slower to train and predict as the number of trees grows
Support Vector Machine (SVM) is a supervised machine learning algorithm used for
classification and regression tasks. While it can handle regression problems, SVM is
particularly well-suited for classification tasks.
➤ Working:
Selects the best separating hyperplane with the maximum margin between classes.
Can use kernel tricks to handle non-linearly separable data (e.g., RBF, Polynomial).
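A short scikit-learn sketch using the RBF kernel trick mentioned above on data that is not linearly separable; the two-moons dataset and the C and gamma settings are illustrative assumptions.

```python
from sklearn.datasets import make_moons
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

# Two interleaving half-circles: not linearly separable in the input space
X, y = make_moons(n_samples=200, noise=0.2, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# The RBF kernel implicitly maps the data to a space where a
# maximum-margin separating hyperplane can be found
clf = SVC(kernel="rbf", C=1.0, gamma="scale").fit(X_train, y_train)
print(clf.score(X_test, y_test))
```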
➤ Use Cases:
Face detection
Text classification
Bioinformatics
➤ Advantages:
Effective in high-dimensional feature spaces
Works well when classes are separated by a clear margin
➤ Disadvantages:
Memory-intensive
Difficult to tune parameters (kernel, C, gamma).
K-Means
K-Means is an unsupervised, iterative clustering technique that partitions a dataset into k distinct
clusters by assigning each point to the nearest cluster “mean,” then recomputing means until
convergence. It emphasizes intra-cluster similarity and inter-cluster dissimilarity, making it a fast,
scalable method for grouping data.
Definition
Unsupervised iterative technique: No labels are used; clusters form based solely on data
distribution.
Cluster: A set of points exhibiting mutual similarity, with each point belonging to the
cluster whose mean (centroid) is nearest.
Algorithm Steps
1. Choose k: Decide the number of clusters you want.
2. Initialize centroids: Randomly pick k data points as initial cluster centers, ensuring they
are as far apart as possible.
3. Compute distances: For each data point, calculate its distance to every centroid (e.g.,
Euclidean or custom distance).
4. Assign clusters: Assign each point to the cluster of its nearest centroid.
5. Update centroids: Recalculate each centroid as the mean of all points assigned to that
cluster.
6. Repeat: Go back to step 3 and iterate until one of the following stopping criteria is met:
o Centroids no longer move
o Point assignments remain unchanged
o A preset maximum number of iterations is reached.
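A NumPy sketch of the loop just described (assign each point to the nearest centroid, recompute the means, stop when the centroids stabilize); k = 2, the toy points, and the iteration cap are assumptions, and empty clusters are not handled for brevity.

```python
import numpy as np

def kmeans(X, k=2, max_iter=100, seed=0):
    rng = np.random.default_rng(seed)
    centroids = X[rng.choice(len(X), size=k, replace=False)]  # step 2: init
    for _ in range(max_iter):                                 # step 6: repeat
        # Steps 3-4: distance from every point to every centroid, assign nearest
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Step 5: recompute each centroid as the mean of its assigned points
        new_centroids = np.array([X[labels == j].mean(axis=0) for j in range(k)])
        if np.allclose(new_centroids, centroids):             # stopping criterion
            break
        centroids = new_centroids
    return labels, centroids

X = np.array([[1.0, 1.0], [1.5, 2.0], [8.0, 8.0], [9.0, 8.5]])
print(kmeans(X, k=2))
```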
Advantages
Time complexity: Efficient O(n k t) where n = instances, k = clusters, t = iterations.
Local optimum: Often converges quickly to a local optimum; methods like simulated annealing or genetic algorithms can be combined with it to search for the global optimum.
Disadvantages
Need to specify k: You must know the number of clusters in advance.
Sensitivity to outliers: Cannot handle noise or outliers well.
Gradient Descent (GD)
Definition: An optimization algorithm used to train machine learning models (including neural networks) by iteratively adjusting parameters in the direction of the negative gradient of the loss function, θ ← θ − η∇J(θ) with learning rate η, thereby minimizing prediction error.
Usage:
Trains models by minimizing the difference between actual and expected outputs
(e.g., Mean Squared Error in regression).
Fundamental to backpropagation in neural networks, where weights and biases
are updated via GD at each layer.
1. Batch Gradient Descent (BGD)
How it works: Computes the gradient of the cost function using the entire training dataset before each parameter update.
Advantages:
Produces stable, smooth convergence since each update is based on the full dataset.
Precise gradient estimates lead to consistent progress toward the global minimum.
Disadvantages:
Can be very slow on large datasets due to full-dataset passes each iteration.
High memory usage, as it must load all samples to compute each update.
2. Stochastic Gradient Descent (SGD)
How it works: Updates model parameters using the gradient computed from one randomly selected training example per iteration.
Advantages:
Faster convergence in practice for large datasets, since updates are made more
frequently.
Requires minimal memory and can begin learning before seeing the entire dataset.
Randomness helps escape shallow local minima in non-convex loss landscapes.
Disadvantages:
Updates have high variance, causing the loss to fluctuate rather than decrease
smoothly.
May require careful tuning of learning rate and often benefits from decay schedules.
3. Mini-Batch Gradient Descent (MBGD)
How it works: Splits the training set into small batches (e.g., 32–256 samples) and
performs an update for each mini-batch.
Advantages:
Balances the stability of batch GD with the speed of SGD.
Enables efficient, vectorized computation on modern hardware.
Disadvantages:
Still requires batch-size tuning; too small batches behave like SGD, too large like
BGD.
May get stuck in local minima if batch size is poorly chosen.
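To tie the three variants together, here is a NumPy sketch where the batch size selects the variant: batch_size = n gives batch GD, batch_size = 1 gives SGD, and anything in between is mini-batch GD. The linear-regression objective, learning rate, and epoch count are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200
X = rng.normal(size=(n, 1))
y = 3.0 * X[:, 0] + 2.0 + rng.normal(scale=0.1, size=n)  # true w=3, b=2

def gradient_descent(X, y, batch_size, lr=0.1, epochs=100):
    w, b = 0.0, 0.0
    for _ in range(epochs):
        idx = rng.permutation(len(X))              # shuffle each epoch
        for start in range(0, len(X), batch_size):
            batch = idx[start:start + batch_size]
            xb, yb = X[batch, 0], y[batch]
            err = (w * xb + b) - yb                # prediction error on the batch
            w -= lr * 2 * np.mean(err * xb)        # gradient of MSE w.r.t. w
            b -= lr * 2 * np.mean(err)             # gradient of MSE w.r.t. b
    return w, b

print(gradient_descent(X, y, batch_size=len(X)))   # batch GD: stable, smooth
print(gradient_descent(X, y, batch_size=1))        # SGD: noisy, frequent updates
print(gradient_descent(X, y, batch_size=32))       # mini-batch: a middle ground
```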