
UNIT-2

 Regression:

Regression analysis is a statistical method for modelling the relationship between a dependent (target) variable and one or more independent (predictor) variables. More specifically, regression analysis helps us understand how the value of the dependent variable changes with respect to one independent variable when the other independent variables are held fixed. It predicts continuous/real values such as temperature, age, salary, price, etc.

Some examples of regression are:

o Prediction of rain using temperature and other factors
o Determining market trends
o Prediction of road accidents due to rash driving.

 Types of Regression:
Linear Regression:
o Linear regression is a statistical regression method used for predictive analysis.
o It is one of the simplest and easiest algorithms; it works on regression and models the relationship between continuous variables.
o It is used for solving regression problems in machine learning.
o Linear regression shows the linear relationship between the independent variable (X-axis) and the dependent variable (Y-axis), hence the name linear regression.
o If there is only one input variable (x), it is called simple linear regression; if there is more than one input variable, it is called multiple linear regression.

Below is the mathematical equation for linear regression:

Y = aX + b

Here, Y = dependent variable (target variable),
X = independent variable (predictor variable),
a and b are the linear coefficients (slope and intercept).
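
As an illustrative sketch (not from the original notes; the data points below are invented), the coefficients a and b can be estimated by ordinary least squares, e.g. with NumPy:

    # A minimal sketch of simple linear regression (Y = aX + b) using NumPy.
    # The data below is made up for illustration.
    import numpy as np

    X = np.array([1.0, 2.0, 3.0, 4.0, 5.0])   # independent variable
    Y = np.array([2.1, 4.1, 6.2, 7.9, 10.1])  # dependent variable

    # np.polyfit with degree 1 returns the least-squares slope a and intercept b
    a, b = np.polyfit(X, Y, 1)
    print(f"Y = {a:.2f}*X + {b:.2f}")          # roughly Y = 1.98*X + 0.14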

Some popular applications of linear regression are:

o Analyzing trends and sales estimates
o Salary forecasting
o Real estate prediction
o Arriving at ETAs in traffic.

Logistic Regression:
o Logistic regression is another supervised learning algorithm, used to solve classification problems. In classification problems, the dependent variable is in a binary or discrete format, such as 0 or 1.
o The logistic regression algorithm works with categorical variables such as 0 or 1, Yes or No, True or False, Spam or Not Spam, etc.
o It is a predictive analysis algorithm that works on the concept of probability.
o Logistic regression is a type of regression, but it differs from the linear regression algorithm in how it is used.
o Logistic regression uses the sigmoid function (also called the logistic function) to map predicted values to probabilities. This sigmoid function is used to model the data in logistic regression. The function can be represented as:

f(x) = 1 / (1 + e^(-x))

o f(x) = output, between the values 0 and 1
o x = input to the function
o e = base of the natural logarithm.
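
A small sketch (not part of the original notes) of the sigmoid function in Python, showing how any real input is squashed into the (0, 1) range:

    # Minimal sketch of the sigmoid (logistic) function used by logistic regression.
    import math

    def sigmoid(x):
        """Map any real number into the open interval (0, 1)."""
        return 1.0 / (1.0 + math.exp(-x))

    print(sigmoid(-5))   # close to 0  -> class 0
    print(sigmoid(0))    # exactly 0.5 -> the decision boundary
    print(sigmoid(5))    # close to 1  -> class 1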

 Bayesian Learning:

Bayesian learning (BL) in machine learning is an approach that uses Bayesian probability theory to model data and make predictions about it.

Key aspects of Bayesian learning include probability distributions, Bayes' theorem, parameter estimation, decision making, model selection, and the ability to handle small datasets.
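
As a hedged illustration of Bayesian parameter estimation (the coin-flip counts and the Beta prior below are invented for the example), observed data updates a prior belief into a posterior:

    # Sketch of Bayesian parameter estimation: updating a Beta prior over a
    # coin's probability of heads after observing data (example values invented).
    prior_alpha, prior_beta = 1, 1   # Beta(1, 1) = uniform prior
    heads, tails = 7, 3              # observed data: 7 heads, 3 tails

    # Conjugate update: the posterior is Beta(alpha + heads, beta + tails)
    post_alpha = prior_alpha + heads
    post_beta = prior_beta + tails
    print(post_alpha / (post_alpha + post_beta))   # posterior mean = 8/12 ~ 0.67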

 Bayes Classifier:

A Bayes classifier is a type of probabilistic classifier that uses Bayes' theorem to make predictions. It computes the probability of a data point belonging to each class and then selects the class with the highest probability.

There are several types of Bayes classifiers, including Naïve Bayes and the Bayesian network.
 Bayes Theorem:

Bayes' theorem is also known as Bayes' rule or Bayes' law. It is used to determine the conditional probability of event A when event B has already happened. The general statement of Bayes' theorem is: "The conditional probability of an event A, given the occurrence of another event B, is equal to the product of the probability of B given A and the probability of A, divided by the probability of event B." i.e.

P(A|B) = P(B|A) P(A) / P(B)

where,

P(A) and P(B) are the probabilities of events A and B,

P(A|B) is the probability of event A when event B happens,

P(B|A) is the probability of event B when A happens.
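
As a quick sanity check (the probabilities below are invented for illustration), the formula can be evaluated directly:

    # Direct evaluation of Bayes' theorem with made-up probabilities.
    p_a = 0.3            # P(A)
    p_b_given_a = 0.8    # P(B|A)
    p_b = 0.5            # P(B)

    p_a_given_b = p_b_given_a * p_a / p_b
    print(p_a_given_b)   # 0.8 * 0.3 / 0.5 = 0.48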

 Naïve Bayes Classifier:

o The Naïve Bayes algorithm is a supervised learning algorithm, based on Bayes' theorem and used for solving classification problems.
o It is mainly used in text classification, which involves high-dimensional training datasets.
o The Naïve Bayes classifier is one of the simplest and most effective classification algorithms; it helps build fast machine learning models that can make quick predictions.
o It is a probabilistic classifier, which means it predicts on the basis of the probability of an object belonging to each class.
o Some popular applications of the Naïve Bayes algorithm are spam filtering, sentiment analysis, and classifying articles.

Working of Naïve Bayes' Classifier:

Working of Naïve Bayes' Classifier can be understood with the help of the
below example:

Suppose we have a dataset of weather conditions and a corresponding target variable "Play". Using this dataset, we need to decide whether we should play or not on a particular day, according to the weather conditions. To solve this problem, we follow the steps below:

1. Convert the given dataset into frequency tables.
2. Generate a likelihood table by finding the probabilities of the given features.
3. Use Bayes' theorem to calculate the posterior probability.

Problem: If the weather is sunny, should the player play or not?

Solution: To solve this, first consider the dataset below:

     Outlook    Play
0    Rainy      Yes
1    Sunny      Yes
2    Overcast   Yes
3    Overcast   Yes
4    Sunny      No
5    Rainy      Yes
6    Sunny      Yes
7    Overcast   Yes
8    Rainy      No
9    Sunny      No
10   Sunny      Yes
11   Rainy      No
12   Overcast   Yes
13   Overcast   Yes

Frequency table for the weather conditions:

Weather    Yes   No
Overcast   5     0
Rainy      2     2
Sunny      3     2
Total      10    4

Likelihood table for the weather conditions:

Weather    No            Yes           P(Weather)
Overcast   0             5             5/14 = 0.35
Rainy      2             2             4/14 = 0.29
Sunny      2             3             5/14 = 0.35
All        4/14 = 0.29   10/14 = 0.71

Applying Bayes' theorem:

P(Yes|Sunny) = P(Sunny|Yes) * P(Yes) / P(Sunny)

P(Sunny|Yes) = 3/10 = 0.3

P(Sunny) = 5/14 = 0.35

P(Yes) = 10/14 = 0.71

So, P(Yes|Sunny) = 0.3 * 0.71 / 0.35 = 0.60

P(No|Sunny) = P(Sunny|No) * P(No) / P(Sunny)

P(Sunny|No) = 2/4 = 0.5

P(No) = 4/14 = 0.29

P(Sunny) = 5/14 = 0.35

So, P(No|Sunny) = 0.5 * 0.29 / 0.35 = 0.41

As we can see from the above calculation, P(Yes|Sunny) > P(No|Sunny).

Hence, on a sunny day, the player can play the game.
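
The same posterior calculation can be reproduced in a few lines of Python (a sketch using only the counts from the tables above):

    # Reproducing the worked example: posteriors for Outlook = Sunny,
    # using the frequency counts from the tables above.
    p_sunny_given_yes = 3 / 10   # P(Sunny|Yes)
    p_sunny_given_no = 2 / 4     # P(Sunny|No)
    p_yes, p_no = 10 / 14, 4 / 14
    p_sunny = 5 / 14

    p_yes_given_sunny = p_sunny_given_yes * p_yes / p_sunny   # = 0.60
    p_no_given_sunny = p_sunny_given_no * p_no / p_sunny      # = 0.40 (0.41 with the rounded values above)
    print("Play" if p_yes_given_sunny > p_no_given_sunny else "Don't play")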

 Support Vector Machine:

The Support Vector Machine, or SVM, is one of the most popular supervised learning algorithms; it is used for classification as well as regression problems. However, it is primarily used for classification problems in machine learning.

The goal of the SVM algorithm is to create the best line or decision boundary that can segregate n-dimensional space into classes, so that new data points can easily be placed in the correct category in the future. This best decision boundary is called a hyperplane.

Types of support vector machines

Support vector machines have different types and variants that provide specific functionalities and address specific problem scenarios. Here are two types of SVMs and their significance:

1. Linear SVM. Linear SVMs use a linear kernel to create a straight-line decision boundary that separates different classes. They are effective when the data is linearly separable or when a linear approximation is sufficient. Linear SVMs are computationally efficient and have good interpretability, as the decision boundary is a hyperplane in the input feature space.
2. Nonlinear SVM. Nonlinear SVMs address scenarios where the data cannot be separated by a straight line in the input feature space. They achieve this by using kernel functions that implicitly map the data into a higher-dimensional feature space, where a linear decision boundary can be found. Popular kernel functions used in this type of SVM include the polynomial kernel, Gaussian (RBF) kernel and sigmoid kernel. Nonlinear SVMs can capture complex patterns and achieve higher classification accuracy than linear SVMs; a short sketch follows this list.
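
A brief sketch contrasting the two types (assuming scikit-learn is available; the toy dataset is generated for illustration, not taken from the notes):

    # Sketch: linear vs. RBF-kernel SVMs on a toy dataset (scikit-learn assumed).
    from sklearn.datasets import make_circles
    from sklearn.svm import SVC

    # Concentric circles are NOT linearly separable in the input space.
    X, y = make_circles(n_samples=200, factor=0.3, noise=0.05, random_state=0)

    linear_svm = SVC(kernel="linear").fit(X, y)
    rbf_svm = SVC(kernel="rbf", gamma=2.0).fit(X, y)

    print("linear accuracy:", linear_svm.score(X, y))  # poor: no separating line exists
    print("rbf accuracy:", rbf_svm.score(X, y))        # near 1.0 via the kernel trick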

 Kernel Function:
A kernel function is a method used to take data as input and transform it into the form required for processing.
The term "kernel" refers to a set of mathematical functions used in the Support Vector Machine that provide a window through which to manipulate the data.
A kernel function generally transforms the training data so that a nonlinear decision surface in the original space corresponds to a linear decision boundary in a higher-dimensional space. Essentially, it returns the inner product between two points in a suitable feature space.
Types of kernels:
Linear Kernel
A linear kernel is a type of kernel function used in machine learning, including in SVMs (support vector machines). It is the simplest and most commonly used kernel function, and it is defined as the dot product between the input vectors in the original feature space.

The linear kernel can be defined as:

K(x, y) = x · y

where x and y are the input feature vectors. The dot product of the input vectors is a measure of their similarity or distance in the original feature space.
Gaussian (RBF) Kernel
The Gaussian kernel, also known as the radial basis function (RBF) kernel, is a popular kernel function used in machine learning, particularly in SVMs. It is a nonlinear kernel function that maps the input data into a higher-dimensional feature space using a Gaussian function.

The Gaussian kernel can be defined as:

K(x, y) = exp(-gamma * ||x - y||^2)

where x and y are the input feature vectors, gamma is a parameter that controls the width of the Gaussian function, and ||x - y||^2 is the squared Euclidean distance between the input vectors.

Polynomial Kernel:
It represents the similarity of vectors in the training data in a feature space over polynomials of the original variables used in the kernel. It can be defined as:

K(x, y) = (x · y + c)^d

where c is a constant and d is the degree of the polynomial.
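
A minimal sketch of the three kernels above, implemented with NumPy (the parameter values and test vectors are chosen only for illustration):

    # Sketch: the three kernel functions above, implemented with NumPy.
    import numpy as np

    def linear_kernel(x, y):
        return np.dot(x, y)                           # K(x, y) = x . y

    def rbf_kernel(x, y, gamma=0.5):
        return np.exp(-gamma * np.sum((x - y) ** 2))  # K(x, y) = exp(-gamma*||x-y||^2)

    def polynomial_kernel(x, y, c=1.0, d=3):
        return (np.dot(x, y) + c) ** d                # K(x, y) = (x . y + c)^d

    x = np.array([1.0, 2.0])
    y = np.array([2.0, 1.0])
    print(linear_kernel(x, y), rbf_kernel(x, y), polynomial_kernel(x, y))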
Advantages of SVMs
SVMs are powerful machine learning algorithms that have the following advantages:

 Effective in high-dimensional spaces. High-dimensional data refers to data in which the number of features is larger than the number of observations, i.e., data points. SVMs perform well even when the number of features is larger than the number of samples. They can handle high-dimensional data efficiently, making them suitable for applications with a large number of features.
 Resistant to overfitting. SVMs are less prone to overfitting compared to
other algorithms, like decision trees -- overfitting is where a model
performs extremely well on the training data but becomes too specific
to that data and can't generalize to new data. SVMs' use of the margin
maximization principle helps in generalizing well to unseen data.
 Versatile. SVMs can be applied to both classification and regression
problems. They support different kernel functions, enabling flexibility in
capturing complex relationships in the data. This versatility makes SVMs
applicable to a wide range of tasks.
 Effective in cases of limited data. SVMs can work well even when the
training data set is small. The use of support vectors ensures that only a
subset of data points influences the decision boundary, which can be
beneficial when data is limited.
 Ability to handle nonlinear data. SVMs can implicitly handle non-
linearly separable data by using kernel functions. The kernel trick
enables SVMs to transform the input space into a higher-dimensional
feature space, making it possible to find linear decision boundaries.
Disadvantages of SVMs
While support vector machines are popular for the reasons listed above, they also
come with some limitations and potential issues:

 Computationally intensive. SVMs can be computationally expensive, especially when dealing with large data sets. The training time and memory requirements increase significantly with the number of training samples.
 Sensitive to parameter tuning. SVMs have parameters such as the
regularization parameter and the choice of kernel function. The
performance of SVMs can be sensitive to these parameter settings.
Improper tuning can lead to suboptimal results or longer training times.
 Lack of probabilistic outputs. SVMs provide binary classification outputs
and do not directly estimate class probabilities. Additional techniques,
such as Platt scaling or cross-validation, are needed to obtain probability
estimates.
 Difficulty in interpreting complex models. SVMs can create complex
decision boundaries, especially when using nonlinear kernels. This
complexity may make it challenging to interpret the model and
understand the underlying patterns in the data.
 Scalability issues. SVMs may face scalability issues when applied to
extremely large data sets. Training an SVM on millions of samples can
become impractical due to memory and computational constraints.

 Bayesian Belief Network:

A Bayesian Belief Network (BBN), also known as a Bayesian network, is a type of probabilistic graphical model: a graphical representation of the probabilistic relationships among a set of variables.

BBNs are used in machine learning and artificial intelligence to model and reason about uncertainty, make predictions, and perform probabilistic inference. They are particularly useful in situations where there is uncertainty or incomplete information about a system.

A Bayesian network can be used for building models from data and experts' opinions, and it consists of two parts:

o A directed acyclic graph (DAG)
o Tables of conditional probabilities.
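
As a hedged sketch (the tiny network and all numbers below are invented), these two parts combine to define a joint distribution over the variables:

    # Sketch of a tiny Bayesian network: Rain -> WetGrass (all numbers invented).
    # DAG: a single edge, Rain -> WetGrass.
    # CPTs: P(Rain) and P(WetGrass | Rain).
    p_rain = 0.2
    p_wet_given_rain = {True: 0.9, False: 0.1}

    # The joint probability factorizes along the DAG:
    # P(Rain, WetGrass) = P(Rain) * P(WetGrass | Rain)
    p_rain_and_wet = p_rain * p_wet_given_rain[True]

    # Inference by marginalization: P(WetGrass = True)
    p_wet = p_rain * p_wet_given_rain[True] + (1 - p_rain) * p_wet_given_rain[False]

    # Bayes' theorem answers the diagnostic query P(Rain | WetGrass = True)
    print(p_rain_and_wet / p_wet)   # 0.18 / 0.26 ~ 0.69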

 EM Algorithm:
The EM (Expectation-Maximization) algorithm is an iterative approach for estimating model parameters in the presence of missing or latent variables; related ideas underlie several unsupervised ML methods, such as the k-means clustering algorithm. Each iteration alternates between two steps. In the first step, we estimate the missing or latent variables; hence it is referred to as the expectation/estimation step (E-step). The other step optimizes the parameters of the model so that it can explain the data more clearly; it is known as the maximization step (M-step).
o Expectation step (E-step): It involves the estimation (guess) of all missing values in the dataset, so that after completing this step there are no missing values.
o Maximization step (M-step): This step uses the data estimated in the E-step to update the parameters.
o Repeat the E-step and M-step until the values converge.

The primary goal of the EM algorithm is to use the available observed data of the dataset to estimate the missing data of the latent variables, and then use that data to update the values of the parameters in the M-step.

Steps in EM Algorithm:
The EM algorithm is completed mainly in 4 steps: the initialization step, expectation step, maximization step, and convergence step. These steps are explained as follows (a code sketch follows the list):
o 1st step: The very first step is to initialize the parameter values. The system is provided with incomplete observed data, with the assumption that the data is obtained from a specific model.
o 2nd step: This step is known as the expectation or E-step. It is used to estimate or guess the values of the missing or incomplete data using the observed data. The E-step primarily updates the latent variables.
o 3rd step: This step is known as the maximization or M-step, where we use the complete data obtained in the 2nd step to update the parameter values. The M-step primarily updates the hypothesis.
o 4th step: The last step is to check whether the values of the latent variables are converging. If they are, stop the process; otherwise, repeat from step 2 until convergence occurs.
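
A compact sketch of these four steps for a one-dimensional, two-component Gaussian mixture (the data and initial values are invented, and fixed unit variances are assumed to keep the sketch short):

    # Sketch: EM for a 1-D mixture of two Gaussians (fixed unit variances assumed).
    import numpy as np

    rng = np.random.default_rng(0)
    data = np.concatenate([rng.normal(0, 1, 100), rng.normal(5, 1, 100)])

    mu = np.array([-1.0, 1.0])   # 1st step: initialize the parameters
    pi = np.array([0.5, 0.5])    # mixing weights

    def gaussian(x, mean):
        return np.exp(-0.5 * (x - mean) ** 2) / np.sqrt(2 * np.pi)

    for _ in range(50):          # 4th step simplified: a fixed number of iterations
        # 2nd step (E-step): responsibilities = posteriors of the latent component
        resp = pi * gaussian(data[:, None], mu)       # shape (n, 2)
        resp /= resp.sum(axis=1, keepdims=True)

        # 3rd step (M-step): update the parameters from the expected assignments
        nk = resp.sum(axis=0)
        mu = (resp * data[:, None]).sum(axis=0) / nk
        pi = nk / len(data)

    print(mu)   # close to the true means 0 and 5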

 Applications of EM algorithm:
The primary aim of the EM algorithm is to estimate the missing data of the latent variables through the observed data in datasets. The EM algorithm, or latent variable model, has a broad range of real-life applications in machine learning. These are as follows:

o The EM algorithm is applicable to data clustering in machine learning.
o It is often used in computer vision and NLP (natural language processing).
o It is used to estimate parameters in mixed models such as the Gaussian mixture model, and in quantitative genetics.
o It is also used in psychometrics for estimating the item parameters and latent abilities of item response theory models.
o It is also applicable in the medical and healthcare industry, for example in image reconstruction, and in structural engineering.
o It is used to estimate the parameters of a Gaussian density function.

Advantages of EM algorithm:

o The two basic steps of the EM algorithm, the E-step and the M-step, are very easy to implement for many machine learning problems.
o The likelihood is guaranteed not to decrease after each iteration.
o The M-step often has a closed-form solution.

Disadvantages of EM algorithm:

o The convergence of the EM algorithm can be very slow.
o It may converge only to a local optimum.
o It takes both forward and backward probabilities into consideration, in contrast to numerical optimization, which takes only forward probabilities.
