Naive Bayes Classifiers - Part A
• Assume that there is some underlying random process that generates the values of the variables according to a well-defined but unknown probability distribution.
• Since X is known for a particular instance but Y may not be, we are particularly interested in the conditional probabilities P(Y|X).
Bayes’ Rule
For instance, Y could indicate whether the e-mail is spam, and X could indicate
whether the e-mail contains the words ‘Money’ and ‘lottery’.
'Money'   'lottery'   P(Y = spam | X)   P(Y = not spam | X)
   0          0             0.31               0.69
   0          1             0.65               0.35
   1          0             0.80               0.20
   1          1             0.40               0.60
A Loan Application Dataset
Bayes Theorem
Bayes' theorem provides a way to calculate the probability of a hypothesis given our prior knowledge.
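Written in the spam/"money" notation used in the definitions below, the theorem reads:

P(spam | money) = P(money | spam) · P(spam) / P(money)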
Prior Probability is the probability of an event before new data is collected, e.g. P(spam) is the probability that a mail is spam before any new mail is seen.
Marginal Likelihood, also called the evidence, is the probability of the evidence event occurring, e.g. P(money) is the probability that a mail includes "money" in its text.
Likelihood is the probability of the evidence given that the event is true, e.g. P(money|spam) is the probability that a mail includes "money" given that the mail is spam.
Posterior Probability is the probability of an outcome after the evidence has been incorporated, e.g. P(spam|money) is the probability that a mail is spam given that it includes "money" in its text.
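As a small worked sketch of how these four quantities combine (the numbers below are made up for illustration and are not taken from the slides):

# hypothetical probabilities, chosen only to illustrate Bayes' theorem
p_spam = 0.3                  # prior: P(spam)
p_money_given_spam = 0.6      # likelihood: P(money | spam)
p_money_given_not_spam = 0.1  # P(money | not spam)

# marginal likelihood (evidence) via the law of total probability
p_money = p_money_given_spam * p_spam + p_money_given_not_spam * (1 - p_spam)

# posterior via Bayes' theorem
p_spam_given_money = p_money_given_spam * p_spam / p_money
print(p_money, p_spam_given_money)  # ≈ 0.25 and ≈ 0.72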
Probabilistic Models
Bayes’ Rule
• P(Y|X) is the posterior probability because it is used after the features X are
observed.
• P(Y) is the prior probability, which in the case of classification tells how likely
each of the classes is a priori, i.e., before we have observed the data X.
• P(X) is the probability of the data, which is independent of Y and in most cases
can be ignored.
• P(X|Y) is the likelihood function.
• Posterior probabilities and likelihoods can easily be transformed into one another using Bayes' rule.
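Written out for three evidence variables X1, X2 and X3, in the same notation, Bayes' rule becomes:

P(Y | X1, X2, X3) = P(X1, X2, X3 | Y) · P(Y) / P(X1, X2, X3)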
The above equation shows only the case where we have 3 evidence variables, and even with only 3 of them it is not easy to find training instances that exactly match every combination of their values, so estimating the joint likelihood P(X1, X2, X3 | Y) directly quickly becomes impractical.
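This is where the "naive" assumption comes in: the evidence variables are treated as conditionally independent given the class, so the joint likelihood factorizes (shown here for the three-variable case; this step is standard Naive Bayes reasoning rather than something stated explicitly on this slide):

P(X1, X2, X3 | Y) ≈ P(X1 | Y) · P(X2 | Y) · P(X3 | Y)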
Probabilistic Models
• maximum a posteriori (MAP) decision rule
The maximum a posteriori (MAP) hypothesis is the one with the highest posterior probability: after calculating the posterior probability for several hypotheses, we select the hypothesis with the highest probability.
Example: If P(spam|money) > P(not spam|money), then the mail can be classified as spam. This is the most probable hypothesis.
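In symbols, and dropping P(X) because it does not depend on the class (as noted above), the MAP rule is:

y_MAP = argmax_y P(y | X) = argmax_y P(X | y) · P(y)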
• Naive Bayes models are great baseline models and are often used on very large datasets, where training even a linear model might take too long.
Machine Learning using Naïve Bayes (GaussianNB)
import matplotlib.pyplot as plt
from sklearn.naive_bayes import GaussianNB
from sklearn.model_selection import train_test_split
from sklearn.datasets import load_iris
from sklearn import metrics
from sklearn.metrics import classification_report, confusion_matrix

# create X (features) and y (response)
iris = load_iris()
X = iris.data
y = iris.target
print(X.shape, y.shape)  # (150, 4) (150,)

# split into train & test datasets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=15)
print(X_train.shape, X_test.shape)  # (120, 4) (30, 4)

# define and fit the model
model = GaussianNB()
model.fit(X_train, y_train)

# make a probabilistic prediction and a classification prediction
yhat_prob = model.predict_proba(X_test)
y_pred = model.predict(X_test)
print(metrics.accuracy_score(y_test, y_pred))  # 1.0

# plot the confusion matrix
cm = confusion_matrix(y_test, y_pred)
fig, ax = plt.subplots(figsize=(8, 8))
ax.imshow(cm)
ax.grid(False)
ax.xaxis.set(ticks=(0, 1, 2), ticklabels=('Predicted 0s', 'Predicted 1s', 'Predicted 2s'))
ax.yaxis.set(ticks=(0, 1, 2), ticklabels=('Actual 0s', 'Actual 1s', 'Actual 2s'))
ax.set_ylim(2.5, -0.5)
for i in range(3):
    for j in range(3):
        ax.text(j, i, cm[i, j], ha='center', va='center', color='red')
plt.show()

print(classification_report(y_test, y_pred))
Machine Learning using Naïve Bayes (GaussianNB)
precision recall f1-score support
0 1.00 1.00 1.00 8
1 1.00 1.00 1.00 13
2 1.00 1.00 1.00 9
accuracy 1.00 30
macro avg 1.00 1.00 1.00 30
weighted avg 1.00 1.00 1.00 30
Naive Bayes classifier for multinomial models
The multinomial Naive Bayes classifier is suitable for classification with discrete features, such as word counts for text classification. It normally requires integer feature counts, such as those produced by a bag-of-words representation of text, although in practice fractional counts such as tf-idf features also work.
For this example, we use the "Twenty Newsgroups" dataset, a collection of approximately 20,000 newsgroup documents partitioned evenly across 20 different newsgroups.
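The slides stop before the code for this example; below is a minimal sketch with scikit-learn, in the same style as the GaussianNB example above. The choice of categories and of a plain bag-of-words representation are assumptions made for the sketch, not taken from the slides.

from sklearn.datasets import fetch_20newsgroups
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.metrics import accuracy_score

# restrict to a few categories to keep the example small (assumed choice)
categories = ['comp.graphics', 'rec.autos', 'sci.space', 'talk.politics.misc']
train = fetch_20newsgroups(subset='train', categories=categories)
test = fetch_20newsgroups(subset='test', categories=categories)

# bag-of-words counts, the kind of integer features the multinomial model expects
vectorizer = CountVectorizer()
X_train = vectorizer.fit_transform(train.data)
X_test = vectorizer.transform(test.data)

# fit the multinomial Naive Bayes model and evaluate on the test split
model = MultinomialNB()
model.fit(X_train, train.target)
y_pred = model.predict(X_test)
print(accuracy_score(test.target, y_pred))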