
The Bayes Optimal Classifier

Machine Learning
Most probable classification

• In Bayesian learning, the primary question is: What is the most probable hypothesis, given the data?

• We can also ask: For a new test point, what is the most probable label, given the training data?

• Is this the same as the prediction of the maximum a posteriori (MAP) hypothesis?
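
For reference, the MAP hypothesis in the first bullet can be written in the standard Bayes-rule form (the evidence term P(D) is dropped because it does not affect the arg max):

  \[
    h_{\text{MAP}} = \arg\max_{h \in H} P(h \mid D) = \arg\max_{h \in H} P(D \mid h)\, P(h)
  \]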
Most probable classification

Suppose our hypothesis space H has three functions h1, h2, and h3:

• P(h1 | D) = 0.4, P(h2 | D) = 0.3, P(h3 | D) = 0.3

• What is the MAP hypothesis? h1

• For a new instance x, suppose h1(x) = +1, h2(x) = -1, and h3(x) = -1

• What is the most probable classification of x? -1, because

  P(+1 | x) = 0.4    P(-1 | x) = 0.3 + 0.3 = 0.6

The most probable classification is not the same as the prediction of the MAP hypothesis.
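To make the arithmetic concrete, here is a minimal Python sketch of the comparison (variable names are illustrative; the posteriors and per-hypothesis predictions are the ones from the slide above):

  # Posterior probabilities P(h | D) and each hypothesis's prediction h(x),
  # copied from the example above.
  posteriors = {"h1": 0.4, "h2": 0.3, "h3": 0.3}
  predictions = {"h1": +1, "h2": -1, "h3": -1}

  # MAP route: commit to the single most probable hypothesis, use its label.
  map_h = max(posteriors, key=posteriors.get)      # h1
  map_label = predictions[map_h]                   # +1

  # Bayes optimal route: every hypothesis votes, weighted by its posterior.
  votes = {}
  for h, post in posteriors.items():
      label = predictions[h]
      votes[label] = votes.get(label, 0.0) + post  # ends as {+1: 0.4, -1: 0.6}
  bayes_label = max(votes, key=votes.get)          # -1

  print(map_label, bayes_label)                    # prints "1 -1": they disagree

Even though h1 is individually the most probable hypothesis, the posterior mass behind -1 (0.6) outweighs the mass behind +1 (0.4), so the weighted vote flips the label.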
Bayes Optimal Classifier

• How should we use the general formalism?
  – What should H be?

H can be a collection of functions:
• Given the training data, choose an optimal function
• Then, given new data, evaluate the selected function on it

H can be a collection of possible predictions:
• Given the data, try to directly choose the optimal prediction

These two could be different! Selecting a function vs. entertaining all options until the last minute.
Bayes Optimal Classification

Defined as the most probable classification of the new instance: combine the predictions of all hypotheses, each weighted by its posterior probability, and pick the label with the largest weighted vote (see the formula below).

Computing this can be hopelessly inefficient.

And yet it is an interesting theoretical concept, because no other classification method using the same hypothesis space and prior knowledge can outperform it on average.
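
In symbols, this is the standard formulation (V here denotes the set of candidate labels; the notation is assumed, not taken from the slides):

  \[
    v^{*} = \arg\max_{v \in V} \sum_{h \in H} P(v \mid h)\, P(h \mid D)
  \]

For the running example: the vote for +1 is P(h1 | D) = 0.4, the vote for -1 is P(h2 | D) + P(h3 | D) = 0.6, so v* = -1, matching the earlier slide.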
