Lec03 Classifiers KNN+DT

The document outlines foundational concepts in machine learning, focusing on classification methods such as k-Nearest Neighbors (k-NN) and Decision Trees. It discusses the mechanics of k-NN, including the importance of selecting the right 'k' value, the concept of distance-weighted neighbors, and improvements for efficiency. Additionally, it covers decision trees, their structure, impurity measures, and strategies for avoiding overfitting through pruning techniques.


Foundations of Machine Learning

Classifiers
Aug 2024

Vineeth N Balasubramanian
Classification Methods
• k-Nearest Neighbors
• Decision Trees
• Naïve Bayes
• Support Vector Machines
• Logistic Regression
• Neural Networks
• Ensemble Methods (Boosting, Random Forests)

k-Nearest Neighbors
• Basic idea:
• If it walks like a duck, quacks like a duck, then it’s probably a duck

[Figure: training records → compute the distance to the test record → choose the k "nearest" records]

k-Nearest Neighbors
• Majority vote within the k nearest neighbors

[Figure: a new point is classified as blue with k = 1 and as green with k = 3]

k-Nearest Neighbors
• Choosing k is important
• If k is too small, sensitive to noise points
• If k is too large, neighborhood may include points from other classes

[Figure: (a) 1-nearest, (b) 2-nearest, and (c) 3-nearest neighborhoods around a query point x]

k-Nearest Neighbors
• An arbitrary instance is represented by $(a_1(x), a_2(x), \ldots, a_n(x))$
• $a_i(x)$ denotes the $i$-th feature of instance $x$
• Euclidean distance between two instances:
$$d(x_i, x_j) = \sqrt{\sum_{r=1}^{n} \left(a_r(x_i) - a_r(x_j)\right)^2}$$
• In case of a continuous-valued target function
• Predict the mean value of the k nearest training examples
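To make the distance computation and the two prediction rules concrete, here is a minimal NumPy sketch (function and variable names are illustrative, not from the lecture): Euclidean distance, majority vote for classification, mean of the neighbors for a continuous-valued target.

import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_query, k=3, regression=False):
    """Predict for x_query from its k nearest training instances."""
    # d(x_query, x_i) = sqrt(sum_r (a_r(x_query) - a_r(x_i))^2) for every training instance
    dists = np.sqrt(((X_train - x_query) ** 2).sum(axis=1))
    nearest = np.argsort(dists)[:k]                 # indices of the k closest records
    if regression:
        return y_train[nearest].mean()              # continuous target: mean of the k neighbors
    return Counter(y_train[nearest]).most_common(1)[0][0]  # majority vote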

How to determine k
• Determined experimentally
• Start with k = 1 and use a held-out validation set to estimate the error rate of the classifier
• Repeat with k=k+2
• Choose the value of k for which the error rate is minimum
• Note: k typically an odd number to avoid ties in binary
classification
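A sketch of this selection loop, assuming the illustrative knn_predict function from the earlier sketch and a held-out validation split (names are mine, not the lecture's):

import numpy as np

def choose_k(X_train, y_train, X_val, y_val, max_k=15):
    """Try k = 1, 3, 5, ... and keep the k with the lowest validation error."""
    best_k, best_err = 1, float("inf")
    for k in range(1, max_k + 1, 2):        # odd k to avoid ties in binary classification
        preds = np.array([knn_predict(X_train, y_train, x, k=k) for x in X_val])
        err = np.mean(preds != y_val)       # validation error rate for this k
        if err < best_err:
            best_k, best_err = k, err
    return best_k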

k-Nearest Neighbors
• Eager Learning (Induction)
• Explicit description of target function on the whole training set
• Instance-based Learning (Transduction)
• Learning=storing all training instances
• Classification=assigning target function to a new instance
• Referred to as “Lazy” learning
Similar Keywords: K-Nearest Neighbors, Memory-Based Reasoning,
Example-Based Reasoning, Instance-Based Learning, Case-Based
Reasoning, Lazy Learning

Voronoi Diagram

The 1-NN decision surface is formed by the training examples: each Voronoi cell is labelled with the class of its training point.

Improvements
• Distance-Weighted Nearest Neighbors
• Assign weights to the neighbors based on their distance from the query point (e.g., the weight may be the inverse square of the distance)
• Scaling (normalization) attributes for fair computation of distances
• Measure “closeness” differently
• Finding “close” examples in a large training set quickly
• E.g. Efficient memory indexing using kd-trees
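As a hedged illustration of both ideas, the sketch below combines inverse-square distance weighting with a kd-tree lookup (here scipy.spatial.KDTree, one common implementation; the lecture does not prescribe a particular library, and the function name is illustrative):

import numpy as np
from scipy.spatial import KDTree   # kd-tree index for fast nearest-neighbor queries

def weighted_knn_predict(tree, y_train, x_query, k=5, eps=1e-12):
    """Distance-weighted k-NN vote: closer neighbors get larger (inverse-square) weights."""
    dists, idx = tree.query(x_query, k=k)        # k nearest neighbors and their distances
    weights = 1.0 / (dists ** 2 + eps)           # eps guards against division by zero
    votes = {}
    for w, i in zip(weights, idx):
        votes[y_train[i]] = votes.get(y_train[i], 0.0) + w
    return max(votes, key=votes.get)             # class with the largest total weight

# Build the index once over scaled/normalized training features, then query many times:
# tree = KDTree(X_train); weighted_knn_predict(tree, y_train, x_new)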

k-NN: Summary
• Pros
• Highly effective inductive inference method for noisy training data and complex target
functions
• Target function for a whole space may be described as a combination of less complex
local approximations
• Trains very fast (“Lazy” learner)
• Cons
• Curse of dimensionality
• In high dimensions, nearly all of the volume of the unit hypersphere lies close to its surface (the inside is almost empty!), so points tend to be nearly equidistant. Check: http://www.cs.cmu.edu/~venkatg/teaching/CStheory-infoage/chap1-high-dim-space.pdf
• Storage: all training examples are saved in memory
• A decision tree or linear classifier is much smaller
• Slow at query time
• Can be mitigated by presorting and indexing the training samples (e.g., with kd-trees)

Convergence of 1-NN
[Figure: class posteriors P(Y|x) around a query point x, its nearest neighbor, and the feature axes x1, x2]

As the training set grows, the nearest neighbor of x lies arbitrarily close to x, so its label $y_1$ is effectively drawn from the same posterior $\Pr(Y \mid x)$ as the true label y. Let $y^* = \arg\max_y \Pr(y \mid x)$ be the Bayes-optimal prediction. Then

$$
\begin{aligned}
P(\text{1-NN error}) &= 1 - \Pr(y = y_1) = 1 - \sum_{y'} \Pr(Y = y' \mid x)^2 \\
&= 1 - \Pr(y^* \mid x)^2 - \sum_{y' \neq y^*} \Pr(Y = y' \mid x)^2 \\
&\le 2\bigl(1 - \Pr(y^* \mid x)\bigr) = 2 \times (\text{Bayes optimal error rate})
\end{aligned}
$$

Convergence of 1-NN
It is possible to show that, as the size of the training data set approaches infinity, the one-nearest-neighbor classifier guarantees an error rate no worse than twice the Bayes error rate (the minimum achievable error rate given the distribution of the data), as derived above. We will see this later.

Non-parametric Density Estimation using kNNs
• k-Nearest Neighbor estimator
• Instead of fixing the bin width h and counting the number of instances, fix the number of instances (neighbors) k and check the bin width:
$$\hat{p}(x) = \frac{k}{2 N d_k(x)}$$
where $d_k(x)$ is the distance to the k-th closest instance to x
• More on this later, when we move to unsupervised learning
Source: Ethem Alpaydin, Introduction to Machine Learning, 3rd Edition (Slides)
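A minimal 1-D sketch of this estimator, directly following the formula above (the function name is illustrative):

import numpy as np

def knn_density_1d(x, data, k=5):
    """k-NN density estimate: p_hat(x) = k / (2 * N * d_k(x)) for 1-D data."""
    data = np.asarray(data, dtype=float)
    N = len(data)
    d_k = np.sort(np.abs(data - x))[k - 1]   # distance to the k-th closest instance to x
    return k / (2.0 * N * d_k)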

Classification Methods
• k-Nearest Neighbors
• Decision Trees
• Naïve Bayes
• Support Vector Machines
• Logistic Regression
• Neural Networks
• Ensemble Methods (Boosting, Random Forests)

Decision Trees
• An efficient nonparametric method
• A hierarchical model
• Divide-and-conquer strategy
[Figure: an example decision tree with internal decision nodes and leaf nodes]
Source: Ethem Alpaydin, Introduction to Machine Learning, 3rd Edition (Slides)

Divide and Conquer
• Internal decision nodes
• Univariate: Uses a single attribute, xi
• Numeric xi :
• Binary split : xi > wm
• Discrete xi :
• n-way split for n possible values
• Multivariate: Uses more than one attribute, x
• Leaves
• Classification: Class labels, or proportions
• Regression: Numeric; the average of the r values at the leaf, or a local fit
• Learning is greedy; find the best split recursively
Source: Ethem Alpaydin, Introduction to Machine Learning, 3rd Edition (Slides)

Classification Trees (C4.5, J48)
• For node m, $N_m$ instances reach m, and $N_m^i$ of them belong to class $C_i$:
$$\hat{P}(C_i \mid x, m) \equiv p_m^i = \frac{N_m^i}{N_m}$$
• Node m is pure if $p_m^i$ is 0 or 1
• A measure of impurity is entropy:
$$I_m = -\sum_{i=1}^{K} p_m^i \log_2 p_m^i$$
[Figure: entropy of a 2-class problem as a function of p]

Entropy in information theory specifies the average (expected) amount of information derived from observing an event.
Source: Ethem Alpaydin, Introduction to Machine Learning, 3rd Edition (Slides)
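A small sketch of the entropy impurity computed from per-class counts at a node (illustrative names, assuming NumPy):

import numpy as np

def entropy_impurity(counts):
    """I_m = -sum_i p_m^i * log2(p_m^i), computed from per-class counts at node m."""
    counts = np.asarray(counts, dtype=float)
    p = counts / counts.sum()
    p = p[p > 0]                              # convention: 0 * log2(0) = 0
    return -(p * np.log2(p)).sum()

# A pure node has impurity 0; a 50/50 two-class node has impurity 1 bit:
# entropy_impurity([10, 0]) -> 0.0, entropy_impurity([5, 5]) -> 1.0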

Classification Trees
• If node m is pure, generate a leaf and stop, otherwise split and continue
recursively
• Impurity after the split: $N_{mj}$ of the $N_m$ instances take branch j, and $N_{mj}^i$ of them belong to class $C_i$:
$$\hat{P}(C_i \mid x, m, j) \equiv p_{mj}^i = \frac{N_{mj}^i}{N_{mj}}$$
$$I'_m = -\sum_{j=1}^{n} \frac{N_{mj}}{N_m} \sum_{i=1}^{K} p_{mj}^i \log_2 p_{mj}^i$$
• Information Gain: the expected reduction in the impurity measure after the split, $\Delta I = I_m - I'_m$


• Choose the best attribute(s) (with maximum information gain) to split
the remaining instances and make that attribute a decision node
• You can use the same logic to find the best splitting value too (see the sketch below)
Source: Ethem Alpaydin, Introduction to Machine Learning, 3rd Edition (Slides)
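The gain computation, sketched on top of the entropy_impurity helper from the earlier sketch (again illustrative, not the lecture's code):

import numpy as np

def information_gain(parent_counts, child_counts):
    """Gain = I_m - I'_m, where I'_m is the branch-size-weighted impurity of the children.
    parent_counts: per-class counts at node m; child_counts: one per-class count vector
    for each branch j of the candidate split."""
    N_m = float(np.sum(parent_counts))
    weighted_children = sum(
        (np.sum(c) / N_m) * entropy_impurity(c) for c in child_counts
    )
    return entropy_impurity(parent_counts) - weighted_children

# Splitting a [5, 5] node into branches [5, 1] and [0, 4] gives a gain of about 0.61 bits.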

Other Measures of Impurity
• The properties of a function $\phi$ measuring the impurity of a split:
• $\phi(1/2, 1/2) \ge \phi(p, 1-p)$, for any $p \in [0, 1]$
• $\phi(0, 1) = \phi(1, 0) = 0$
• $\phi(p, 1-p)$ is increasing in p on $[0, 1/2]$ and decreasing in p on $[1/2, 1]$
• Examples (other than entropy)
• Gini impurity/index: $\phi(p, 1-p) = 2p(1-p)$, i.e., $1 - \sum_{i=1}^{K} (p_m^i)^2$ in the K-class case
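For comparison with entropy, a matching sketch of the Gini index from per-class counts (illustrative):

import numpy as np

def gini_impurity(counts):
    """Gini index: 1 - sum_i (p_m^i)^2, computed from per-class counts at a node."""
    counts = np.asarray(counts, dtype=float)
    p = counts / counts.sum()
    return 1.0 - (p ** 2).sum()

# gini_impurity([5, 5]) -> 0.5, gini_impurity([10, 0]) -> 0.0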

Decision Trees: Example

[Figure: worked example of growing a decision tree, shown over three slides of figures]
Overfitting and Generalization
• Overfitting can occur with noisy training examples, and also when only a small number of examples is associated with a leaf node. How can this be handled?
• Pruning: Remove subtrees for better generalization
(decrease variance)
• Prepruning: Early stopping
• Postpruning: Grow the whole tree then prune subtrees which overfit
on the pruning set
• Prepruning is faster, postpruning is more accurate
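The lecture does not tie pruning to any particular library; as one concrete illustration of postpruning, the sketch below uses scikit-learn's cost-complexity pruning (the ccp_alpha parameter of DecisionTreeClassifier) and picks the pruning strength on a held-out pruning set:

from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

def prune_by_validation(X, y, alphas=(0.0, 0.001, 0.01, 0.1)):
    """Grow trees with increasing pruning strength and keep the one that
    generalizes best to a held-out pruning set."""
    X_tr, X_val, y_tr, y_val = train_test_split(X, y, test_size=0.3, random_state=0)
    best_score, best_tree = -1.0, None
    for a in alphas:                                   # larger alpha => more subtrees pruned
        tree = DecisionTreeClassifier(ccp_alpha=a, random_state=0).fit(X_tr, y_tr)
        score = tree.score(X_val, y_val)               # accuracy on the pruning set
        if score > best_score:
            best_score, best_tree = score, tree
    return best_tree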

Overfitting and Generalization
• Occam’s Razor principle: when multiple hypotheses can
solve a problem, choose the simplest one
• a short hypothesis that fits the data is unlikely to be a coincidence
• a long hypothesis that fits the data might be a coincidence

• How to select “best” tree:


• Measure performance over training data
• Measure performance over separate validation data set
• Minimum Description Length: Minimize size(tree) +
size(misclassifications(tree))

Rule Extraction from Trees
• Convert tree to equivalent set
of rules
• Prune each rule independently of the others, by removing any preconditions whose removal improves the rule's estimated accuracy
• Sort final rules into desired
sequence for use
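A toy illustration of the idea (the attributes, values, and rules below are hypothetical, not taken from the lecture): each root-to-leaf path becomes one IF-THEN rule, and each rule's preconditions can then be pruned independently.

# Hypothetical rules read off a small two-level tree on (outlook, humidity):
rules = [
    {"if": {"outlook": "sunny", "humidity": "high"},   "then": "no"},
    {"if": {"outlook": "sunny", "humidity": "normal"}, "then": "yes"},
    {"if": {"outlook": "overcast"},                    "then": "yes"},
]

def apply_rules(example, rules, default="yes"):
    """Return the conclusion of the first rule whose preconditions all hold."""
    for rule in rules:
        if all(example.get(attr) == val for attr, val in rule["if"].items()):
            return rule["then"]
    return default

# apply_rules({"outlook": "sunny", "humidity": "high"}, rules) -> "no"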

Multivariate Trees

Readings
• Chapters 8 and 9, Ethem Alpaydin, Introduction to Machine Learning, 2nd Edition

