Lecture 5

Classification: principles and examples

Terminology

✓ Classification (En) = « classification » (Fr)


✓ Clustering (En) = « partitionnement (de données) » (Fr)
✓ A category or label in a classification problem is called a class
✓ Data points are called samples



Basics of Classification
- Classification is assigning labels to data
- Classification is usually done with Supervised Learning (SL): training a model
with examples (the training set) and applying the trained model to unseen
data.
- A great diversity of techniques can be employed: Neural Networks, k-NN, Support Vector Machines, decision trees, …
[Figures from [AG20]; geometric problem: segmenting]
A feature is a property or characteristic of a sample that serves as an input to the algorithm. Selecting good features, ones having genuine predictive ability, is crucial!
Classification: a labeled training set for spam evaluation.
What are some possible features for spam classification?
Classification with supervised learning
[Diagram: training examples → learning algorithm → model; the trained model is applied to classify unseen data and outputs the class of the data.]
The learning algorithm covered in this lecture: K-Nearest Neighbors (k-NN).



Multi-label classification

Multi-label classification: predicting classes which are not mutually exclusive.

This lecture does not cover multi-label classification; it focuses on classification between mutually exclusive classes.



K-Nearest Neighbours (K-NN): principles
[Figure: samples of coins used to “train” a vending machine]
- Principle: classify observations by assigning them to the same category as their similar / “nearest” neighbors.
- Supervised learning: uses training data that has already been classified into categories.
- K-NN identifies the k samples in the training set that are the “nearest” to the unseen data point, then assigns it to the category that is most frequent among these k neighbors.
- Similarity of data is (usually) measured by the distance between the data points in the feature space. Each “feature” is a coordinate: n features → an n-dimensional space.
- Parameters of K-NN to be chosen by the user:
  - the set of features to be considered
  - k: the number of nearest neighbors to be used
Question: what will be the result of the classification if k is set equal to the size of the training set?
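Below is a minimal sketch of this procedure in plain Python (illustrative only: the function name, the toy data and the choice of Euclidean distance are assumptions, not part of the lecture material).

import math
from collections import Counter

def knn_classify(training_set, new_point, k):
    # training_set: list of (features, label) pairs, features being a tuple of numbers
    # 1) compute the distance from the new point to every training sample (Euclidean here)
    distances = []
    for features, label in training_set:
        d = math.sqrt(sum((f - x) ** 2 for f, x in zip(features, new_point)))
        distances.append((d, label))
    # 2) keep the k nearest samples and return the most frequent label among them
    distances.sort(key=lambda pair: pair[0])
    nearest_labels = [label for _, label in distances[:k]]
    return Counter(nearest_labels).most_common(1)[0][0]

# Toy usage with two features and two classes
training = [((1.0, 2.0), "A"), ((2.0, 1.0), "A"), ((8.0, 9.0), "B"), ((9.0, 8.0), "B")]
print(knn_classify(training, (1.5, 1.5), k=3))   # -> "A"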



Different concepts of distance

x, y: two data points in an n-dimensional space.

Minkowski distance of order p: d(x, y) = ( Σ_{i=1}^{n} |x_i − y_i|^p )^{1/p}
- p = 1: Manhattan distance (aka L1 or city-block distance): d(x, y) = Σ_{i=1}^{n} |x_i − y_i|
- p = 2: Euclidean distance: d(x, y) = √( Σ_{i=1}^{n} (x_i − y_i)² )

There are various other concepts of distance; which one works best is application dependent.
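As an illustration, a small Python sketch of these distances; writing a single generic function with p as a parameter is one possible choice and is not taken from the slides.

def minkowski_distance(x, y, p):
    # x, y: sequences of coordinates of the same length; p = 1 gives Manhattan, p = 2 Euclidean
    return sum(abs(a - b) ** p for a, b in zip(x, y)) ** (1 / p)

x, y = (1, 2), (4, 6)
print(minkowski_distance(x, y, p=1))   # Manhattan distance: 7.0
print(minkowski_distance(x, y, p=2))   # Euclidean distance: 5.0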



Different concepts of distance

[Figure from Wikipedia: two points, at (0, 0) and (6, 6)]

Question: what are the Euclidean and Manhattan distances between the two points?

- p = 1: Manhattan distance (aka city-block distance): d(x, y) = Σ_{i=1}^{n} |x_i − y_i|
- p = 2: Euclidean distance: d(x, y) = √( Σ_{i=1}^{n} (x_i − y_i)² )
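Worked answer to the question above: with the points (0, 0) and (6, 6), the Manhattan distance is |6 − 0| + |6 − 0| = 12, and the Euclidean distance is √(6² + 6²) = √72 ≈ 8.49.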



K-Nearest Neighbours (K-NN) applications
- Simple but powerful, K-NN has been successfully used in character recognition, face recognition in images and video, recommendation systems, and diagnosing diseases based on patient data such as symptoms and blood test results.
- A good choice when the relationships between features and classes are numerous and complex to understand, but data in the same class tend to be homogeneous and there is a clear distinction between classes.
- A “lazy” learning algorithm, since computation is deferred until classification (≠ eager learning, where the algorithm processes the training data before receiving queries) → it relies heavily on the quality of the training set.
- A good first approach: if k-NN yields positive results, classification is possible, and a more powerful approach such as neural networks is likely to perform even better.



Example: food classification
- Plot the following foods in Python with “how sweet the food tastes” on the x-axis and “how crunchy the food is” on the y-axis. There should be a label near each point indicating the name of the ingredient.
Hint: look at https://www.tutorialspoint.com/matplotlib/matplotlib_scatter_plot.htm, then add the names, e.g.
for i, txt in enumerate(Ingredient):
    ax.annotate(txt, (Sweetness[i], Crunchiness[i]))

Example from [1]
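A self-contained sketch of such a plot; the ingredient names and the sweetness/crunchiness values below are made up for illustration and are not the values used in [1].

import matplotlib.pyplot as plt

# Hypothetical data: each ingredient with a sweetness and a crunchiness score (1 to 10)
Ingredient  = ["apple", "celery", "cheese", "banana", "carrot"]
Sweetness   = [9, 3, 1, 9, 6]
Crunchiness = [8, 9, 2, 1, 10]

fig, ax = plt.subplots()
ax.scatter(Sweetness, Crunchiness)

# Add the name of the ingredient next to each point
for i, txt in enumerate(Ingredient):
    ax.annotate(txt, (Sweetness[i], Crunchiness[i]))

ax.set_xlabel("How sweet the food tastes")
ax.set_ylabel("How crunchy the food is")
plt.show()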



Example: more food types

Propose a classification of the foods shown on the plot into a few categories.

Figure from [1]



Training set: classified food types

This is the training set, meaning the data has already been classified by food type, regardless of how this classification was done.

Figure from [1]



Classifying tomatoes

Figure from [1]


Measuring similarity with the distance between features

- Euclidean distance in 2D: d(p, q) = √( (p_1 − q_1)² + (p_2 − q_2)² )   [Wikipedia]
- It generalizes to higher dimensions (here illustrated in 3D) [Wikipedia]: d(p, q) = √( Σ_{k=1}^{n} (p_k − q_k)² )
- p_k is the value of the k-th feature of the first data point and q_k is the value of the k-th feature of the second data point.


Illustration: food example

Calculate the distance between the tomato (sweetness = 6, crunchiness = 4) and its four closest neighbors listed in the table.

Classify the tomato with 1-NN and with 3-NN: which class does it belong to?
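A possible way to answer with a few lines of Python. The table from [1] is not reproduced here, so the neighbor names, coordinates and classes below are placeholders to be replaced by the actual values.

import math
from collections import Counter

tomato = (6, 4)   # (sweetness, crunchiness), as given above

# Placeholder neighbors: (name, (sweetness, crunchiness), class)
neighbours = [
    ("neighbour 1", (8, 5), "fruit"),
    ("neighbour 2", (3, 7), "vegetable"),
    ("neighbour 3", (3, 6), "protein"),
    ("neighbour 4", (7, 3), "fruit"),
]

# Euclidean distance from the tomato to each neighbor, sorted from nearest to farthest
ranked = sorted((math.dist(tomato, coords), cls) for _, coords, cls in neighbours)

print("1-NN class:", ranked[0][1])                                                 # class of the nearest neighbor
print("3-NN class:", Counter(cls for _, cls in ranked[:3]).most_common(1)[0][0])   # majority vote among the 3 nearest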



Choosing the appropriate K value
- Larger K will reduce the negative impact of noisy data, but rare patterns might
be ignored
- With smaller K, such as 1-NN, noisy data can negatively impact classification and
lead to incorrect results

The challenge is that we don't know in advance which value of K is best for capturing the true underlying pattern.

Common practices: start with K equal to the square root of the training set size, use a larger K with a
weighted voting process based on the distances of the neighbors, and/or use cross-validation to evaluate
the model’s performance (this will be discussed in a later lecture).
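A small sketch of the first two practices (square-root starting point and distance-weighted voting); the helper names and the 1/distance weighting scheme are illustrative choices, not prescribed by the slide.

import math
from collections import defaultdict

def initial_k(n_training_samples):
    # Rule of thumb: start with K close to the square root of the training set size
    return max(1, round(math.sqrt(n_training_samples)))

def weighted_vote(neighbours):
    # neighbours: list of (distance, label); closer neighbors get a larger vote (weight 1/distance)
    votes = defaultdict(float)
    for d, label in neighbours:
        votes[label] += 1.0 / (d + 1e-9)   # small constant avoids division by zero
    return max(votes, key=votes.get)

print(initial_k(100))                                                            # -> 10
print(weighted_vote([(0.5, "fruit"), (2.0, "vegetable"), (2.5, "vegetable")]))   # -> fruit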



Feature scaling, aka data normalization
- Classification algorithms, and machine learning algorithms at large, do not perform well when their inputs (i.e., the values of the features) have very different scales, because the features will then have very different weights in the classification.

- The usual method is min-max normalization: X_new = (X − min(X)) / (max(X) − min(X)), where X is the value of a feature. X_new will be in [0, 1].

Limitations:
• Not robust to outliers or errors in the data (e.g., extremely large values).
• Requires knowledge of plausible minimum and maximum values in advance, as the full range of values may not be represented in the training set.
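A minimal sketch of min-max normalization for a single feature (plain Python, hypothetical values):

def min_max_normalize(values):
    # Rescale a list of feature values to [0, 1]
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) for v in values]

sweetness = [1, 3, 6, 9, 10]         # hypothetical feature values
print(min_max_normalize(sweetness))  # [0.0, 0.222..., 0.555..., 0.888..., 1.0]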



Data normalization continued
- An alternative method is z-score normalization: X_new = (X − μ) / σ, where μ is the mean and σ the standard deviation of the values of the feature.

- If a value is exactly equal to the mean of all the feature values, it is normalized to 0. If it is below the mean, the result is negative; if it is above the mean, the result is positive (see examples here).

• Handles outliers better.
• But does not produce normalized data with the exact same scale (e.g., not in [0, 1]).
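And the corresponding sketch for z-score normalization (the population standard deviation is used here; that choice is not specified in the slide):

from statistics import mean, pstdev

def z_score_normalize(values):
    # Center each value on the mean and scale by the standard deviation
    mu, sigma = mean(values), pstdev(values)
    return [(v - mu) / sigma for v in values]

sweetness = [1, 3, 6, 9, 10]          # hypothetical feature values, mean = 5.8
print(z_score_normalize(sweetness))   # values below the mean are negative, values above are positive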



References
1. B. Lantz, “Machine Learning with R”, 2nd edition, 2015.
2. Peter Bruce et al., “Practical Statistics for Data Scientists”, O’Reilly, 2nd edition, 2020.



Appendix
- “The unreasonable effectiveness of data” in supervised learning
- Neural networks excel at utilizing data



Supervised Learning:
“The Unreasonable Effectiveness of Data”
- “Garbage in, garbage out” principle: the quality of the data is crucial; it is very hard to compensate for bad data (e.g., wrong labels).
- Famous studies in the 2000s (before deep learning) showed that very different ML algorithms performed almost identically well on a natural-language problem (deciding when to write “two”, “to”, or “too”) once they had “enough” data: all techniques performed similarly.
- This suggests spending more time on collecting quality data than on algorithms.
- Deep learning has since proven to make better use of data (especially large data sets) than “traditional” ML algorithms, which tend to plateau after a certain point.
Figure from [AG20]



Neural networks vs. traditional machine learning
[Figure: performance (e.g., classification accuracy) as a function of the amount of data]
✓ Neural networks excel at utilizing large amounts of data, while traditional machine learning techniques
tend to plateau after reaching a certain data threshold.
✓ If a traditional ML algorithm is performing well and there is a significant amount of data, a neural network is likely to yield even better results.
