
Machine Learning by ambedkar@IISc

- Unsupervised Learning
- Dimensionality Reduction
- K-means Clustering
Agenda

- What is Unsupervised Learning
- Principal Component Analysis and Dimensionality Reduction
- Clustering
What is Unsupervised Learning

Unsupervised Learning

- Input: A set of unlabeled examples, D = {x_n}_{n=1}^N
- Objective: Find patterns in the observed data
- Challenge: Since there are no ground-truth labels, it is very difficult to evaluate the algorithms
Unsupervised Learning

- Examples:
  - Clustering: grouping observed data into unlabeled clusters
    - identifying social circles, summarizing observed data, etc.
  - Dimensionality Reduction: finding a low-dimensional representation of the data
    - visualization, compression, structure analysis, etc.
  - Anomaly Detection: spotting outliers in the data
    - detecting fraudulent transactions, data cleaning, etc. [1]
  - Density Estimation: finding the underlying probability distribution from which D has been sampled

[1] The discovery of the Higgs boson relied on one such algorithm.
Principal Component Analysis and Dimensionality Reduction

Dimensionality Reduction

- Input: A dataset D = {x_n}_{n=1}^N where each x_n ∈ R^d
- Objective: Find a low-dimensional representation x̃_n ∈ R^k of each point, where k < d
- In other words: Find a k-dimensional coordinate system and represent all the points in this coordinate system
  - Need to find orthonormal vectors v_1, v_2, . . . , v_k which form the basis of the new coordinate system
  - Need a way to represent the original points in this new coordinate system
- Main Question: How to choose the low-dimensional space and how to embed the points in it?
Dimensionality Reduction - Applications

- Visualization: Find a 2- or 3-dimensional representation of the data such that the essence of the data is not lost
  - Visualizing financial profiles of individuals in two dimensions to identify patterns
- Compression: Embed the points in a lower-dimensional space such that various topological properties are preserved, to optimize storage
  - Minimizing the number of colours needed to represent an image; efficient encoding schemes can then be used for compression
- Feature Selection: Remove redundant or less informative features
  - Identifying and eliminating functionally related or highly correlated features, like density, mass and volume
Dimensionality Reduction - Toy Example

- Given 7 points in two dimensions, we need 14 numbers to store the x and y coordinates of all points
- Idea 1: Discard the y coordinate of all points (green points). Only 7 numbers are needed now, but a lot of information is lost
- Idea 2: Discard the x coordinate of all points (orange points). Only 7 numbers are needed now. Better than the green points
- Idea 3: Save the slope of the pink line and the x (or y) coordinate of each point. We need to store 8 numbers, and no information is lost (see the sketch below)
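The encoding in Idea 3 can be sketched in a few lines of NumPy. The points and slope below are hypothetical stand-ins for the figure's pink line (assumed to pass through the origin, so that the slope plus the x coordinates suffice); they are not taken from the slides.

```python
import numpy as np

# Hypothetical 7 points lying exactly on a line through the origin
# (y = 0.5 * x); these are illustrative, not the points from the figure.
slope = 0.5
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0])
points = np.stack([x, slope * x], axis=1)      # 7 x 2 array: 14 numbers

# Idea 3: store only the slope plus each point's x coordinate: 8 numbers.
stored_slope, stored_x = slope, x

# Reconstruction is exact because every point lies on the line.
reconstructed = np.stack([stored_x, stored_slope * stored_x], axis=1)
assert np.allclose(points, reconstructed)      # no information lost
```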
Dimensionality Reduction - Toy Example

[Figure: the 7 toy points with the projections described above]
Dimensionality Reduction - Toy Example - Findings

- Simply discarding coordinates is not a good idea
  - Not all ways of dimensionality reduction are equally good
  - Need to quantify the amount of information lost while performing dimensionality reduction
  - Real data is not as neat as the toy example; we need a way to deal with noise
- Revised Objective: Find a k-dimensional subspace of R^d and linearly project the data onto this subspace while minimizing the "loss of information"
  - Non-linear dimensionality reduction methods exist but are beyond the current scope
  - We will consider Principal Component Analysis (PCA)
Dimensionality Reduction - Principal Component Analysis

- Let u ∈ R^d be a direction along which we want to project the data
- Thus, x̃_n = (x_n^T u) u. Note that one only needs to store x_n^T u for each n
- PCA uses the variance of the projected data as a measure of information
  - Information content is assumed to be proportional to the variance of the projected data
- We need to retain maximum information; thus, we need to find u such that the variance of the projected data is maximized:

  u* = arg max_{u : ||u|| = 1} Var({x̃_n}_{n=1}^N)
Dimensionality Reduction - PCA (contd...)

  Var({x̃_n}_{n=1}^N) = (1/N) Σ_{n=1}^N (x̃_n² − E[x̃_n]²)

- Assume WLOG that E[x_n] = 0, thus E[x̃_n] = E[x_n]^T u = 0
- Also, x̃_n² = u^T x_n x_n^T u, thus we get:

  Σ_{n=1}^N (x̃_n² − E[x̃_n]²) = Σ_{n=1}^N u^T x_n x_n^T u

- Note that Σ_{n=1}^N x_n x_n^T is the covariance matrix X of the observed data since E[x_n] = 0. Thus:

  Var({x̃_n}_{n=1}^N) = (1/N) u^T X u
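The identity above is easy to check numerically. The following is a minimal sketch using synthetic zero-mean data and a random unit direction; all names and numbers are illustrative, not from the slides.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic zero-mean data: N points in d dimensions.
N, d = 1000, 5
data = rng.standard_normal((N, d))
data -= data.mean(axis=0)                      # enforce E[x_n] = 0

# A random unit-norm direction u.
u = rng.standard_normal(d)
u /= np.linalg.norm(u)

# Variance of the scalar projections x_n^T u, computed directly ...
proj = data @ u
var_direct = np.mean(proj ** 2)                # projections have zero mean

# ... and via (1/N) u^T X u with X = sum_n x_n x_n^T.
X = data.T @ data
var_quadratic = (u @ X @ u) / N

assert np.isclose(var_direct, var_quadratic)
```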
Dimensionality Reduction - PCA (contd...)

- The constant 1/N can be dropped for the purpose of optimization. Hence the optimization problem becomes:

  u* = arg max_{u : ||u|| = 1} u^T X u

- This is a constrained optimization problem; the Lagrangian is given by:

  L(u, µ) = u^T X u + µ(u^T u − 1)
  ∇_u L = 0 ⇒ 2Xu + 2µu = 0 ⇒ Xu = −µu

- Thus, the optimal u must be an eigenvector of X. Since we want to maximize u^T X u, u must be the eigenvector corresponding to the largest eigenvalue. Hence:

  u* = eigenvector of X corresponding to the largest eigenvalue
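The eigenvector characterization can also be illustrated numerically: among unit vectors, none attains a larger value of u^T X u than the eigenvector with the largest eigenvalue. A small sketch with synthetic data and illustrative names:

```python
import numpy as np

rng = np.random.default_rng(1)

# Build X = sum_n x_n x_n^T from synthetic zero-mean data (symmetric PSD).
data = rng.standard_normal((500, 4))
data -= data.mean(axis=0)
X = data.T @ data

# Eigenvector corresponding to the largest eigenvalue (eigh sorts ascending).
eigvals, eigvecs = np.linalg.eigh(X)
u_star = eigvecs[:, -1]
best = u_star @ X @ u_star                     # equals the largest eigenvalue

# No random unit vector should exceed this value of u^T X u.
for _ in range(10000):
    u = rng.standard_normal(4)
    u /= np.linalg.norm(u)
    assert u @ X @ u <= best + 1e-9
```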
Dimensionality Reduction - PCA (contd...)

- Usually k > 1, thus we want to find u_1, u_2, . . . , u_k and not just u*
- Setting u_1 = u*, one can find u_2 as follows:

  u_2 = arg max_{u : ||u|| = 1, u^T u_1 = 0} u^T X u

- The constraint u^T u_1 = 0 is needed to avoid correlations in the projected data
- One can show that u_2 is the eigenvector of X corresponding to the second largest eigenvalue
- Similarly, u_1, u_2, . . . , u_k are the eigenvectors of X corresponding to the k largest eigenvalues. Also:

  x̃_n = U^T x_n

  where U ∈ R^{d×k} is a matrix containing u_1, u_2, . . . , u_k in its columns
Dimensionality Reduction - PCA (contd...)

Algorithm 1: Principal Component Analysis

  Input: Dataset D = {x_n}_{n=1}^N and number of dimensions k
  Output: Low-dimensional vectors D̃ = {x̃_n}_{n=1}^N
  Normalize the data so that it has zero mean
  Compute X = Σ_{n=1}^N x_n x_n^T
  Find U ∈ R^{d×k} containing the top k eigenvectors of X as columns
  Compute x̃_n ∈ R^k such that x̃_n = U^T x_n, for all n = 1, . . . , N
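A direct NumPy rendering of Algorithm 1 might look like the sketch below. The function name and the use of np.linalg.eigh (valid here because X is symmetric) are my choices, not prescribed by the slides.

```python
import numpy as np

def pca(data, k):
    """Project rows of `data` (N x d) onto the top-k principal directions.

    Follows Algorithm 1: center the data, form X = sum_n x_n x_n^T,
    take the top-k eigenvectors of X, and return U^T x_n for every n.
    """
    centered = data - data.mean(axis=0)            # zero-mean normalization
    X = centered.T @ centered                      # d x d matrix sum_n x_n x_n^T
    eigvals, eigvecs = np.linalg.eigh(X)           # eigenvalues in ascending order
    U = eigvecs[:, np.argsort(eigvals)[::-1][:k]]  # top-k eigenvectors as columns
    return centered @ U                            # N x k matrix of x̃_n = U^T x_n

# Example: reduce 10-dimensional synthetic data to 2 dimensions.
rng = np.random.default_rng(2)
data = rng.standard_normal((100, 10))
reduced = pca(data, k=2)
print(reduced.shape)                               # (100, 2)
```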
PCA Example

[Figure: PCA applied to an example dataset]
Clustering

Clustering

- Input: Data points D = {x_n}_{n=1}^N, a similarity/distance function d(·, ·) defined on elements of D, and the number of clusters K
- Objective: Partition the given N points into K subsets C_1, C_2, . . . , C_K such that:
  - C_k ⊆ D, C_k ≠ ∅ for all k = 1, . . . , K
  - C_i ∩ C_j = ∅ for all i, j = 1, . . . , K, i ≠ j
  - ∪_{k=1}^K C_k = D
  - Points in the same cluster are more similar than points across clusters (w.r.t. d(·, ·))
- Variants that allow fractional membership of points to clusters or overlapping clusters exist, but we will assume that each point belongs to exactly one cluster
Clustering (contd...)

|                  | x^(i)                                             | d(·, ·)            | Clusters                 |
|------------------|---------------------------------------------------|--------------------|--------------------------|
| Eye Gaze Tracker | (x, y) coordinate on screen where user is looking | Euclidean distance | Hot-spots on screen      |
| Social Media     | A binary vector indicating friends of person i    | 1/#common friends  | Friendship groups        |
| Documents        | Bag of words representation of document i         | 1/#common words    | Topics                   |
| Biology          | Genes                                             | Task dependent     | Gene expression patterns |

Table 1: Some examples related to clustering
Clustering - Approaches

- Agglomerative (bottom-up) vs Divisive (top-down)
- Monothetic (considers features sequentially) vs Polythetic (considers features all at once)
- Hard (single cluster membership) vs Fuzzy (mixed memberships allowed)
- Hierarchical (creates hierarchy) vs Partitional (disjoint, unordered clusters)

Any clustering algorithm can be classified based on this scheme.

Example: We will see that k-Means is a polythetic, hard and partitional clustering algorithm.
Clustering - Toy Example

[Figure: panels showing Data, Hard Clustering, Fuzzy Clustering and Hierarchical Clustering on a toy dataset]

Clustering is an exploratory data analysis problem. There is no single "correct" solution.
Clustering - Popular Algorithms

- k-Means and k-Medoids
- Spectral clustering
- Expectation Maximization for Gaussian Mixture Models
- Density-Based Spatial Clustering of Applications with Noise (DBSCAN)
- etc.
Clustering - k-Means

- Let C denote the set of all possible cluster assignments for the given dataset D
- c ∈ C is such that c ∈ {1, . . . , k}^m, where c_i = j iff x^(i) ∈ C_j. Recall that:
  - k is the number of clusters
  - m is the number of data points
  - C_j is the j-th cluster
- Ideally one would like to solve the following problem (made concrete in the sketch below):

  c* = arg min_{c ∈ C} Σ_{j=1}^k Σ_{i1, i2 = 1}^m 1{c_{i1} = j, c_{i2} = j} ||x^(i1) − x^(i2)||²

  i.e. minimize the distances between points in the same cluster
- This optimization is NP-hard, so k-Means clustering solves a relaxed version of this problem
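To make the objective concrete, here is a small sketch that evaluates this within-cluster sum of pairwise squared distances for a given assignment vector; the helper name and the toy points are hypothetical.

```python
import numpy as np

def ideal_objective(points, c, k):
    """Sum over clusters j of the squared distances between every pair of
    points assigned to cluster j, i.e. the objective that k-Means relaxes."""
    total = 0.0
    for j in range(k):
        members = points[c == j]                   # all points assigned to cluster j
        diffs = members[:, None, :] - members[None, :, :]
        total += np.sum(diffs ** 2)                # counts each ordered pair, as in the formula
    return total

# Tiny example: 4 points, 2 clusters.
points = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0]])
good = np.array([0, 0, 1, 1])
bad = np.array([0, 1, 0, 1])
print(ideal_objective(points, good, k=2))          # small: tight clusters
print(ideal_objective(points, bad, k=2))           # much larger
```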
Clustering - k-Means (contd...)

Algorithm 2: k-Means Clustering Algorithm

  Input: Dataset D = {x^(i)}_{i=1}^m and number of clusters k
  Output: Cluster assignment vector c ∈ {1, . . . , k}^m, cluster centers µ_1, . . . , µ_k
  Initialize µ_1, . . . , µ_k by randomly choosing k distinct points from D
  repeat
    Set c_i = arg min_j ||x^(i) − µ_j||² for i = 1, 2, . . . , m
    Set µ_j = (1 / |{i : c_i = j}|) Σ_{i=1}^m 1{c_i = j} x^(i) for all j = 1, 2, . . . , k
  until convergence

Breaks the optimization problem into two parts:

- Optimization over memberships c keeping µ_1, . . . , µ_k fixed
- Optimization over cluster centers µ_1, . . . , µ_k keeping c fixed
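A compact NumPy sketch of Algorithm 2 could look as follows. Names are my own, and convergence is declared when the assignments stop changing, which is one common reading of "until convergence".

```python
import numpy as np

def k_means(points, k, rng=None, max_iter=100):
    """Alternate between assigning each point to its nearest center and
    recomputing each center as the mean of its assigned points."""
    rng = np.random.default_rng() if rng is None else rng
    m = points.shape[0]
    # Initialize centers with k distinct points chosen at random from the data.
    centers = points[rng.choice(m, size=k, replace=False)].copy()
    c = np.full(m, -1)
    for _ in range(max_iter):
        # Membership step: c_i = argmin_j ||x^(i) - mu_j||^2.
        dists = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        new_c = dists.argmin(axis=1)
        if np.array_equal(new_c, c):               # converged: assignments unchanged
            break
        c = new_c
        # Center step: mu_j = mean of points currently assigned to cluster j.
        for j in range(k):
            if np.any(c == j):                     # keep old center if a cluster empties
                centers[j] = points[c == j].mean(axis=0)
    return c, centers

# Usage on a tiny synthetic dataset with two well-separated blobs.
rng = np.random.default_rng(3)
points = np.vstack([rng.normal(0, 0.5, (50, 2)), rng.normal(5, 0.5, (50, 2))])
c, centers = k_means(points, k=2, rng=rng)
```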
Clustering - k-Means (contd...)

[Figure: k-Means on a toy dataset]
Clustering - k-Means (contd...)

Limitations of k-Means:

- Not suitable for non-spherical clusters because of the use of Euclidean distance
  - Transform the data appropriately before performing k-Means (as we will see later for spectral clustering) or use kernel k-Means
- Not robust to outliers because of the use of the arithmetic mean
  - Remove outliers before clustering
- Susceptible to sub-optimal solutions
  - Run the algorithm multiple times with random initializations (see the sketch below)
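A rough sketch of the multiple-restart strategy, reusing the k_means function from the sketch after Algorithm 2 (so it is not self-contained on its own) and keeping the run with the lowest within-cluster cost:

```python
import numpy as np

def k_means_restarts(points, k, n_restarts=10, seed=0):
    """Run the k_means sketch above several times with different random
    initializations and keep the clustering with the lowest cost."""
    best_cost, best = np.inf, None
    for r in range(n_restarts):
        rng = np.random.default_rng(seed + r)
        c, centers = k_means(points, k, rng=rng)
        # Cost: sum of squared distances of points to their assigned centers.
        cost = np.sum((points - centers[c]) ** 2)
        if cost < best_cost:
            best_cost, best = cost, (c, centers)
    return best

# Libraries such as scikit-learn do this automatically, e.g. via the
# n_init argument of sklearn.cluster.KMeans.
```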
