Aiml Ece Unit-4
• Our goal is to find an assignment of data points to clusters, together with a set of
vectors {µk}, such that the sum of the squared distances from each data point to its
closest vector µk is a minimum.
Source: Pattern Recognition and Machine Learning by Christopher Bishop
• Notation to describe the assignment of data points to clusters:
• For each data point xn, we introduce a corresponding set of binary indicator
variables rnk ∈ {0,1}, k = 1,...,K, that describe which of the K clusters the data
point xn is assigned to. If xn is assigned to cluster k, then rnk = 1 and rnj = 0 for
all j ≠ k.
• This is known as the 1-of-K coding scheme.
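The notation above can be tried out on a small example. The sketch below is only an illustration, not the textbook's code: it builds the 1-of-K indicators rnk by giving each point to its closest vector µk, then evaluates the sum of squared distances J = Σn Σk rnk ||xn − µk||². The function names and the toy data are hypothetical.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two points given as sequences."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def assign_one_of_k(X, mu):
    """Return r, where r[n][k] = 1 if mu[k] is the closest vector to X[n], else 0."""
    r = []
    for x in X:
        d2 = [squared_distance(x, m) for m in mu]
        closest = d2.index(min(d2))  # ties go to the lowest k
        r.append([1 if k == closest else 0 for k in range(len(mu))])
    return r


def distortion(X, mu, r):
    """J = sum over n and k of r[n][k] * ||X[n] - mu[k]||^2."""
    return sum(
        r[n][k] * squared_distance(X[n], mu[k])
        for n in range(len(X))
        for k in range(len(mu))
    )


# Hypothetical toy data: four 2-D points and K = 2 vectors.
X = [(0.0, 0.0), (0.0, 1.0), (4.0, 4.0), (5.0, 4.0)]
mu = [(0.0, 0.5), (4.5, 4.0)]

r = assign_one_of_k(X, mu)
J = distortion(X, mu, r)
print(r)
print(J)
```

Each row of r contains exactly one 1, which is what the 1-of-K coding scheme requires.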
[Figure: successive cluster merges. 42 and 43 are merged first; the next minimum distance is between 25 and 27, so 25 and 27 are merged; the next minimum distance is between 22 and (25, 27).]
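The merge steps above read like bottom-up (agglomerative) clustering in which the distance between two clusters is the minimum distance between their members. That reading is an assumption, as is treating 22, 25, 27, 42 and 43 as plain one-dimensional values. Under those assumptions, a minimal sketch that repeatedly merges the two closest clusters is:

```python
def single_link_distance(a, b):
    """Minimum absolute difference between any member of a and any member of b."""
    return min(abs(x - y) for x in a for y in b)


def agglomerate(values):
    """Merge the two closest clusters until one remains; return the merge history."""
    clusters = [(v,) for v in values]  # start with every value in its own cluster
    merges = []
    while len(clusters) > 1:
        best = None  # (distance, i, j) of the closest pair found so far
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = single_link_distance(clusters[i], clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        merges.append((clusters[i], clusters[j], d))
        merged = tuple(sorted(clusters[i] + clusters[j]))
        clusters = [c for idx, c in enumerate(clusters) if idx not in (i, j)]
        clusters.append(merged)
    return merges


# Values taken from the figure text; the rest of the slide's data set is not known.
merges = agglomerate([22, 25, 27, 42, 43])
for left, right, dist in merges:
    print(left, right, dist)
```

On these five values, the first three merges come out in the same order as on the slide: 42 with 43, then 25 with 27, then 22 with (25, 27).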
• The principle is to identify the dense and sparse areas within the data set and then
run the clustering algorithm.
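One common way to make "dense" concrete, used by DBSCAN, is to call a point dense when at least a minimum number of points (counting the point itself) lie within a fixed radius of it. The slide does not name a specific algorithm, so the choice of this rule, the parameter names eps and min_pts, and the toy points below are all assumptions made for illustration.

```python
import math


def dense_points(points, eps, min_pts):
    """Flag each point as dense if at least min_pts points lie within eps of it.

    The count includes the point itself, following the usual DBSCAN convention.
    """
    flags = []
    for p in points:
        neighbours = sum(1 for q in points if math.dist(p, q) <= eps)
        flags.append(neighbours >= min_pts)
    return flags


# Hypothetical toy data: a tight group of four points and one isolated point.
points = [(0, 0), (0, 1), (1, 0), (1, 1), (8, 8)]
flags = dense_points(points, eps=1.5, min_pts=3)
print(flags)
```

Points flagged True lie in a dense area; the isolated point is flagged False and would fall in a sparse area.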
https://youtu.be/jqKAAVEwX9M?si=cnvsppj2X-KFBWA6