0% found this document useful (0 votes)
154 views

K Means Example

The document describes using the k-means clustering algorithm to cluster eight points into three clusters over two iterations. In the first iteration, the points are assigned to the closest initial cluster center and the centers are recomputed. In the second iteration, the points are reassigned based on the new centers and final cluster centers are reported.

Uploaded by

harshit T
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
154 views

K Means Example

The document describes using the k-means clustering algorithm to cluster eight points into three clusters over two iterations. In the first iteration, the points are assigned to the closest initial cluster center and the centers are recomputed. In the second iteration, the points are reassigned based on the new centers and final cluster centers are reported.

Uploaded by

harshit T
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 8

Problem-01:

 Cluster the following eight points (with (x, y) representing locations) into three clusters:

A1(2, 10), A2(2, 5), A3(8, 4), A4(5, 8), A5(7, 5), A6(6, 4), A7(1, 2), A8(4, 9)

 Initial cluster centers are: A1(2, 10), A4(5, 8) and A7(1, 2).

The distance function between two points a = (x1, y1) and b = (x2, y2) is defined as-

Ρ(a, b) = |x2 – x1| + |y2 – y1|

Use K-Means Algorithm to find the three cluster centers after the second iteration.

 Solution-

 We follow the above discussed K-Means Clustering Algorithm-

 Iteration-01:

 We calculate the distance of each point from each of the center of the three clusters.

 The distance is calculated by using the given distance function.

 The following illustration shows the calculation of distance between point A1(2, 10) and
each of the center of the three clusters-

Calculating Distance Between A1(2, 10) and C1(2, 10)-

 Ρ(A1, C1)

= |x2 – x1| + |y2 – y1|

= |2 – 2| + |10 – 10|

=0

 Calculating Distance Between A1(2, 10) and C2(5, 8)-

 Ρ(A1, C2)

= |x2 – x1| + |y2 – y1|

= |5 – 2| + |8 – 10|

=3+2

=5

 Calculating Distance Between A1(2, 10) and C3(1, 2)-

 
Ρ(A1, C3)

= |x2 – x1| + |y2 – y1|

= |1 – 2| + |2 – 10|

=1+8

=9

 In the similar manner, we calculate the distance of other points from each of the center of the
three clusters.

 Next,

 We draw a table showing all the results.


 Using the table, we decide which point belongs to which cluster.
 The given point belongs to that cluster whose center is nearest to it.

Distance from
Distance from center Distance from center Point belongs to
 Given Points center (2, 10) of
(5, 8) of Cluster-02 (1, 2) of Cluster-03 Cluster
Cluster-01
A1(2, 10) 0 5 9 C1
A2(2, 5) 5 6 4 C3
A3(8, 4) 12 7 9 C2
A4(5, 8) 5 0 10 C2
A5(7, 5) 10 5 9 C2
A6(6, 4) 10 5 7 C2
A7(1, 2) 9 10 0 C3
A8(4, 9) 3 2 10 C2

 From here, New clusters are-

 Cluster-01:

 First cluster contains points-

 A1(2, 10)

 Cluster-02:

 Second cluster contains points-

 A3(8, 4)
 A4(5, 8)
 A5(7, 5)
 A6(6, 4)
 A8(4, 9)

Cluster-03:

Third cluster contains points-


 A2(2, 5)
 A7(1, 2)

 Now,

 We re-compute the new cluster clusters.


 The new cluster center is computed by taking mean of all the points contained in that
cluster.

 For Cluster-01:

 We have only one point A1(2, 10) in Cluster-01.

 So, cluster center remains the same.

 For Cluster-02:

 Center of Cluster-02

= ((8 + 5 + 7 + 6 + 4)/5, (4 + 8 + 5 + 4 + 9)/5)

= (6, 6)

 For Cluster-03:

 Center of Cluster-03

= ((2 + 1)/2, (5 + 2)/2)

= (1.5, 3.5)

 This is completion of Iteration-01.

 Iteration-02:

 We calculate the distance of each point from each of the center of the three clusters.

 The distance is calculated by using the given distance function.

 The following illustration shows the calculation of distance between point A1(2, 10) and
each of the center of the three clusters-

 Calculating Distance Between A1(2, 10) and C1(2, 10)-

 Ρ(A1, C1)

= |x2 – x1| + |y2 – y1|

= |2 – 2| + |10 – 10|

=0

 Calculating Distance Between A1(2, 10) and C2(6, 6)-

Ρ(A1, C2)
= |x2 – x1| + |y2 – y1|

= |6 – 2| + |6 – 10|

=4+4

=8

 Calculating Distance Between A1(2, 10) and C3(1.5, 3.5)-

 Ρ(A1, C3)

= |x2 – x1| + |y2 – y1|

= |1.5 – 2| + |3.5 – 10|

= 0.5 + 6.5

=7

 In the similar manner, we calculate the distance of other points from each of the center of the
three clusters.

 Next,

 We draw a table showing all the results.


 Using the table, we decide which point belongs to which cluster.
 The given point belongs to that cluster whose center is nearest to it.

Distance from Distance from Distance from center


Point belongs to
 Given Points center (2, 10) of center (6, 6) of (1.5, 3.5) of Cluster-
Cluster
Cluster-01 Cluster-02 03
A1(2, 10) 0 8 7 C1
A2(2, 5) 5 5 2 C3
A3(8, 4) 12 4 7 C2
A4(5, 8) 5 3 8 C2
A5(7, 5) 10 2 7 C2
A6(6, 4) 10 2 5 C2
A7(1, 2) 9 9 2 C3
A8(4, 9) 3 5 8 C1

 From here, New clusters are-

 Cluster-01:

 First cluster contains points-

 A1(2, 10)
 A8(4, 9)

 Cluster-02:

Second cluster contains points-


 A3(8, 4)
 A4(5, 8)
 A5(7, 5)
 A6(6, 4)

 Cluster-03:

 Third cluster contains points-

 A2(2, 5)
 A7(1, 2)

 Now,

 We re-compute the new cluster clusters.


 The new cluster center is computed by taking mean of all the points contained in that
cluster.

 For Cluster-01:

 Center of Cluster-01

= ((2 + 4)/2, (10 + 9)/2)

= (3, 9.5)

 For Cluster-02:

 Center of Cluster-02

= ((8 + 5 + 7 + 6)/4, (4 + 8 + 5 + 4)/4)

= (6.5, 5.25)

 For Cluster-03:

 Center of Cluster-03

= ((2 + 1)/2, (5 + 2)/2)

= (1.5, 3.5)

 This is completion of Iteration-02.

 After second iteration, the center of the three clusters are-

 C1(3, 9.5)
 C2(6.5, 5.25)
 C3(1.5, 3.5)
Problem-02:
 

Use K-Means Algorithm to create two clusters-

Solution-
 

We follow the above discussed K-Means Clustering Algorithm.

Assume A(2, 2) and C(1, 1) are centers of the two clusters.

Iteration-01:

 We calculate the distance of each point from each of the center of the two clusters.
 The distance is calculated by using the euclidean distance formula.

The following illustration shows the calculation of distance between point A(2, 2) and each
of the center of the two clusters-

Calculating Distance Between A(2, 2) and C1(2, 2)-

Ρ(A, C1)
= sqrt [ (x2 – x1)2 + (y2 – y1)2 ]

= sqrt [ (2 – 2)2 + (2 – 2)2 ]

= sqrt [ 0 + 0 ]

=0

Calculating Distance Between A(2, 2) and C2(1, 1)-

Ρ(A, C2)

= sqrt [ (x2 – x1)2 + (y2 – y1)2 ]

= sqrt [ (1 – 2)2 + (1 – 2)2 ]

= sqrt [ 1 + 1 ]

= sqrt [ 2 ]

= 1.41

In the similar manner, we calculate the distance of other points from each of the center of the
two clusters.

Next,

 We draw a table showing all the results.


 Using the table, we decide which point belongs to which cluster.
 The given point belongs to that cluster whose center is nearest to it.

Distance from center (2, Distance from center (1,


Given Points Point belongs to Cluster
2) of Cluster-01 1) of Cluster-02

A(2, 2) 0 1.41 C1
B(3, 2) 1 2.24 C1
C(1, 1) 1.41 0 C2
D(3, 1) 1.41 2 C1
E(1.5, 0.5) 1.58 0.71 C2

From here, New clusters are-

 
Cluster-01:

First cluster contains points-

 A(2, 2)
 B(3, 2)
 E(1.5, 0.5)
 D(3, 1)

Cluster-02:

Second cluster contains points-

 C(1, 1)
 E(1.5, 0.5)

Now,

 We re-compute the new cluster clusters.


 The new cluster center is computed by taking mean of all the points contained in that
cluster.

For Cluster-01:

Center of Cluster-01

= ((2 + 3 + 3)/3, (2 + 2 + 1)/3)

= (2.67, 1.67)

 For Cluster-02:

 Center of Cluster-02

= ((1 + 1.5)/2, (1 + 0.5)/2)

= (1.25, 0.75)

This is completion of Iteration-01.

Next, we go to iteration-02, iteration-03 and so on until the centers do not change anymore.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy