Non Hierarchical Clustering
• The number of clusters, K, may either be specified in advance or determined as part of the clustering
procedure.
• As the matrix of distances (similarities) does not have to be determined, and the basic data do not
have to be stored during the computer run, non hierarchical methods can be applied to much larger data
sets than can hierarchical techniques.
• Good choices for starting configurations should be free of overt biases. One way to start is to randomly
select seed points from among the items or to randomly partition the items into initial groups.
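For instance, both starting strategies can be sketched in a few lines of NumPy (an illustration only; the array X and all names here are hypothetical placeholders, not from the original notes):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(20, 2))   # 20 hypothetical items measured on p = 2 variables
K = 3

# Option 1: randomly select K of the items to serve as seed points (initial centroids).
seed_points = X[rng.choice(len(X), size=K, replace=False)]

# Option 2: randomly partition the items into K non-empty initial groups
# and start from the group means.
groups = np.array_split(rng.permutation(len(X)), K)
initial_centroids = np.array([X[g].mean(axis=0) for g in groups])
```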
The procedure then runs as follows:
1. Partition the items into K initial clusters.
2. Proceed through the list of items, assigning each item to the cluster whose centroid (mean) is nearest.
(Distance is usually computed using Euclidean distance with either standardized or unstandardized observations.)
Recalculate the centroid for the cluster receiving the new item and for the cluster losing the item.
3. Repeat Step 2 until no more reassignments take place.
Note that rather than starting with a partition of all items into K preliminary groups in Step 1, we could specify K
initial centroids (seed points) and then proceed to Step 2.
The final assignment of items to clusters is dependent upon the initial partition or the initial selection of seed points.
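A compact sketch of the procedure described above, assuming Euclidean distance on the raw observations (the function name and defaults are illustrative, not from the original notes):

```python
import numpy as np

def kmeans_sequential(X, K, rng=None, max_pass=100):
    """Sequential K-means: visit items one at a time and update the two
    affected centroids as soon as an item changes cluster."""
    rng = np.random.default_rng(rng)
    labels = rng.permutation(len(X)) % K          # Step 1: random initial partition
    centroids = np.array([X[labels == k].mean(axis=0) for k in range(K)])
    counts = np.bincount(labels, minlength=K)
    for _ in range(max_pass):
        moved = False
        for j, x in enumerate(X):                 # Step 2: proceed through the items
            d = np.linalg.norm(centroids - x, axis=1)   # Euclidean distances to centroids
            k_new, k_old = d.argmin(), labels[j]
            if k_new != k_old and counts[k_old] > 1:    # keep every cluster non-empty
                # Update the centroid losing the item and the one receiving it.
                centroids[k_old] = (counts[k_old] * centroids[k_old] - x) / (counts[k_old] - 1)
                centroids[k_new] = (counts[k_new] * centroids[k_new] + x) / (counts[k_new] + 1)
                counts[k_old] -= 1
                counts[k_new] += 1
                labels[j] = k_new
                moved = True
        if not moved:                             # Step 3: no reassignments, so stop
            break
    return labels, centroids
```

The update inside the loop uses the incremental centroid formulas given further below, so the basic data never need to be revisited to recompute a mean.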
Suppose we measure two variables 𝑋1 and 𝑋2 for each of four individuals A, B, C, and D:
              Observations
Individuals    𝑋1    𝑋2
A               5     3
B              -1     1
C               1    -2
D              -3    -2
Objective: To divide these items into K = 2 clusters such that the items within a cluster are closer to one another
than they are to the items in different clusters.
We arbitrarily partition the items into two clusters, (AB) and (CD), and compute the coordinates (x̄1, x̄2) of each
cluster centroid (mean).
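For this initial partition, the centroid coordinates are simply the averages of the member items:

Cluster (AB): x̄1 = (5 + (−1))/2 = 2,  x̄2 = (3 + 1)/2 = 2
Cluster (CD): x̄1 = (1 + (−3))/2 = −1,  x̄2 = (−2 + (−2))/2 = −2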
If an item is moved from the initial configuration, the cluster centroids (means) must be updated before proceeding.
The i-th coordinate, i = 1, 2, ..., p, of the centroid is easily updated using the formulas:
new x̄_i = (n·x̄_i + x_ji)/(n + 1)   if the j-th item is added to a group,
new x̄_i = (n·x̄_i − x_ji)/(n − 1)   if the j-th item is removed from a group,
where n is the number of items in the "old" group with centroid coordinate x̄_i and x_ji is the i-th coordinate of the j-th item.
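As a quick illustration with the example above (not part of the original notes): if item B = (−1, 1) were removed from cluster (AB), whose centroid is (2, 2) with n = 2, the updated coordinates would be (2·2 − (−1))/(2 − 1) = 5 and (2·2 − 1)/(2 − 1) = 3, i.e. the coordinates of the remaining item A.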
• To check the stability of the clustering, it is desirable to rerun the algorithm with a new initial partition.
• A table of the cluster centroids (means) and within-cluster variances also helps to delineate group differences.
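Reusing the kmeans_sequential sketch and the four-item example above (names again illustrative), the stability check and the summary table might be produced as follows:

```python
import numpy as np

X = np.array([[5., 3.], [-1., 1.], [1., -2.], [-3., -2.]])   # items A, B, C, D

# Two runs from different random initial partitions; matching cluster
# memberships (up to relabelling) suggest a stable solution.
labels_a, _ = kmeans_sequential(X, K=2, rng=0)
labels_b, _ = kmeans_sequential(X, K=2, rng=1)

# Summary table: centroid and within-cluster variance for each cluster
# (population form, so a single-item cluster simply shows variance 0).
for k in range(2):
    members = X[labels_a == k]
    print(f"cluster {k}: centroid = {members.mean(axis=0)}, "
          f"variance = {members.var(axis=0)}")
```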
The following are strong arguments for not fixing the number of clusters, K, in advance:
• If two or more seed points inadvertently lie within a single cluster, their resulting clusters
will be poorly differentiated.
• The existence of an outlier might produce at least one group with widely dispersed items.
• Even if the population is known to consist of K groups, the sampling method may be such
that data from the rarest group do not appear in the sample.
• In cases in which a single run of the algorithm requires the user to specify K, it is always a
good idea to rerun the algorithm for several choices of K, as in the sketch following this list.
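Following that advice on the small example above (again reusing X and kmeans_sequential; a sketch, not a definitive recipe), one might compare the total within-cluster sum of squares across several choices of K:

```python
# The criterion always shrinks as K grows, so look for the value of K
# beyond which further increases buy little improvement.
for K in (1, 2, 3):
    labels_k, centroids_k = kmeans_sequential(X, K, rng=0)
    wss = sum(((X[labels_k == k] - centroids_k[k]) ** 2).sum() for k in range(K))
    print(f"K = {K}: within-cluster sum of squares = {wss:.1f}")
```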