0% found this document useful (0 votes)

6 views2 pages

5 Minute Summary Lecture - 1

Unsupervised learning is a machine learning approach that identifies patterns in untagged data, focusing on relationships rather than predictions. K-Means is a popular clustering algorithm that partitions data into clusters based on proximity to centroids, utilizing Lloyd's Algorithm for convergence. Evaluating K-Means can be challenging due to the lack of labels, with methods like the Elbow Method and Silhouette Score helping to determine the optimal number of clusters.

Uploaded by

Bishal chinmay das

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views2 pages

5 Minute Summary Lecture - 1

Uploaded by

Bishal chinmay das

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

🧶

5 minute Summary Lecture - 1

Introduction to Unsupervised Learning
Unsupervised learning is a type of machine learning where the algorithm learns
patterns from untagged data. The goal is to model the underlying structure or
distribution in the data to learn more about it. Unlike supervised learning,
unsupervised learning doesn't work with predictions or labels; instead, it focuses on
finding relationships within the data.

Clustering and K-Means Algorithm

Clustering is a significant part of unsupervised learning aimed at grouping sets of
objects in such a way that objects in the same group (called a cluster) are more
similar to each other than to those in other groups. K-Means is one of the simplest
and most popular clustering algorithms. It aims to partition n observations into k
clusters in which each observation belongs to the cluster with the nearest mean.

The Math Behind K-Means

K-Means clustering works by initializing k centroids randomly, then iterating over two
steps:

1. Assignment step: Assign each data point to the nearest centroid.

2. Update step: Update the centroids to be the mean of the points assigned to
them.

5 minute Summary Lecture - 1 1

The process repeats until the assignments no longer change or the changes are
below a certain threshold, indicating that the algorithm has converged.

Lloyd's Algorithm
Lloyd's Algorithm, often synonymous with K-Means, refers to the two-step iterative
approach K-Means uses to converge on a solution. It's worth noting that while
efficient, Lloyd's algorithm can sometimes fall into local minima, and its performance
can be sensitive to the initial choice of centroids.

Implementing K-Means from Scratch

When implementing K-Means from scratch, the key steps involve initializing centroids
randomly, then iteratively updating the cluster assignments and the centroids until the
algorithm converges. This process involves calculating distances between data
points and centroids, typically using Euclidean distance, and recalculating cluster
centroids after every assignment step.

Determining the Optimal Number of Clusters (K)

Determining the right number of clusters, K, is crucial for K-Means performance.
Techniques like the Elbow Method involve plotting the within-cluster sum of squares
(WCSS) against the number of clusters and looking for the "elbow" point where the
rate of decrease sharply changes. The Silhouette Score is another method, provides
a measure of how similar an object is to its own cluster compared to other clusters.

Practical Application: Customer Segmentation

Applying K-Means for customer segmentation involves preprocessing steps like
feature scaling, followed by fitting the K-Means model to the data and interpreting the
resulting clusters. Each cluster can represent a different customer segment, which
can then be targeted with tailored marketing strategies.

Evaluation and Challenges

Evaluating unsupervised learning models like K-Means can be challenging due to the
absence of ground truth labels. Metrics like the Dunn Index, which involves the ratio
of the smallest distance between observations not in the same cluster to the largest
intra-cluster distance, can be used. However, the ultimate measure often involves
domain-specific knowledge and the practical usefulness of the clustering results.

5 minute Summary Lecture - 1 2

Polynomials - Practice Sheet - Warrior 2025
No ratings yet
Polynomials - Practice Sheet - Warrior 2025
5 pages
K Means Presentation
No ratings yet
K Means Presentation
69 pages
1694601073-Unit 3.1 Unsupervised Learning CU 2.0
No ratings yet
1694601073-Unit 3.1 Unsupervised Learning CU 2.0
35 pages
Lesson 1 Polynomial Function
0% (1)
Lesson 1 Polynomial Function
35 pages
Design and Analysis of Algorithm
No ratings yet
Design and Analysis of Algorithm
2 pages
Unit 4
No ratings yet
Unit 4
96 pages
CE345 - Lecture #9 - Clustering
No ratings yet
CE345 - Lecture #9 - Clustering
56 pages
ML Unit 4
No ratings yet
ML Unit 4
110 pages
WINSEM2023-24 BEEE410L TH VL2023240502246 2024-03-22 Reference-Material-I
No ratings yet
WINSEM2023-24 BEEE410L TH VL2023240502246 2024-03-22 Reference-Material-I
95 pages
Souvik Pal - 60
No ratings yet
Souvik Pal - 60
9 pages
Unit 4
No ratings yet
Unit 4
125 pages
Unit 4
No ratings yet
Unit 4
53 pages
Unit IV
No ratings yet
Unit IV
96 pages
Lecture Unsupervised (17!04!2024)
No ratings yet
Lecture Unsupervised (17!04!2024)
61 pages
M3 - Unsupervised Machine Learning
No ratings yet
M3 - Unsupervised Machine Learning
35 pages
UNIT-5 Material
No ratings yet
UNIT-5 Material
42 pages
Week 14 and 15 Machine Learning Unsupervised 2
No ratings yet
Week 14 and 15 Machine Learning Unsupervised 2
25 pages
Week 9. Unsupervised Learning
No ratings yet
Week 9. Unsupervised Learning
32 pages
Week 9
No ratings yet
Week 9
66 pages
Module 6 - Un-Supervised Learning Algorithms
No ratings yet
Module 6 - Un-Supervised Learning Algorithms
31 pages
DM Lecture 06
No ratings yet
DM Lecture 06
32 pages
Unit 4 Aiml
No ratings yet
Unit 4 Aiml
24 pages
04-FSSR DS610 2024 2025T1 Kmeans
No ratings yet
04-FSSR DS610 2024 2025T1 Kmeans
57 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
27 pages
Module09 TreeBasedMethods
No ratings yet
Module09 TreeBasedMethods
36 pages
Som New
No ratings yet
Som New
21 pages
K Nearest Neighbor Algorithm: Fundamentals and Applications
From Everand
K Nearest Neighbor Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet
Malika
No ratings yet
Malika
21 pages
Chapter 1
No ratings yet
Chapter 1
23 pages
Chapter 14 - Nonlinear Regression Models
No ratings yet
Chapter 14 - Nonlinear Regression Models
20 pages
Unsupervised Learning Final
No ratings yet
Unsupervised Learning Final
17 pages
UnSupervised Learning
No ratings yet
UnSupervised Learning
40 pages
2 - K-Mean
No ratings yet
2 - K-Mean
39 pages
Machine Learning - Iv
No ratings yet
Machine Learning - Iv
13 pages
EAI13
No ratings yet
EAI13
19 pages
KMeans Clustering Report
No ratings yet
KMeans Clustering Report
2 pages
Chapter 8
No ratings yet
Chapter 8
15 pages
Session4 KMeansClustering
No ratings yet
Session4 KMeansClustering
10 pages
ML Unit5 Notes
No ratings yet
ML Unit5 Notes
18 pages
Clustering
No ratings yet
Clustering
18 pages
ML UNIT 4 Sir
No ratings yet
ML UNIT 4 Sir
42 pages
Class6 Unsupervised Learning Clustering
No ratings yet
Class6 Unsupervised Learning Clustering
13 pages
Clustering FinancialData
No ratings yet
Clustering FinancialData
38 pages
ML Unit 4 V1
No ratings yet
ML Unit 4 V1
30 pages
Lecture - 10 Unsupervised Learning & K-Means Clustering
No ratings yet
Lecture - 10 Unsupervised Learning & K-Means Clustering
31 pages
EXP5MP Merged
No ratings yet
EXP5MP Merged
14 pages
K-Means Clustering Clearly Explained
No ratings yet
K-Means Clustering Clearly Explained
12 pages
Lesson 5 - Unsupervised Learning
No ratings yet
Lesson 5 - Unsupervised Learning
11 pages
Machine Learning Chapter 3
No ratings yet
Machine Learning Chapter 3
12 pages
K Means - Ipynb - Colab
No ratings yet
K Means - Ipynb - Colab
10 pages
LIFO Search and FIFO Search
No ratings yet
LIFO Search and FIFO Search
9 pages
ML Unit 2 Notes
No ratings yet
ML Unit 2 Notes
14 pages
K Means Final
No ratings yet
K Means Final
10 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
9 pages
K Means Clustering
No ratings yet
K Means Clustering
13 pages
10.lab Activity
No ratings yet
10.lab Activity
11 pages
Discrete-Time Systems: - Fe-b-r-U-Qr-Y - 2-0-00 - 3 - 9
No ratings yet
Discrete-Time Systems: - Fe-b-r-U-Qr-Y - 2-0-00 - 3 - 9
11 pages
Kmean
No ratings yet
Kmean
24 pages
K Means Clustering
No ratings yet
K Means Clustering
22 pages
Facebook Live Seller
No ratings yet
Facebook Live Seller
8 pages
Digital Comm Sheet3
No ratings yet
Digital Comm Sheet3
6 pages
Python New Lab Question
No ratings yet
Python New Lab Question
12 pages
Aiml 8
No ratings yet
Aiml 8
7 pages
Clustering Kmeans
No ratings yet
Clustering Kmeans
6 pages
K - Means Clustering
No ratings yet
K - Means Clustering
13 pages
K Means
No ratings yet
K Means
9 pages
Mod4 - Unsupervised Learning
No ratings yet
Mod4 - Unsupervised Learning
9 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
12 pages
LP Balanced Transportation V2
No ratings yet
LP Balanced Transportation V2
28 pages
Set Cover
No ratings yet
Set Cover
4 pages
Unsupervised Machine Learning
No ratings yet
Unsupervised Machine Learning
10 pages
May Jun 2024
No ratings yet
May Jun 2024
2 pages
Filtered QB
No ratings yet
Filtered QB
3 pages
Clustering in Machine Learning: Prepared by
No ratings yet
Clustering in Machine Learning: Prepared by
10 pages
The Secret Of Machine Learning
From Everand
The Secret Of Machine Learning
Mhd Arjunanta
No ratings yet
Pattern Recognition-Theory
No ratings yet
Pattern Recognition-Theory
2 pages
Higher Data Structures and Algorithms This Is A Sample Only Time Allowed: 3 Hours Total Marks: 100 Number of Parts: 5
No ratings yet
Higher Data Structures and Algorithms This Is A Sample Only Time Allowed: 3 Hours Total Marks: 100 Number of Parts: 5
9 pages
EE247 - Lecture 2 Filters
No ratings yet
EE247 - Lecture 2 Filters
27 pages
17CS43 2019 Jan
No ratings yet
17CS43 2019 Jan
3 pages
A SECTION - Record of Attendance and Assessment
No ratings yet
A SECTION - Record of Attendance and Assessment
5 pages
Javascript Implementation of Dijkastra's Algorithm
No ratings yet
Javascript Implementation of Dijkastra's Algorithm
3 pages
Wave Let
No ratings yet
Wave Let
14 pages
Longest Common Subsequence (DP Approach)
No ratings yet
Longest Common Subsequence (DP Approach)
4 pages
K Mean
No ratings yet
K Mean
12 pages
Simple Technique CFD
No ratings yet
Simple Technique CFD
13 pages
Hashing
No ratings yet
Hashing
4 pages
3.5 Coefficients of The Interpolating Polynomial: Interpolation and Extrapolation
No ratings yet
3.5 Coefficients of The Interpolating Polynomial: Interpolation and Extrapolation
4 pages
Signals and Networks Assignment 2
No ratings yet
Signals and Networks Assignment 2
6 pages
Binomial Expansion A (4037)
No ratings yet
Binomial Expansion A (4037)
9 pages
Division of Polynomials - Long Division
No ratings yet
Division of Polynomials - Long Division
2 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

5 Minute Summary Lecture - 1

Uploaded by

5 Minute Summary Lecture - 1

Uploaded by

🧶

5 minute Summary Lecture - 1

Clustering and K-Means Algorithm

The Math Behind K-Means

1. Assignment step: Assign each data point to the nearest centroid.

5 minute Summary Lecture - 1 1

Implementing K-Means from Scratch

Determining the Optimal Number of Clusters (K)

Practical Application: Customer Segmentation

Evaluation and Challenges

5 minute Summary Lecture - 1 2

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.