0% found this document useful (0 votes)

45 views41 pages

Clustering Part-A

This document provides an overview of unsupervised learning techniques, specifically clustering using the k-means algorithm. It explains that k-means clustering is an iterative algorithm that groups unlabeled data points into k clusters based on similarity. It works by initially assigning data points to the closest cluster centroid and then iteratively updating the centroid positions until the clusters converge. The document also discusses applications of unsupervised learning like customer segmentation and anomaly detection.

Uploaded by

Waseem Sajjad

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

45 views41 pages

Clustering Part-A

Uploaded by

Waseem Sajjad

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 41

High Impact Skills Development Program

in Artificial Intelligence, Data Science, and Blockchain

Module 2: Unsupervised Learning

Lecture 1: Clustering

Instructor: Dr. Nazia Perwaiz

Assistant Professor, SEECS, NUST

1
Someone Messed Up the Library!

French

German

Spanish
Someone Messed Up the Library!
French

German

Spanish
Supervised Learning

Feature 2

Feature 1
Unsupervised Learning

Feature 2

Feature 1
Unsupervised Learning
Supervised vs Unsupervised Learning
Properties Unsupervised Learning Supervised Learning
Definition Type of machine learning Type of machine learning that
that happens without happens under human supervision,
human supervision and meaning people label input data
machine tries to find out with answer keys, that guided/
the patterns in data itself supervises the machine to learn the
desired outputs
Input data Unlabeled Labeled
Use of data Model is given only the Model is given input variables (X),
input variables (X) and no output variables (Y) and an
corresponding output data algorithm to learn the function
from input to output
7
Supervised vs Unsupervised
Learning
Properties Unsupervised Learning Supervised Learning
When to use You don’t know what you You know what you are looking for
are looking for in data in data

Applicable in To identify patterns and To determine a specific output

relationships in data relating to classification and
(Clustering and regression problems
association problems)
Accuracy of May provide less accurate Provides more accurate results
results results

Methods Computationally Simple

Complex 8
Why/ When Unsupervised Learning?
• Easier to get unlabeled data and less time-consuming
• no need to manually label the data

• Understanding raw data

• To find unknown patterns to get useful insights from raw data
• e.g. user categorization by their social media activity

• Similar to human mind

• baby-cat example

9
When Unsupervised Learning?
• If you need to identify patterns and relationships in data

• The data is pretty large

• where labeling the data may be time-consuming or impractical

10
Applications of Unsupervised Learning
• Medical diagnosis

• Customer segmentation

• Recommendation systems

• Anomaly Detection

• Cyber security

• Preparing data for supervised learning

11
Applications of Unsupervised Learning
• Medical diagnosis

12
Applications of Unsupervised Learning
• Customer segmentation

13
Applications of Unsupervised Learning
• Recommendation systems

14
Applications of Unsupervised Learning
• Anomaly Detection

15
Applications of Unsupervised Learning
• Cyber security (data preparation for unknown threats)

16
Applications of Unsupervised Learning
• Preparing data for supervised learning (Image segmentation)

17
Unsupervised ML Approaches
• Clustering: identifies similarities and differences between
unlabelled data entries and groups them based on their
properties.

• Dimensionality reduction: reduces some data while

maintaining the integrity of a data, when there's so much data
to analyse which may reduce the algorithms' performance.

• Association: can find relationships between variables, i.e.

identifies sets of items which often occur together in a dataset

18
Unsupervised ML Approaches
• Clustering: identifies similarities and differences between
unlabelled data entries and groups them based on their
properties.

19
Unsupervised ML Approaches
• Dimensionality reduction: reduces some data while
maintaining the integrity of a data, when there's so much data
to analyse which may reduce the algorithms' performance.

20
Unsupervised ML Approaches
• Association: can find relationships between variables, i.e.
identifies sets of items which often occur together in a dataset

21
Clustering

K-means Algorithm

22
K-means Clustering
Motivation:

• to summarize a complex real-valued data point with

a single categorical variable
K-means Clustering
K-means Clustering
• Is an Iterative algorithm

• that divides a group of n datasets

• into k different clusters/ subgroups

• based on the similarity and their mean distance from the

central point (centroid) of that particular subgroup/
formed.
K-means Clustering
Start:

• Pick K random
points as cluster
centeroids.

Here,

K=2
K-means Clustering

Iterative Step 1

• Assign data
points to closest
cluster centroid
K-means Clustering

Iterative Step 2

• Compute the
average position
of all data points
assigned to a
centroid
K-means Clustering

Iterative Step 3

• Move the cluster

centroid to the
average of the
assigned points
K-means Clustering

Repeat:

• Calculate average
of data points

• Move centroid to
the new average
position
K-means Clustering

Repeat:

• Until
Convergence

• i.e. Reassignment
of data points
occurs
K-means Clustering

Repeat:

• Until
Convergence

• i.e. Reassignment
of data points
occurs
K-means Clustering

Repeat:

• Until
Convergence

• i.e. Reassignment
of data points
occurs
K-means Clustering

Repeat:

• Until
Convergence

Converged
or
Not Converged?
K-means Clustering
When K-means Algorithm ends?

1. No re-assignment of the data points occurs

2. No relocation/ re-positioning of centroids

K-means Algorithm
K-means Algorithm

For all data points of

training data:
Find closest centroid c(i)

Average of points is recomputed

for all centroids relocation
K-means Optimization Objective

x(i) training example i

uc(i) cluster centroid of x(i)

Using K-means Clustering

from sklearn.cluster import KMeans

kmeans = KMeans(n_clusters=k, **kmeans_kwargs)
kmeans.fit(features)
Happy
Learning!

Selective Mutism Treatment Case Samples
No ratings yet
Selective Mutism Treatment Case Samples
18 pages
1 Developing A Risk Profile
100% (1)
1 Developing A Risk Profile
15 pages
Balanced Scorecards: Human Resource Management 638
100% (2)
Balanced Scorecards: Human Resource Management 638
32 pages
DLL - ARALING PANLIPUNAN 6 - Q4 - W2 - Reflections
No ratings yet
DLL - ARALING PANLIPUNAN 6 - Q4 - W2 - Reflections
3 pages
Journeys Spelling Homework First Grade
50% (2)
Journeys Spelling Homework First Grade
5 pages
Week 1 Doing Philosphy
No ratings yet
Week 1 Doing Philosphy
5 pages
What Did You Learn From Understanding The Self
No ratings yet
What Did You Learn From Understanding The Self
3 pages
Oral Fluency The Neglected Component in The Communicative Language Classroom
No ratings yet
Oral Fluency The Neglected Component in The Communicative Language Classroom
25 pages
Turn Your Shit in
100% (1)
Turn Your Shit in
2 pages
Thesis Statements For 8th Graders
100% (3)
Thesis Statements For 8th Graders
6 pages
Types of Organizational Structure
No ratings yet
Types of Organizational Structure
9 pages
RPS - English Semantics Ok
No ratings yet
RPS - English Semantics Ok
13 pages
The Oxford Handbook of Sociolinguistics
No ratings yet
The Oxford Handbook of Sociolinguistics
4 pages
Math and Science Reading K
No ratings yet
Math and Science Reading K
2 pages
2012 DSE Paper 2 Q5 Sample
No ratings yet
2012 DSE Paper 2 Q5 Sample
2 pages
Chapter 2 Helping Material
No ratings yet
Chapter 2 Helping Material
13 pages
Group 16 Chapter 17 How To Write Chapter 5 Conclusion. and Recommendations
No ratings yet
Group 16 Chapter 17 How To Write Chapter 5 Conclusion. and Recommendations
10 pages
Cot 1 Oral Com DLP Objective 10
No ratings yet
Cot 1 Oral Com DLP Objective 10
9 pages
Lesson Plan Template For Teachers
No ratings yet
Lesson Plan Template For Teachers
3 pages
LRP Orientation
No ratings yet
LRP Orientation
17 pages
Presentation Emotion Introduction To Psychology
No ratings yet
Presentation Emotion Introduction To Psychology
21 pages
ChanceLight - RFP Questions by School 2 - 8 - 24 - Chancelight
No ratings yet
ChanceLight - RFP Questions by School 2 - 8 - 24 - Chancelight
5 pages
MVP Explained - A Systematic Mapping Study On The Definitions of Minimal Viable Product
No ratings yet
MVP Explained - A Systematic Mapping Study On The Definitions of Minimal Viable Product
8 pages
XXXXXXXXXX 4444444
No ratings yet
XXXXXXXXXX 4444444
7 pages
Business Communication CH 15
No ratings yet
Business Communication CH 15
4 pages
ClickOn Starter Unit 3 Day 2
No ratings yet
ClickOn Starter Unit 3 Day 2
6 pages
Update Resume
No ratings yet
Update Resume
1 page
1-Argumentative Essay
No ratings yet
1-Argumentative Essay
1 page
Daftar Pustaka
No ratings yet
Daftar Pustaka
3 pages
Call For Papers: Virtual
No ratings yet
Call For Papers: Virtual
2 pages
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
From Everand
The Subtle Art of Not Giving a F*ck: A Counterintuitive Approach to Living a Good Life
Mark Manson
4/5 (6458)
Principles: Life and Work
From Everand
Principles: Life and Work
Ray Dalio
4/5 (643)
Grit: The Power of Passion and Perseverance
From Everand
Grit: The Power of Passion and Perseverance
Angela Duckworth
4/5 (650)
Never Split the Difference: Negotiating As If Your Life Depended On It
From Everand
Never Split the Difference: Negotiating As If Your Life Depended On It
Chris Voss
4.5/5 (1005)
The Glass Castle: A Memoir
From Everand
The Glass Castle: A Memoir
Jeannette Walls
4.5/5 (1856)
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
From Everand
Elon Musk: Tesla, SpaceX, and the Quest for a Fantastic Future
Ashlee Vance
4.5/5 (582)
The Emperor of All Maladies: A Biography of Cancer
From Everand
The Emperor of All Maladies: A Biography of Cancer
Siddhartha Mukherjee
4.5/5 (298)
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
From Everand
The Gifts of Imperfection: Let Go of Who You Think You're Supposed to Be and Embrace Who You Are
Brené Brown
4/5 (1175)
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
From Everand
The Hard Thing About Hard Things: Building a Business When There Are No Easy Answers
Ben Horowitz
4.5/5 (361)
The Little Book of Hygge: Danish Secrets to Happy Living
From Everand
The Little Book of Hygge: Danish Secrets to Happy Living
Meik Wiking
3.5/5 (464)
A Man Called Ove: A Novel
From Everand
A Man Called Ove: A Novel
Fredrik Backman
4.5/5 (5181)
Steve Jobs
From Everand
Steve Jobs
Walter Isaacson
4.5/5 (1139)
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
From Everand
The Sympathizer: A Novel (Pulitzer Prize for Fiction)
Viet Thanh Nguyen
4.5/5 (141)
Yes Please
From Everand
Yes Please
Amy Poehler
4/5 (2016)
The Perks of Being a Wallflower
From Everand
The Perks of Being a Wallflower
Stephen Chbosky
4.5/5 (4103)
Shoe Dog: A Memoir by the Creator of Nike
From Everand
Shoe Dog: A Memoir by the Creator of Nike
Phil Knight
4.5/5 (629)
Angela's Ashes: A Memoir
From Everand
Angela's Ashes: A Memoir
Frank McCourt
4.5/5 (943)
Rise of ISIS: A Threat We Can't Ignore
From Everand
Rise of ISIS: A Threat We Can't Ignore
Jay Sekulow
3.5/5 (144)
The Yellow House: A Memoir (2019 National Book Award Winner)
From Everand
The Yellow House: A Memoir (2019 National Book Award Winner)
Sarah M. Broom
4/5 (100)
The Outsider: A Novel
From Everand
The Outsider: A Novel
Stephen King
4/5 (2885)
The Woman in Cabin 10
From Everand
The Woman in Cabin 10
Ruth Ware
3.5/5 (2814)
The Light Between Oceans: A Novel
From Everand
The Light Between Oceans: A Novel
M.L. Stedman
4.5/5 (815)
Team of Rivals: The Political Genius of Abraham Lincoln
From Everand
Team of Rivals: The Political Genius of Abraham Lincoln
Doris Kearns Goodwin
4.5/5 (244)
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
From Everand
Hidden Figures: The American Dream and the Untold Story of the Black Women Mathematicians Who Helped Win the Space Race
Margot Lee Shetterly
4/5 (1022)
The World Is Flat 3.0: A Brief History of the Twenty-first Century
From Everand
The World Is Flat 3.0: A Brief History of the Twenty-first Century
Thomas L. Friedman
3.5/5 (2289)
The Constant Gardener: A Novel
From Everand
The Constant Gardener: A Novel
John le Carré
4/5 (278)
Sing, Unburied, Sing: A Novel
From Everand
Sing, Unburied, Sing: A Novel
Jesmyn Ward
4/5 (1267)
Her Body and Other Parties: Stories
From Everand
Her Body and Other Parties: Stories
Carmen Maria Machado
4/5 (903)
The Unwinding: An Inner History of the New America
From Everand
The Unwinding: An Inner History of the New America
George Packer
4/5 (45)
Fear: Trump in the White House
From Everand
Fear: Trump in the White House
Bob Woodward
3.5/5 (836)
Bad Feminist: Essays
From Everand
Bad Feminist: Essays
Roxane Gay
4/5 (1090)
A Tree Grows in Brooklyn
From Everand
A Tree Grows in Brooklyn
Betty Smith
4.5/5 (2033)
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
From Everand
Devil in the Grove: Thurgood Marshall, the Groveland Boys, and the Dawn of a New America
Gilbert King
4.5/5 (280)
Little Women
From Everand
Little Women
Louisa May Alcott
4.5/5 (2369)
Manhattan Beach: A Novel
From Everand
Manhattan Beach: A Novel
Jennifer Egan
3.5/5 (919)
John Adams
From Everand
John Adams
David McCullough
4.5/5 (2546)
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
From Everand
A Heartbreaking Work Of Staggering Genius: A Memoir Based on a True Story
Dave Eggers
3.5/5 (233)
Wolf Hall: A Novel
From Everand
Wolf Hall: A Novel
Hilary Mantel
4/5 (4135)
The Art of Racing in the Rain: A Novel
From Everand
The Art of Racing in the Rain: A Novel
Garth Stein
4/5 (4372)
On Fire: The (Burning) Case for a Green New Deal
From Everand
On Fire: The (Burning) Case for a Green New Deal
Naomi Klein
4/5 (78)
Brooklyn: A Novel
From Everand
Brooklyn: A Novel
Colm Tóibín
3.5/5 (2141)

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Clustering Part-A

Uploaded by

Clustering Part-A

Uploaded by

High Impact Skills Development Program

in Artificial Intelligence, Data Science, and Blockchain

Module 2: Unsupervised Learning

Instructor: Dr. Nazia Perwaiz

Applicable in To identify patterns and To determine a specific output

Methods Computationally Simple

• Understanding raw data

• Similar to human mind

• The data is pretty large

• Preparing data for supervised learning

• Dimensionality reduction: reduces some data while

• Association: can find relationships between variables, i.e.

• to summarize a complex real-valued data point with

• that divides a group of n datasets

• into k different clusters/ subgroups

• based on the similarity and their mean distance from the

• Move the cluster

1. No re-assignment of the data points occurs

2. No relocation/ re-positioning of centroids

For all data points of

Average of points is recomputed

x(i) training example i

uc(i) cluster centroid of x(i)

from sklearn.cluster import KMeans

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.