Machine Learning
Clustering and k-Means Algorithm
Lecture – 7
Instructor: Qamar Askari
Outline
• Supervised vs. Unsupervised Learning
• Clustering
• K-Means algorithm
• Random initialization
• Choosing K – Elbow Method
• Implementation of K-Means Algorithm in Python
Supervised learning
Training set: $\{(x^{(1)}, y^{(1)}), (x^{(2)}, y^{(2)}), \ldots, (x^{(m)}, y^{(m)})\}$ (each example comes with a label $y$)
Unsupervised learning
Training set: $\{x^{(1)}, x^{(2)}, \ldots, x^{(m)}\}$ (no labels are given)
Clustering
• It is the task of identifying subgroups in the data such that data points
in the same subgroup (cluster) are very similar while data points in
different clusters are very different.
Example applications: market segmentation, social network analysis
K-means algorithm
An example on board
K-Means Algorithm
• 1. Initialize:
• Choose K random data points from the data set to represent the initial centers of the K partitions
• 2. Calculate the group memberships:
$$b_i^t = \begin{cases} 1 & \text{if } \lVert x^t - m_i \rVert = \min_j \lVert x^t - m_j \rVert \\ 0 & \text{otherwise} \end{cases}$$
• 3. Update the centroids:
$$m_i = \frac{\sum_t b_i^t \, x^t}{\sum_t b_i^t}$$
• 4. Repeat steps 2 and 3 until the centroids converge to stable values
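A minimal NumPy sketch of these four steps (the data set, K = 3, and the iteration cap are illustrative assumptions, not part of the lecture):

```python
import numpy as np

def kmeans(X, K, max_iters=100, seed=0):
    rng = np.random.default_rng(seed)
    # 1. Initialize: choose K random data points as the initial centers m_i
    m = X[rng.choice(len(X), size=K, replace=False)]
    for _ in range(max_iters):
        # 2. Group memberships: b_i^t = 1 for the nearest center, 0 otherwise
        dists = np.linalg.norm(X[:, None, :] - m[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # 3. Update centroids: mean of the points assigned to each cluster
        new_m = np.array([X[labels == i].mean(axis=0) if np.any(labels == i) else m[i]
                          for i in range(K)])
        # 4. Repeat steps 2 and 3 until the centers converge to stable values
        if np.allclose(new_m, m):
            break
        m = new_m
    return m, labels

X = np.random.rand(200, 2)          # placeholder data
centers, labels = kmeans(X, K=3)
```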
An empirical study
Initial centres
Ref: K. Javed et al., "The behavior of K-Means: An empirical study," International Conference on Electrical Engineering, 2008.
Application – Image color quantization
Figure: example image to be quantized.
• Look at the picture above: it does not contain all $256^3$ possible colors.
• Suppose we want to represent it using even fewer colors.
• K-means has an application to this problem, called color quantization.
Figures: original image (37,859 shades of color) and K-means quantized results for K = 2, 10, 15, and 20.
Figure: color quantization example for k = 2, 3, and 10.
Reference: Bishop 2006.
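A hedged sketch of color quantization with K-means (the file name, K value, and use of scikit-learn/Pillow are illustrative assumptions):

```python
import numpy as np
from PIL import Image
from sklearn.cluster import KMeans

# Load an image and treat every pixel as a point in 3-D RGB space
img = np.asarray(Image.open("photo.jpg").convert("RGB"), dtype=np.float64) / 255.0
h, w, _ = img.shape
pixels = img.reshape(-1, 3)

# Cluster the pixel colors into K groups
K = 10
km = KMeans(n_clusters=K, n_init=10, random_state=0).fit(pixels)

# Replace every pixel by its cluster centroid: the image now uses only K colors
quantized = km.cluster_centers_[km.labels_].reshape(h, w, 3)
Image.fromarray((quantized * 255).astype(np.uint8)).save("photo_quantized.jpg")
```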
K-means for non-separated clusters
Example: T-shirt sizing (figure: weight vs. height).
K-Means and Globular/Non-Globular Structures
K-means assumes roughly globular (spherical) clusters, so it handles globular structures well but struggles on non-globular ones.
Figure: a globular structure vs. a non-globular structure.
Figure: K-means applied to a non-globular structure.
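A small sketch illustrating this limitation on scikit-learn's synthetic "two moons" data (an assumed stand-in for the lecture's non-globular figure):

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_moons

# Two interleaved, non-globular clusters
X, y_true = make_moons(n_samples=300, noise=0.05, random_state=0)
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)

# Compare the K-means partition with the true moon labels
# (labels are arbitrary, so check both orientations)
agreement = max(np.mean(labels == y_true), np.mean(labels != y_true))
print(f"Fraction of points matching the true moons: {agreement:.2f}")
```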
K-Means is sensitive to outliers
Handling Outliers
1. Remove/ignore the outlier(s). Outliers can be identified from the distance of a point to its assigned centroid (see the sketch below).
2. Assign the outlier(s) to an extra cluster of their own.
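A sketch of option 1, flagging outliers by their distance to the assigned centroid (the data and the 3-standard-deviation threshold are illustrative assumptions):

```python
import numpy as np
from sklearn.cluster import KMeans

X = np.random.rand(300, 2)  # placeholder data
km = KMeans(n_clusters=3, n_init=10, random_state=0).fit(X)

# Distance of every point to the centroid of its own cluster
dists = np.linalg.norm(X - km.cluster_centers_[km.labels_], axis=1)

# Flag points whose distance is unusually large
threshold = dists.mean() + 3 * dists.std()
outliers = np.where(dists > threshold)[0]
print("Possible outliers:", outliers)
```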
Random initialization
Should have $K < m$ (fewer clusters than training examples).
Randomly pick $K$ training examples.
Set $\mu_1, \ldots, \mu_K$ equal to these $K$ examples.
Local optima
Depending on the initial centroids, K-means can converge to different local optima of the cost function, some of which correspond to poor clusterings.
Random initialization
For i = 1 to 100 {
    Randomly initialize K-means.
    Run K-means. Get $c^{(1)}, \ldots, c^{(m)}, \mu_1, \ldots, \mu_K$.
    Compute the cost function (distortion/WCSS) $J$.
}
Pick the clustering that gave the lowest cost $J$.
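A sketch of this procedure in Python (scikit-learn's KMeans with n_init=1 and init="random" stands in for a single randomly initialized run; the data and the number of restarts are illustrative assumptions):

```python
import numpy as np
from sklearn.cluster import KMeans

X = np.random.rand(300, 2)            # placeholder data

best_km, best_cost = None, np.inf
for i in range(100):
    # Randomly initialize and run K-means once
    km = KMeans(n_clusters=3, init="random", n_init=1, random_state=i).fit(X)
    # inertia_ is the distortion/WCSS cost of the resulting clustering
    if km.inertia_ < best_cost:
        best_cost, best_km = km.inertia_, km

# Keep the clustering with the lowest cost
labels, centroids = best_km.labels_, best_km.cluster_centers_
```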
What is the right value of K?
Choosing the value of K
Elbow method:
Plot the cost function (distortion) against the number of clusters K and pick the K at the "elbow", where the decrease in cost slows down sharply. Sometimes the curve shows no clear elbow and the choice of K remains ambiguous.
Figure: cost function vs. K (no. of clusters), with and without a clear elbow.
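A sketch of the elbow method (the data set and the range K = 1..8 are illustrative assumptions):

```python
import numpy as np
import matplotlib.pyplot as plt
from sklearn.cluster import KMeans

X = np.random.rand(300, 2)  # placeholder data

# Within-cluster sum of squares (inertia) for each candidate K
ks = range(1, 9)
costs = [KMeans(n_clusters=k, n_init=10, random_state=0).fit(X).inertia_ for k in ks]

plt.plot(ks, costs, "o-")
plt.xlabel("K (no. of clusters)")
plt.ylabel("Cost function (WCSS)")
plt.show()
```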
Choosing the value of K
Sometimes, you’re running K-means to get clusters to use for some
later/downstream purpose. Evaluate K-means based on a metric for
how well it performs for that later purpose.
E.g. T-shirt sizing: figures of weight vs. height, clustered with two different choices of K.
Implementation of K-Means Algorithm
Discussion from Google Colab
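The Colab notebook itself is not reproduced here; below is a minimal, assumed usage sketch with scikit-learn's KMeans on synthetic data:

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

# Synthetic data with 3 ground-truth clusters (placeholder for the Colab data)
X, _ = make_blobs(n_samples=300, centers=3, random_state=42)

km = KMeans(n_clusters=3, n_init=10, random_state=0).fit(X)
print("Centroids:\n", km.cluster_centers_)
print("First 10 cluster assignments:", km.labels_[:10])
print("Distortion (WCSS):", km.inertia_)
```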