Customer Segmentation using Clustering (K-Means)
Customer segmentation means grouping customers into different clusters
based on their purchasing behavior or attributes. This helps businesses tailor
marketing strategies or services to each group.
Clustering is an unsupervised machine learning technique that automatically
finds natural groupings in data without pre-labeled categories.
Why K-Means Clustering?
K-Means partitions data points into K clusters, assigning each point to the
cluster with the nearest mean (centroid).
It’s simple, efficient, and widely used in marketing segmentation.
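The "nearest mean" idea can be sketched in a few lines of NumPy. This is a minimal illustration of the algorithm, not the production scikit-learn implementation (which adds k-means++ initialization and empty-cluster handling); the two-blob data here is made up for demonstration:

```python
import numpy as np

def kmeans(X, k, n_iter=100, seed=0):
    rng = np.random.default_rng(seed)
    # Initialize centroids by sampling k points from the data
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        # Assignment step: each point joins the cluster with the nearest mean
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Update step: recompute each cluster's mean
        new_centers = np.array([X[labels == j].mean(axis=0) for j in range(k)])
        if np.allclose(new_centers, centers):
            break  # centroids stopped moving: converged
        centers = new_centers
    return labels, centers

# Two well-separated synthetic blobs of 50 points each
X = np.vstack([np.random.default_rng(1).normal(0, 0.5, (50, 2)),
               np.random.default_rng(2).normal(5, 0.5, (50, 2))])
labels, centers = kmeans(X, k=2)
```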
Step-by-step Explanation
1. Dataset
We consider two features for each customer:
o Annual Income (in thousands)
o Spending Score (a score from 1 to 100 that indicates how much
the customer spends)
2. Data Standardization
Since these features have different scales, we standardize them to have
zero mean and unit variance. This prevents bias where features with
larger scales dominate the clustering.
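A quick sketch of what standardization does, using scikit-learn's StandardScaler on a few made-up income/score pairs:

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

# Income in thousands vs. score on 1-100: very different scales
X = np.array([[15., 39.], [16., 81.], [120., 6.], [137., 83.]])
scaled = StandardScaler().fit_transform(X)

# Each column now has zero mean and unit variance,
# so neither feature dominates the distance computation
print(scaled.mean(axis=0))
print(scaled.std(axis=0))
```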
3. Choosing Number of Clusters (K)
We use the Elbow Method:
o Run K-Means for a range of K (say 1 to 10).
o Calculate the within-cluster sum of squares (WCSS), i.e. the sum of
squared distances between points and their cluster centers.
o Plot WCSS vs K and look for the "elbow" where adding more
clusters does not reduce WCSS significantly.
o This “elbow” point indicates a good trade-off between model
complexity and explained variance.
4. Run K-Means
Using the chosen K, cluster the data points.
5. Interpretation
Each cluster represents a group of customers with similar income and
spending patterns, helping businesses understand customer profiles like:
o High income, high spending
o Low income, low spending
o Medium income, high spending, etc.
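Once each customer has a cluster label, per-cluster feature means make these profiles concrete. A small sketch with hypothetical clustered data (the values and labels below are invented for illustration):

```python
import pandas as pd

# Hypothetical clustered output: income (k$), spending score, assigned cluster
df = pd.DataFrame({
    'Annual Income (k$)':     [20, 25, 130, 140, 70, 75],
    'Spending Score (1-100)': [80, 75, 15, 10, 90, 85],
    'Cluster':                [0, 0, 1, 1, 2, 2],
})

# Per-cluster means reveal the profile of each segment,
# e.g. cluster 0 = low income / high spending
profile = df.groupby('Cluster').mean()
print(profile)
```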
Full runnable code with output plot
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# 1. Create sample data
np.random.seed(42)
data = {
    'CustomerID': range(1, 201),
    'Annual Income (k$)': np.random.randint(15, 150, 200),
    'Spending Score (1-100)': np.random.randint(1, 100, 200)
}
df = pd.DataFrame(data)

# 2. Visualize data distribution
plt.figure(figsize=(8, 5))
sns.scatterplot(x='Annual Income (k$)', y='Spending Score (1-100)', data=df)
plt.title('Customer Data Distribution')
plt.show()

# 3. Scale features
features = df[['Annual Income (k$)', 'Spending Score (1-100)']]
scaler = StandardScaler()
scaled_features = scaler.fit_transform(features)

# 4. Elbow method to find optimal K
# n_init is set explicitly because its default changed in recent scikit-learn versions
wcss = []
for i in range(1, 11):
    kmeans = KMeans(n_clusters=i, n_init=10, random_state=42)
    kmeans.fit(scaled_features)
    wcss.append(kmeans.inertia_)

plt.figure(figsize=(8, 5))
plt.plot(range(1, 11), wcss, marker='o')
plt.title('Elbow Method For Optimal K')
plt.xlabel('Number of clusters')
plt.ylabel('WCSS')
plt.show()

# 5. From the elbow plot, let's choose K=5
kmeans = KMeans(n_clusters=5, n_init=10, random_state=42)
df['Cluster'] = kmeans.fit_predict(scaled_features)

# 6. Visualize clusters
plt.figure(figsize=(8, 5))
sns.scatterplot(x='Annual Income (k$)', y='Spending Score (1-100)',
                hue='Cluster', palette='Set1', data=df)
plt.title('Customer Segments (K=5)')
plt.show()

# 7. Print cluster centers in original scale (optional)
centers = scaler.inverse_transform(kmeans.cluster_centers_)
print("Cluster centers (Annual Income, Spending Score):")
print(centers)
In summary:
o Created synthetic customer data.
o Standardized the features.
o Used the Elbow Method to find the optimal number of clusters.
o Applied K-Means clustering.
o Visualized the customer segments.
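As a complementary check on the choice of K, the silhouette score measures how well each point fits its own cluster versus the nearest other cluster (higher is better). A minimal sketch on synthetic blob data standing in for customer segments (the data here is invented; scikit-learn's silhouette_score is assumed available):

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score

rng = np.random.default_rng(42)
# Three well-separated synthetic blobs standing in for three segments
X = np.vstack([rng.normal(loc, 0.5, (50, 2)) for loc in (0, 5, 10)])

# Score each candidate K; the true structure (K=3) should score highest
scores = {}
for k in range(2, 7):
    labels = KMeans(n_clusters=k, n_init=10, random_state=42).fit_predict(X)
    scores[k] = silhouette_score(X, labels)
    print(f"K={k}: silhouette = {scores[k]:.3f}")
```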