Workshop Project Report

This project report describes using the DBSCAN and K-Means clustering algorithms to segment customers by purchasing behavior, based on transactional records. The methodology covers data preprocessing, implementing both algorithms in Python, and comparing the models with silhouette score and inertia. The report concludes that K-Means was simple and produced well-defined customer clusters, while DBSCAN required careful parameter tuning. Results and recommendations are included.

Uploaded by

Rajveer Singh

Workshop Project Report

Year of Submission: 2023-24

Submitted by:
Divyanshu Khandelwal, 2115500055, 3S, Class roll no. 22
Suryansh Agrawal, 2115500147, 3S, Class roll no. 42
Sonal Mittal, 2115500140, 3S, Class roll no. 40
Anshika Singh, 2115500024, 3S, Class roll no. 10

Department of Computer Engineering and Applications

GLA University, Mathura

Project Report: Customer Segmentation through Clustering Analysis

Introduction:
Customer segmentation is a crucial aspect of marketing
strategies. Clustering algorithms aid in identifying patterns
within data to categorize customers into groups with similar
traits. This project utilizes two clustering algorithms—DBSCAN
and K-Means—to segment customers based on their
purchasing behavior.

Dataset:
The dataset used in this project contains transactional records
from a retail store. It includes attributes such as customer ID,
purchase history, frequency of purchases, and total amount
spent.

Methodology:

Data Preprocessing

1. Data Cleaning: Removing duplicates, handling missing
values, and ensuring data consistency.

2. Feature Selection: Choosing relevant attributes for
clustering, such as purchase frequency and total
spending.

3. Feature Scaling: Normalizing numerical features to a
common scale so that no single feature dominates the
distance calculations.
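
The three preprocessing steps above can be sketched as follows. This is a minimal illustration, not the report's actual code: the toy records, the column names (customer_id, purchase_frequency, total_spent), the median imputation, and the choice of StandardScaler are all assumptions, since the report does not state which cleaning strategy or scaler was used.

```python
import numpy as np
import pandas as pd
from sklearn.preprocessing import StandardScaler

# Hypothetical toy records; column names and values are illustrative only.
raw = pd.DataFrame({
    "customer_id": [1, 2, 2, 3, 4, 5],
    "purchase_frequency": [12, 3, 3, 25, np.nan, 7],
    "total_spent": [340.0, 55.5, 55.5, 1210.0, 89.9, np.nan],
})

# 1. Data cleaning: drop exact duplicate rows, then fill missing values
#    with the column median (one of several reasonable strategies).
clean = raw.drop_duplicates().copy()
feature_cols = ["purchase_frequency", "total_spent"]
clean[feature_cols] = clean[feature_cols].fillna(clean[feature_cols].median())

# 2. Feature selection: keep only the attributes used for clustering.
features = clean[feature_cols]

# 3. Feature scaling: standardize each feature to zero mean, unit variance.
X_scaled = StandardScaler().fit_transform(features)

print(X_scaled.round(2))
```

If the features should be bounded to a fixed range instead, scikit-learn's MinMaxScaler could be swapped in at step 3.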

Clustering Algorithms

1. DBSCAN (Density-Based Spatial Clustering of Applications
with Noise)
- DBSCAN identifies clusters based on density. It groups
together points that are closely packed and labels points
in low-density regions as noise.
- Parameters: Epsilon (ε), the neighborhood radius, and
Minimum Points (MinPts), the number of neighbors required
to form a dense region.
- Advantages: Robust to outliers and doesn’t require
specifying the number of clusters.
- Implementation: Using scikit-learn's DBSCAN algorithm.

2. K-Means Clustering
- K-Means partitions data into K clusters by assigning each
point to its nearest centroid.
- Parameters: Number of clusters (K).
- Advantages: Simple, scalable, and efficient for large
datasets.
- Implementation: Utilizing scikit-learn's KMeans algorithm.
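
As a rough sketch of how the two scikit-learn estimators named above are typically called, the example below fits both to the same data. The synthetic blobs stand in for the scaled customer features, and the values eps=0.3, min_samples=5, and n_clusters=3 are illustrative assumptions, not the settings used in the project.

```python
import numpy as np
from sklearn.cluster import DBSCAN, KMeans
from sklearn.datasets import make_blobs
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the scaled customer features (an assumption).
X_raw, _ = make_blobs(n_samples=300, centers=3, cluster_std=0.6, random_state=42)
X = StandardScaler().fit_transform(X_raw)

# DBSCAN: eps is the neighborhood radius, min_samples plays the role of MinPts.
dbscan = DBSCAN(eps=0.3, min_samples=5)
dbscan_labels = dbscan.fit_predict(X)
n_dbscan_clusters = len(set(dbscan_labels) - {-1})  # label -1 marks noise
n_noise = int(np.sum(dbscan_labels == -1))

# K-Means: the number of clusters K must be chosen in advance.
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0)
kmeans_labels = kmeans.fit_predict(X)

print("DBSCAN clusters:", n_dbscan_clusters, "noise points:", n_noise)
print("K-Means centroids:")
print(kmeans.cluster_centers_.round(2))
```

Note the difference in output: DBSCAN may leave some points unassigned (label -1), whereas K-Means assigns every point to a cluster.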

Model Building and Evaluation

DBSCAN Model
- Identified clusters based on varying epsilon values and
minimum points.
- Evaluated silhouette scores and visualized clusters using
scatter plots.
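
A parameter sweep of the kind described for DBSCAN might look like the sketch below. The grids of eps and min_samples values and the synthetic data are assumptions for illustration. Excluding noise points before scoring is also a choice made here; the report does not say how noise was treated when computing the silhouette score.

```python
from itertools import product

import numpy as np
from sklearn.cluster import DBSCAN
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the scaled customer features (an assumption).
X_raw, _ = make_blobs(n_samples=300, centers=3, cluster_std=0.6, random_state=42)
X = StandardScaler().fit_transform(X_raw)

results = []
for eps, min_pts in product([0.1, 0.2, 0.3, 0.5], [3, 5, 10]):
    labels = DBSCAN(eps=eps, min_samples=min_pts).fit_predict(X)
    mask = labels != -1                      # drop noise before scoring
    n_clusters = len(set(labels[mask]))
    # The silhouette score is only defined for 2 <= clusters <= samples - 1.
    if n_clusters < 2 or n_clusters >= mask.sum():
        score = float("nan")
    else:
        score = silhouette_score(X[mask], labels[mask])
    results.append((eps, min_pts, n_clusters, int((~mask).sum()), score))

for eps, min_pts, n_clusters, n_noise, score in results:
    print(f"eps={eps:<4} MinPts={min_pts:<3} clusters={n_clusters:<3} "
          f"noise={n_noise:<4} silhouette={score:.3f}")
```

Reporting the noise count next to each score matters, because a setting that discards most points as noise can still score well on the few points that remain.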

K-Means Model
- Explored different K values to find optimal clusters.
- Assessed the inertia scores and visualized clusters using
scatter plots.
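
Exploring different K values can be sketched as a simple loop that records inertia and silhouette score for each candidate. The range 2 to 8 and the synthetic data are assumptions; picking K by the highest silhouette score is one common heuristic, alongside looking for the "elbow" in the inertia values.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the scaled customer features (an assumption).
X_raw, _ = make_blobs(n_samples=300, centers=4, cluster_std=0.8, random_state=7)
X = StandardScaler().fit_transform(X_raw)

ks = range(2, 9)
inertias, silhouettes = [], []
for k in ks:
    model = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
    inertias.append(model.inertia_)                      # within-cluster sum of squares
    silhouettes.append(silhouette_score(X, model.labels_))

# One heuristic: choose the K with the highest silhouette score.
best_k = max(zip(ks, silhouettes), key=lambda pair: pair[1])[0]

for k, inertia, sil in zip(ks, inertias, silhouettes):
    print(f"K={k}  inertia={inertia:8.2f}  silhouette={sil:.3f}")
print("K with highest silhouette:", best_k)
```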
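
Comparative Study

Performance Metrics
- Silhouette Score: Measures how close each point is to its
own cluster compared with the nearest other cluster. It
ranges from -1 to 1, and higher scores indicate
better-defined clusters.
- Inertia: The sum of squared distances from each point to
its assigned cluster centroid. Lower values indicate more
compact clusters, although inertia always tends to decrease
as K increases.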
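
To make the two metrics concrete, the sketch below recomputes them by hand and compares the results with scikit-learn's own values. The data and K=3 are illustrative assumptions. For each point, a is the mean distance to the other points in its cluster, b is the lowest mean distance to the points of any other cluster, and the point's silhouette value is (b - a) / max(a, b).

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import pairwise_distances, silhouette_score

# Synthetic data and K=3 are assumptions for illustration.
X, _ = make_blobs(n_samples=150, centers=3, cluster_std=1.0, random_state=1)
kmeans = KMeans(n_clusters=3, n_init=10, random_state=0).fit(X)
labels, centers = kmeans.labels_, kmeans.cluster_centers_

# Inertia: sum of squared distances from each point to its assigned centroid.
manual_inertia = float(((X - centers[labels]) ** 2).sum())

# Silhouette: per-point (b - a) / max(a, b), averaged over all points.
D = pairwise_distances(X)
sil = np.zeros(len(X))
for i in range(len(X)):
    same = labels == labels[i]
    n_same = same.sum()
    if n_same == 1:          # singleton clusters get a silhouette of 0
        continue
    a = D[i, same].sum() / (n_same - 1)   # D[i, i] is 0, so self is excluded
    b = min(D[i, labels == c].mean() for c in np.unique(labels) if c != labels[i])
    sil[i] = (b - a) / max(a, b)
manual_silhouette = float(sil.mean())

print("inertia:   ", manual_inertia, "vs", kmeans.inertia_)
print("silhouette:", manual_silhouette, "vs", silhouette_score(X, labels))
```

Inertia is only defined relative to centroids, so it applies to K-Means but not directly to DBSCAN. The silhouette score can be computed for either algorithm.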

Results and Observations

- DBSCAN: Showed varying performance with different
parameter settings. Achieved silhouette score of X.
- K-Means: Found an optimal number of clusters (K) with
silhouette score of Y and inertia value of Z.

Conclusion

- Both algorithms effectively segmented customers based on
purchasing behavior.
- DBSCAN proved robust to outliers but required careful
parameter tuning.
- K-Means demonstrated simplicity and scalability, providing
well-defined clusters with optimal K values.

Recommendations

- For datasets with clear cluster densities, DBSCAN can be a
suitable choice.
- In scenarios where scalability and simplicity are vital,
K-Means can be preferred.

Future Work

- Experiment with other clustering algorithms like Hierarchical
Clustering or Gaussian Mixture Models.
- Incorporate additional features or external data sources for
more robust segmentation.

---

This report provides an overview of customer segmentation
using DBSCAN and K-Means algorithms, highlighting their
strengths, weaknesses, and comparative performance.
