Nata Code

The document outlines a Python script that performs KMeans clustering on a dataset imported from an Excel file. It calculates the Within-Cluster Sum of Squares (WCSS) for different cluster counts to determine the optimal number of clusters using the Elbow Method. The script also visualizes the clusters in PCA-reduced space and saves the clustered dataset to a new Excel file.

Uploaded by

Haris Saleem

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

33 views2 pages

Nata Code

Uploaded by

Haris Saleem

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

import matplotlib.

pyplot as plt
import pandas as pd
from sklearn.cluster import KMeans

dataset = pd.read_excel('nata.xlsx')
X = dataset.iloc[:, [10,11,12,13]].values
wcss = []
for i in range(1, 11):
kmeans = KMeans(n_clusters=i, init='k-means++', random_state=42)
kmeans.fit(X)
wcss.append(kmeans.inertia_)
print(f"WCSS for {i} clusters: {kmeans.inertia_}")

plt.plot(range(1, 11), wcss)

plt.xlabel('Number of clusters')
plt.ylabel('WCSS')
plt.title('Elbow Method')
plt.show()

kmeans = KMeans(n_clusters=4, init="k-means++", random_state=42)

y_kmeans = kmeans.fit_predict(X)

dataset['Cluster'] = y_kmeans
dataset.to_excel("nata_clusters.xlsx", index=False)

#import seaborn as sns

#sns.pairplot(dataset[['Income','Recency','Cluster']], hue='Cluster', diag_kind='kde')
#plt.show()

# transformation for Plot 4 dimensions reduced to 2 dimensions

from sklearn.decomposition import PCA

pca = PCA(n_components=2)
X_pca = pca.fit_transform(X)

plt.scatter(X_pca[:, 0], X_pca[:, 1], c=y_kmeans, cmap='rainbow', alpha=0.7)

plt.xlabel('Principal Component 1')
plt.ylabel('Principal Component 2')
plt.title('Clusters in PCA Space')
plt.show()

Kmeans Clustering Code
No ratings yet
Kmeans Clustering Code
2 pages
Market Analysis by Pchandru
No ratings yet
Market Analysis by Pchandru
10 pages
Iris Unsupervised Cluster
No ratings yet
Iris Unsupervised Cluster
1 page
Psii Viii
No ratings yet
Psii Viii
2 pages
DataScience All 1to8
No ratings yet
DataScience All 1to8
6 pages
Elbow Method Using Sns
No ratings yet
Elbow Method Using Sns
3 pages
K-Means 10
No ratings yet
K-Means 10
2 pages
ML Lab Experiment Shortened With Same Output
No ratings yet
ML Lab Experiment Shortened With Same Output
6 pages
Clustering 1
No ratings yet
Clustering 1
3 pages
Assignment 1
No ratings yet
Assignment 1
2 pages
KMeans Clustering Guide
No ratings yet
KMeans Clustering Guide
5 pages
MLT Exp 09
No ratings yet
MLT Exp 09
3 pages
Ex No 9
No ratings yet
Ex No 9
1 page
Elbow Method
No ratings yet
Elbow Method
2 pages
Avinash Tiwari 9
No ratings yet
Avinash Tiwari 9
4 pages
ML
No ratings yet
ML
7 pages
Linear SVM: 'Target'
No ratings yet
Linear SVM: 'Target'
13 pages
K-Means Clustering Implementation
No ratings yet
K-Means Clustering Implementation
3 pages
Slip Clustering
No ratings yet
Slip Clustering
2 pages
ML
No ratings yet
ML
11 pages
Lab Report6 - B21CI014
No ratings yet
Lab Report6 - B21CI014
8 pages
Vid 4
No ratings yet
Vid 4
6 pages
Practical 5
No ratings yet
Practical 5
6 pages
Output Xerox
No ratings yet
Output Xerox
12 pages
Mlda - Lab
No ratings yet
Mlda - Lab
35 pages
K Means Clustering
No ratings yet
K Means Clustering
1 page
Assignment 4
No ratings yet
Assignment 4
9 pages
Implement Clustering Algorithms For Unsupervised Classification
No ratings yet
Implement Clustering Algorithms For Unsupervised Classification
4 pages
New K Means - Jupyter Notebook
No ratings yet
New K Means - Jupyter Notebook
4 pages
MLL
No ratings yet
MLL
2 pages
K-Means Clustering in Python
No ratings yet
K-Means Clustering in Python
2 pages
Iris Dataset PCA Analysis Code
No ratings yet
Iris Dataset PCA Analysis Code
21 pages
DS Prac 8
No ratings yet
DS Prac 8
4 pages
Pramkk
No ratings yet
Pramkk
10 pages
PMA Experiment 2
No ratings yet
PMA Experiment 2
6 pages
AAM 7th Prac
No ratings yet
AAM 7th Prac
4 pages
ML - Unit-6 KMeans
No ratings yet
ML - Unit-6 KMeans
20 pages
Income (K-Means Clustering On A Sample Data Set)
No ratings yet
Income (K-Means Clustering On A Sample Data Set)
3 pages
Slip
No ratings yet
Slip
5 pages
K Means Clustering
No ratings yet
K Means Clustering
6 pages
ML 2.3 Prashant
No ratings yet
ML 2.3 Prashant
4 pages
Dulal Mondal LabReport-01
No ratings yet
Dulal Mondal LabReport-01
7 pages
8 Taks
No ratings yet
8 Taks
3 pages
ML Experiment WithDataset
No ratings yet
ML Experiment WithDataset
23 pages
AML Lab
No ratings yet
AML Lab
14 pages
Aiml Assignment 10
No ratings yet
Aiml Assignment 10
6 pages
Kmeans
No ratings yet
Kmeans
4 pages
Document
No ratings yet
Document
4 pages
Minor Lab
No ratings yet
Minor Lab
4 pages
Clustering Techniques in Python Analysis
No ratings yet
Clustering Techniques in Python Analysis
10 pages
Final Code
No ratings yet
Final Code
3 pages
22F-3437 22F-3407 Assignment 4 Ai
No ratings yet
22F-3437 22F-3407 Assignment 4 Ai
15 pages
K-means Clustering on Iris Dataset
No ratings yet
K-means Clustering on Iris Dataset
3 pages
Kmeansclustering Sales Dataset
No ratings yet
Kmeansclustering Sales Dataset
6 pages
Aiml Lab
No ratings yet
Aiml Lab
37 pages
ML Minors Exp7
No ratings yet
ML Minors Exp7
6 pages
Chapter-2 1.: #Print (Train - Data)
No ratings yet
Chapter-2 1.: #Print (Train - Data)
9 pages
Quiz Material 2
No ratings yet
Quiz Material 2
2 pages
Haris Resume Updated.
No ratings yet
Haris Resume Updated.
4 pages
MSBA Spring 25 Evaluation Pre Mid
No ratings yet
MSBA Spring 25 Evaluation Pre Mid
3 pages
Ha LeThiThanh Lop NguyenThiHong HOWGAMIFICATIONELEMENTSMOTIVATEBRANDLOVEOF
No ratings yet
Ha LeThiThanh Lop NguyenThiHong HOWGAMIFICATIONELEMENTSMOTIVATEBRANDLOVEOF
10 pages
NewAgeMarketing AIPersonalizationinaDigitalWorld IARJSET2024 11346
0% (1)
NewAgeMarketing AIPersonalizationinaDigitalWorld IARJSET2024 11346
10 pages

Nata Code

Uploaded by

Nata Code

Uploaded by

import matplotlib.

plt.plot(range(1, 11), wcss)

kmeans = KMeans(n_clusters=4, init="k-means++", random_state=42)

#import seaborn as sns

# transformation for Plot 4 dimensions reduced to 2 dimensions

plt.scatter(X_pca[:, 0], X_pca[:, 1], c=y_kmeans, cmap='rainbow', alpha=0.7)

You might also like