Data Clustering with K-Means

Uploaded by

anandadeepbala

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

125 views3 pages

Data Clustering with K-Means

Uploaded by

anandadeepbala

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

ASSIGNMENT NUMBER 3 SOLUTION

#Import libraries
import numpy as np # linear algebra
import pandas as pd # data processing, CSV file I/O (e.g. pd.read_csv)
import matplotlib.pyplot as plt # for data visualization
import seaborn as sns # for statistical data visualization
%matplotlib inline

# Input data files are available in the "../input/" directory.

# For example, running this (by clicking run or pressing Shift+Enter)
will list all files under the input directory

import os
for dirname, _, filenames in os.walk('/kaggle/input'):
for filename in filenames:
print(os.path.join(dirname, filename))
import warnings
warnings.filterwarnings('ignore')
#Importing the dataset
data = '/content/Live.csv'
df = pd.read_csv(data)

#Drop redundant columns

df.drop(['Column1', 'Column2', 'Column3', 'Column4'], axis=1,
inplace=True)
df.info()
df.describe()

# view the labels in the variable

df['status_type'].unique()
df.drop(['status_id', 'status_published'], axis=1, inplace=True)
df.head()
#Declaration of feature vector and target variable
X = df
y = df['status_type']

#Conversion of categorical variable into integers

from sklearn.preprocessing import LabelEncoder
le = LabelEncoder()
X['status_type'] = le.fit_transform(X['status_type'])
y = le.transform(y)
#K-Means model with two clusters
from sklearn.cluster import KMeans
kmeans = KMeans(n_clusters=2, random_state=0)
kmeans.fit(X)
labels = kmeans.labels_
# check how many of the samples were correctly labeled
correct_labels = sum(y == labels)
print("Result: %d out of %d samples were correctly labeled." %
(correct_labels, y.size))
print('Accuracy score: {0:0.2f}'. format(correct_labels/float(y.size)))
#K-Means model with 3 clusters
kmeans = KMeans(n_clusters=3, random_state=0)
kmeans.fit(X)

# check how many of the samples were correctly labeled

labels = kmeans.labels_
correct_labels = sum(y == labels)
print("Result: %d out of %d samples were correctly labeled." %
(correct_labels, y.size))
print('Accuracy score: {0:0.2f}'. format(correct_labels/float(y.size)))
#Use elbow method to find optimal number of clusters
from sklearn.cluster import KMeans
cs = []
for i in range(1, 11):
kmeans = KMeans(n_clusters = i, init = 'k-means++', max_iter = 300,
n_init = 10, random_state = 0)
kmeans.fit(X)
cs.append(kmeans.inertia_)
plt.plot(range(1, 11), cs)
plt.title('The Elbow Method')
plt.xlabel('Number of clusters')
plt.ylabel('CS')
plt.show()
#By the above plot, we can see that there is a kink at k=2.Hence k=2
can be considered a good number of the cluster to cluster this data.

DATASET LINK: https://www.semanticscholar.org/paper/Dataset-on-usage-and-

engagement-patterns-for-Live-Dehouche/c0ec91003f8bdce99a56fa60dc2d20268cc808b8

DM ML Practical
No ratings yet
DM ML Practical
13 pages
Heart Disease Prediction Guide
100% (1)
Heart Disease Prediction Guide
73 pages
Lab Report6 - B21CI014
No ratings yet
Lab Report6 - B21CI014
8 pages
Advanced Machine Learning Experiments
No ratings yet
Advanced Machine Learning Experiments
15 pages
Python For Data Science IA 1 Programs
No ratings yet
Python For Data Science IA 1 Programs
14 pages
K-Means Clustering From Scratch
No ratings yet
K-Means Clustering From Scratch
3 pages
K-Means Clustering Guide
No ratings yet
K-Means Clustering Guide
26 pages
ML Codes
No ratings yet
ML Codes
9 pages
Lab Manual
No ratings yet
Lab Manual
9 pages
Minor Lab
No ratings yet
Minor Lab
4 pages
LAB7 Kmeans
No ratings yet
LAB7 Kmeans
11 pages
ML 2.3 Prashant
No ratings yet
ML 2.3 Prashant
4 pages
Mlda - Lab
No ratings yet
Mlda - Lab
35 pages
Document
No ratings yet
Document
4 pages
Experiment 9
No ratings yet
Experiment 9
10 pages
Assignment 4
No ratings yet
Assignment 4
9 pages
Clustering Algorithms in Machine Learning
No ratings yet
Clustering Algorithms in Machine Learning
6 pages
ML Exp5 C36
No ratings yet
ML Exp5 C36
18 pages
Aam Codes
No ratings yet
Aam Codes
8 pages
S6 - Data Mining Lab Experiments (Except 1)
No ratings yet
S6 - Data Mining Lab Experiments (Except 1)
6 pages
Machine Learning Lab
No ratings yet
Machine Learning Lab
20 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
44 pages
EX - NO:3: Algorithm
No ratings yet
EX - NO:3: Algorithm
11 pages
AAM 7th Prac
No ratings yet
AAM 7th Prac
4 pages
ML Lab Mannual
No ratings yet
ML Lab Mannual
29 pages
Naive Bayes Algorithm
No ratings yet
Naive Bayes Algorithm
51 pages
Bone Suplement Market Segmentation
No ratings yet
Bone Suplement Market Segmentation
20 pages
Python ML Algorithms Guide
No ratings yet
Python ML Algorithms Guide
7 pages
Machine Learning: Supervised /unsupervised
No ratings yet
Machine Learning: Supervised /unsupervised
33 pages
DWM Exp4
No ratings yet
DWM Exp4
9 pages
ML Minors Exp7
No ratings yet
ML Minors Exp7
6 pages
ML2 Practical List
No ratings yet
ML2 Practical List
80 pages
Aiml Lab
No ratings yet
Aiml Lab
37 pages
Python For Data Science IA 1 Programs
No ratings yet
Python For Data Science IA 1 Programs
14 pages
Machine Learning Algorithms Guide
No ratings yet
Machine Learning Algorithms Guide
34 pages
Subject: ML Name: Priyanshu Gandhi Date: 10/4/21 Expt. No.: 9 Roll No.: C008 Title: Clustering Implementation in Python
No ratings yet
Subject: ML Name: Priyanshu Gandhi Date: 10/4/21 Expt. No.: 9 Roll No.: C008 Title: Clustering Implementation in Python
7 pages
K Means Clustering - Experiment 12
No ratings yet
K Means Clustering - Experiment 12
3 pages
Statistic Inference Unit 2 Notes
No ratings yet
Statistic Inference Unit 2 Notes
34 pages
ML Lab Manual
No ratings yet
ML Lab Manual
12 pages
K-Means Clustering Algorithm
No ratings yet
K-Means Clustering Algorithm
17 pages
KNN Final
No ratings yet
KNN Final
4 pages
KNN - Predictive Analysis
No ratings yet
KNN - Predictive Analysis
6 pages
LAB-4 Report
No ratings yet
LAB-4 Report
21 pages
ML
No ratings yet
ML
11 pages
Elbow Method
No ratings yet
Elbow Method
2 pages
23BCE7092 ML Lab Assignment
No ratings yet
23BCE7092 ML Lab Assignment
14 pages
DataScience All 1to8
No ratings yet
DataScience All 1to8
6 pages
ML Notes 1
No ratings yet
ML Notes 1
3 pages
K-means Clustering on Iris Dataset
No ratings yet
K-means Clustering on Iris Dataset
3 pages
Machine Learning Model Building
No ratings yet
Machine Learning Model Building
6 pages
Python Code For KNN Classifier 1. Initial Message
No ratings yet
Python Code For KNN Classifier 1. Initial Message
7 pages
Slip
No ratings yet
Slip
5 pages
Bacdeaf 23032025 115708 Split 1
No ratings yet
Bacdeaf 23032025 115708 Split 1
37 pages
DADV Exp-5
No ratings yet
DADV Exp-5
3 pages
Shubham Pract 6 - Merged
No ratings yet
Shubham Pract 6 - Merged
12 pages
K-Means Clustering Using Matlab: December 2015
No ratings yet
K-Means Clustering Using Matlab: December 2015
6 pages
K-Means Clustering Implementation Guide
No ratings yet
K-Means Clustering Implementation Guide
8 pages
Implementing K-Means Clustering: '/content/mall - Customers (1) .CSV'
No ratings yet
Implementing K-Means Clustering: '/content/mall - Customers (1) .CSV'
8 pages
Assembly Language: by - Prof. Prithi K.S
No ratings yet
Assembly Language: by - Prof. Prithi K.S
67 pages
Onehouse
No ratings yet
Onehouse
39 pages
Veneration Without Understanding Interpretation
No ratings yet
Veneration Without Understanding Interpretation
2 pages
Excel Formulas and Functions Guide
100% (3)
Excel Formulas and Functions Guide
5 pages
This Document Contains "Project Management Plan" of Project School Management System
No ratings yet
This Document Contains "Project Management Plan" of Project School Management System
8 pages
Or Final Project
No ratings yet
Or Final Project
10 pages
GR 7 Math - Smart Minds Mathematics Schemes of Work Term 2.
No ratings yet
GR 7 Math - Smart Minds Mathematics Schemes of Work Term 2.
3 pages
Grade 3 Reading Lesson Plan Template
No ratings yet
Grade 3 Reading Lesson Plan Template
2 pages
En The Concept of Life After Death in Islam
No ratings yet
En The Concept of Life After Death in Islam
10 pages
PSY National Preliminary Exam 2024 Guide
No ratings yet
PSY National Preliminary Exam 2024 Guide
4 pages
2023 Year 1 Diagnostic Test
No ratings yet
2023 Year 1 Diagnostic Test
8 pages
Ozone Console
No ratings yet
Ozone Console
3 pages
q1 Mathematics 5 Week 1 Day 2
No ratings yet
q1 Mathematics 5 Week 1 Day 2
36 pages
Vinesh
No ratings yet
Vinesh
1 page
Neural Network Sinusoidal Approximation
No ratings yet
Neural Network Sinusoidal Approximation
3 pages
Judges
No ratings yet
Judges
15 pages
11ler 1. Dönem 2. Yazılı
No ratings yet
11ler 1. Dönem 2. Yazılı
2 pages
TP Manual 2019R1
No ratings yet
TP Manual 2019R1
881 pages
EEM 486 Computer Architecture Homework VI 1. Consider The Following Code Sequence
No ratings yet
EEM 486 Computer Architecture Homework VI 1. Consider The Following Code Sequence
2 pages
Graves Into Gardens: Verse 1 Ab Eb Ab Eb CM BB Ab Eb Ab Eb CM BB Ab
No ratings yet
Graves Into Gardens: Verse 1 Ab Eb Ab Eb CM BB Ab Eb Ab Eb CM BB Ab
2 pages
AI Curriculum for Grade 8 Students
No ratings yet
AI Curriculum for Grade 8 Students
4 pages
Features of Windows NT 4 Overview
No ratings yet
Features of Windows NT 4 Overview
11 pages
SAIL Bokaro OCTT ACTT Reasoning Old Paper
No ratings yet
SAIL Bokaro OCTT ACTT Reasoning Old Paper
11 pages
The Chimney Sweeper 1
No ratings yet
The Chimney Sweeper 1
1 page
4100U File Transfer Utility Guide RevA
No ratings yet
4100U File Transfer Utility Guide RevA
18 pages
Englishq3 Mod1
No ratings yet
Englishq3 Mod1
27 pages
Sound Figures (Alliteration - Assonance - Onomatopoeia - Paronomasia)
No ratings yet
Sound Figures (Alliteration - Assonance - Onomatopoeia - Paronomasia)
6 pages
Essay Kaise-Brochure and Schedule
No ratings yet
Essay Kaise-Brochure and Schedule
4 pages
Paper 6.1 LAB Manual
No ratings yet
Paper 6.1 LAB Manual
15 pages
IEEE Paper Formatting Template Guide
No ratings yet
IEEE Paper Formatting Template Guide
4 pages

Data Clustering with K-Means

Uploaded by

Data Clustering with K-Means

Uploaded by

ASSIGNMENT NUMBER 3 SOLUTION

# Input data files are available in the "../input/" directory.

#Drop redundant columns

# view the labels in the variable

#Conversion of categorical variable into integers

# check how many of the samples were correctly labeled

DATASET LINK: https://www.semanticscholar.org/paper/Dataset-on-usage-and-

You might also like