0% found this document useful (0 votes)

15 views3 pages

KNN Py

Uploaded by

Fahad Nasim

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views3 pages

KNN Py

Uploaded by

Fahad Nasim

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

#!

/usr/bin/env python
# coding: utf-8

# In[487]:

import pandas as pd
import numpy as np
import [Link] as plt
get_ipython().run_line_magic('matplotlib', 'notebook')

# In[488]:

df = pd.read_csv(r'C:\Users\DELL\Downloads\Social_Network_Ads.csv')

# In[508]:

df = [Link](frac=1, random_state=42).reset_index(drop=True)
#shuffles the rows #resets the index starting from 0

# In[509]:

df['Gender'] = [Link](df['Gender'])[0] #returns a tuple where the first

element of that tuple is the array of
#integers assigned to string values
and the second element is the array of the
# actual string values

# In[524]:

dfc = [Link]() #created a copy of the actual data set cuz i want to, no
explanation 😒😒😒

# In[527]:

def feature_scaling(dfc):
f1, f2 = dfc['Age'], dfc['EstimatedSalary']
dfc['Age'] = (f1-min(f1))/(max(f1) - min(f1)) #so basically i am scaling the
data points after which they will range from 0 to 1
dfc['EstimatedSalary'] = (f2-min(f2))/(max(f2) - min(f2))

return (dfc['Age'], dfc['EstimatedSalary'])

feature_scaling(dfc) #calling this function otherwise it will have no effect on
the data set, ofcourse you need to call the function, otherwise
#whats the point of making a function

# In[537]:
training_data = dfc[:320] #splitting the data into two parts, one i'll use for
training other for the testing
test_data = dfc[320:]

# In[538]:

#converting the data set coloumns to numpy arrays and assigning them to variables

x = training_data.iloc[:, [1, 2, 3]].values #x: numpy array, shape (320, 3),

training feature data
y = training_data.iloc[:, -1].values #y: numpy array, shape (320, ),
training labels

a = test_data.iloc[:, [1, 2, 3]].values #a: numpy array, shape (320, 3), test
feature data
b = test_data.iloc[:, -1].values #b: numpy array, shape (320, 3), test
labels

# In[544]:

import [Link] as px

fig = px.scatter_3d(x=x[:, 0], y=x[:, 1], z=x[:, 2], color=[ 'blue' if label==0
else 'red' for label in y])
fig.update_layout(width=1000,height=600)
fig.update_traces(marker=dict(size=2)) # you can try size=2 or 1 as well
[Link]()

# In[560]:

fig = px.scatter_3d(x=a[:, 0], y=a[:, 1], z=a[:, 2], color=[ 'blue' if label==0
else 'red' for label in b])
fig.update_layout(width=1000,height=500)
fig.update_traces(marker=dict(size=2)) #can try size=2 or 1 as well
[Link]()

# In[554]:

def KNN(x, query_point, K):

dist = [Link](((x-query_point)**2).sum(axis=1)) #calculating the
euclidean distance

stacked_array = [Link]([dist, y], axis = 1) #stacking the distances

with there actual y labels
sorted_indices = [Link](stacked_array[:, 0])
ranked_array = stacked_array[sorted_indices] #now ranking the arrays
on the basis of distance(y labels remains intact)
ranked_array = ranked_array[:K] #returns actual sorted
array but only K rows are returned
predict = [Link](ranked_array[:, 1], return_counts=True) #returns a tulple
of two array one element of tuple is the array of all number
#and the other one
is the array of the number of times they occured.

purchased_or_not = predict[0][predict[1].argmax()] #This is for

[Link] purchased(1) is more or not purchased(0) is more
#select the first
array and then select the second array with has
#max count and the
whole gives the number which occured most

return (1 if purchased_or_not==1 else 0)

# In[555]:

err_arr = [] #empty list for storing the

errors. Called error_array.
for i in range(len(a)):
if (KNN(x, a[i], 5))==b[i]:
err = 0 #0 if there is no error
else:
err = 1 #1 if there is error

err_arr.append(err) #that array containing all the

errors in terms of 0 and 1

total_cost=(sum(err_arr)/len(a))*100 #calculating the cost ie.,

total error percentage

print(total_cost, '%') #for the test data set i

got minimum error as 7.5% with a K=5.

# In[ ]:

Mlda - Lab
No ratings yet
Mlda - Lab
35 pages
Ai Lab
No ratings yet
Ai Lab
11 pages
AI Lab Codes.
No ratings yet
AI Lab Codes.
12 pages
EDA Plots Code
No ratings yet
EDA Plots Code
13 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
9 pages
Mlalllabprgs
No ratings yet
Mlalllabprgs
17 pages
MLLab Manual
No ratings yet
MLLab Manual
24 pages
S6 - Data Mining Lab Experiments (Except 1)
No ratings yet
S6 - Data Mining Lab Experiments (Except 1)
6 pages
Data Analysis for Beginners
No ratings yet
Data Analysis for Beginners
1 page
1 - All Python Codes + Neo4j Samples
No ratings yet
1 - All Python Codes + Neo4j Samples
16 pages
Machine Learning Programs
No ratings yet
Machine Learning Programs
10 pages
Titanic Shuffle Analysis in ML Lab
No ratings yet
Titanic Shuffle Analysis in ML Lab
24 pages
V
No ratings yet
V
8 pages
Aiml Lab
No ratings yet
Aiml Lab
14 pages
Machine Learning Lab Manual Guide
No ratings yet
Machine Learning Lab Manual Guide
13 pages
DA Programs
No ratings yet
DA Programs
44 pages
Lab - 7 - 21130616 - TranhThanhVu - Ipynb - Colab
No ratings yet
Lab - 7 - 21130616 - TranhThanhVu - Ipynb - Colab
10 pages
ML Experiment WithDataset
No ratings yet
ML Experiment WithDataset
23 pages
Stat Lab
No ratings yet
Stat Lab
24 pages
Machine Learning Lab
No ratings yet
Machine Learning Lab
33 pages
AIML
No ratings yet
AIML
12 pages
Experiment 1111
No ratings yet
Experiment 1111
25 pages
DWM Practical
No ratings yet
DWM Practical
12 pages
Machine Learning Algorithms Guide
No ratings yet
Machine Learning Algorithms Guide
34 pages
Data Preprocessing 2
No ratings yet
Data Preprocessing 2
5 pages
ML Lab
No ratings yet
ML Lab
23 pages
EE 559 HW2Code PDF
No ratings yet
EE 559 HW2Code PDF
7 pages
AIML Final Programs
No ratings yet
AIML Final Programs
8 pages
ML Short Code - Under Updating
No ratings yet
ML Short Code - Under Updating
4 pages
Machine Learning Algorithms in Python
No ratings yet
Machine Learning Algorithms in Python
18 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
26 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
18 pages
AI Lab10
No ratings yet
AI Lab10
4 pages
Reading Data: #Importing Required Libraries
No ratings yet
Reading Data: #Importing Required Libraries
16 pages
Machine Learning Practical File MRIEM
No ratings yet
Machine Learning Practical File MRIEM
49 pages
Walmart Sales Forecasting Guide
No ratings yet
Walmart Sales Forecasting Guide
37 pages
Apriori Algorithm for Itemset Mining
No ratings yet
Apriori Algorithm for Itemset Mining
28 pages
Python Programs for AI Algorithms
No ratings yet
Python Programs for AI Algorithms
28 pages
ps2 Macro Bongioanni TXT
No ratings yet
ps2 Macro Bongioanni TXT
4 pages
ML Record Print
No ratings yet
ML Record Print
20 pages
KNN - Predictive Analysis
No ratings yet
KNN - Predictive Analysis
6 pages
ML - Lab Manual With Woad File
No ratings yet
ML - Lab Manual With Woad File
12 pages
ML Lab Record
No ratings yet
ML Lab Record
33 pages
ML Journal External
No ratings yet
ML Journal External
14 pages
ML Lab Prgms Split
No ratings yet
ML Lab Prgms Split
3 pages
01 134192 066 9559671601 28052022 103753pm
No ratings yet
01 134192 066 9559671601 28052022 103753pm
1 page
Ashwin Report
No ratings yet
Ashwin Report
18 pages
LAB-4 Report
No ratings yet
LAB-4 Report
21 pages
Fda Batch2program
No ratings yet
Fda Batch2program
18 pages
Linear Reg 33
No ratings yet
Linear Reg 33
3 pages
Wa0003
No ratings yet
Wa0003
16 pages
Lab4 KNN
No ratings yet
Lab4 KNN
9 pages
Datascience PR 6 Veda
No ratings yet
Datascience PR 6 Veda
6 pages
Page Rank
No ratings yet
Page Rank
7 pages
Apriori Algorithm
No ratings yet
Apriori Algorithm
12 pages
ML Labmanual
No ratings yet
ML Labmanual
33 pages
Python ML Algorithms Guide
No ratings yet
Python ML Algorithms Guide
7 pages
Ailmml
No ratings yet
Ailmml
1 page
Bilal Ahmad Ai & DSS Assign # 03
No ratings yet
Bilal Ahmad Ai & DSS Assign # 03
7 pages
CS-30013 (DMDW) - CS End Nov 2024
No ratings yet
CS-30013 (DMDW) - CS End Nov 2024
21 pages
An Improved Closed-Circuit RO (CCRO) System - Design and Cyclic Simulation
No ratings yet
An Improved Closed-Circuit RO (CCRO) System - Design and Cyclic Simulation
15 pages
ANSYS Mechanical APDL Basic Analysis Guide
No ratings yet
ANSYS Mechanical APDL Basic Analysis Guide
304 pages
Module 3 - Line
No ratings yet
Module 3 - Line
2 pages
PPMP 1811
No ratings yet
PPMP 1811
16 pages
One-Sample Hypothesis Tests
No ratings yet
One-Sample Hypothesis Tests
47 pages
Deep Learning Approaches For Speech Emotion Recognition: State of The Art and Research Challenges
No ratings yet
Deep Learning Approaches For Speech Emotion Recognition: State of The Art and Research Challenges
68 pages
Huang 等 - 2025 - Machine Learning-optimized Jet-Enhanced Immersion Liquid Cooling for High-power Data Centers
No ratings yet
Huang 等 - 2025 - Machine Learning-optimized Jet-Enhanced Immersion Liquid Cooling for High-power Data Centers
16 pages
03 Annex A2 - Index List of Subject Combns For 2025
No ratings yet
03 Annex A2 - Index List of Subject Combns For 2025
2 pages
AE Unit-4 QB With Solution
No ratings yet
AE Unit-4 QB With Solution
20 pages
Is Logic Ever Foundational
No ratings yet
Is Logic Ever Foundational
4 pages
Dplyr Grammar for Data Wrangling
No ratings yet
Dplyr Grammar for Data Wrangling
21 pages
Bridge Detailing 2.0: Computational Modelling Methods Using Civil 3D, Revit & Dynamo
100% (1)
Bridge Detailing 2.0: Computational Modelling Methods Using Civil 3D, Revit & Dynamo
33 pages
A Brinding Model
No ratings yet
A Brinding Model
28 pages
Crystal Growth PHD Thesispdf
100% (2)
Crystal Growth PHD Thesispdf
8 pages
Class XII Mathematics Exam
No ratings yet
Class XII Mathematics Exam
6 pages
Kami Export - 6. GWC PHYSICS 185 ONLINE LAB-Collision PHET Lab-1 1
No ratings yet
Kami Export - 6. GWC PHYSICS 185 ONLINE LAB-Collision PHET Lab-1 1
5 pages
CSCI1120 Introduction To Computing Using C++ Tutorial 9: Assignment 5
No ratings yet
CSCI1120 Introduction To Computing Using C++ Tutorial 9: Assignment 5
42 pages
Management Accounting For Engineers: Final Examination
No ratings yet
Management Accounting For Engineers: Final Examination
11 pages
Scilab 5.4.1 Notes PDF
No ratings yet
Scilab 5.4.1 Notes PDF
7 pages
SBA1 Maths 2 Yr4 2023
No ratings yet
SBA1 Maths 2 Yr4 2023
7 pages
Eukaryotic Swimming Cells Are Shaped by Hydrodynamic Constraints
No ratings yet
Eukaryotic Swimming Cells Are Shaped by Hydrodynamic Constraints
9 pages
Algebraic Laws of Regular Expressions
No ratings yet
Algebraic Laws of Regular Expressions
42 pages
Python Collections Cheatsheet
No ratings yet
Python Collections Cheatsheet
2 pages
Math Problem Solutions and Explanations
100% (1)
Math Problem Solutions and Explanations
73 pages
Computer Aided Machine Drawing Laboratory: Lab Manual
No ratings yet
Computer Aided Machine Drawing Laboratory: Lab Manual
31 pages
Apple - LeetCode PDF
No ratings yet
Apple - LeetCode PDF
11 pages
18 Analysis of Geometrically Nonlinear Systems: P) (BU + H3)
No ratings yet
18 Analysis of Geometrically Nonlinear Systems: P) (BU + H3)
12 pages
Lecture 01
No ratings yet
Lecture 01
31 pages
The Influence of Leadership Style, Organizational Culture, and Job Satisfaction On Employee Performance Department of Education and Culture of Yapen Islands
No ratings yet
The Influence of Leadership Style, Organizational Culture, and Job Satisfaction On Employee Performance Department of Education and Culture of Yapen Islands
14 pages

KNN Py

Uploaded by

KNN Py

Uploaded by

#!

df['Gender'] = [Link](df['Gender'])[0] #returns a tuple where the first

return (dfc['Age'], dfc['EstimatedSalary'])

x = training_data.iloc[:, [1, 2, 3]].values #x: numpy array, shape (320, 3),

def KNN(x, query_point, K):

stacked_array = [Link]([dist, y], axis = 1) #stacking the distances

purchased_or_not = predict[0][predict[1].argmax()] #This is for

return (1 if purchased_or_not==1 else 0)

err_arr = [] #empty list for storing the

err_arr.append(err) #that array containing all the

total_cost=(sum(err_arr)/len(a))*100 #calculating the cost ie.,

print(total_cost, '%') #for the test data set i

You might also like