0% found this document useful (0 votes)

44 views5 pages

Lab - 5 (CB - En.u4ece22115)

The document discusses performing several experiments to analyze different classification algorithms on datasets. Experiment 1 uses decision tree and naive bayes classifiers on the IRIS dataset, achieving 100% accuracy with both. Experiment 2 compares the performance of decision tree and naive bayes classifiers using different cross validation techniques. Experiment 3 discusses label encoding and one hot encoding of categorical variables. Experiment 4 analyzes support vector machines with and without normalization/standardization on IRIS data. Experiment 5 aims to perform color classification on images using support vector machines, k-nearest neighbors, decision tree and naive bayes classifiers.

Uploaded by

Daejuswaram Gopinath

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

44 views5 pages

Lab - 5 (CB - En.u4ece22115)

Uploaded by

Daejuswaram Gopinath

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

LAB - 5 (CB.EN.

U4ECE22115)
Experiment 1: Performance decision tree (DT) and Naive Bayes (NB) in IRIS dataset and note the accuracy using both classifiers 70-30% training-test split.

In [42]: import pandas as pd

from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import accuracy_score

# Load IRIS dataset

iris = pd.read_csv(r"C:\Users\Daejuswaram Gopinath\Downloads\Iris_Dataset.csv")
X = iris.iloc[:,0:4]
y = iris.Species

# Split data into training and testing sets (70-30 split)

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# Decision Tree Classifier

dt_classifier = DecisionTreeClassifier()
dt_classifier.fit(X_train, y_train)
dt_pred = dt_classifier.predict(X_test)
dt_accuracy = accuracy_score(y_test, dt_pred)

# Naive Bayes Classifier

nb_classifier = GaussianNB()
nb_classifier.fit(X_train, y_train)
nb_pred = nb_classifier.predict(X_test)
nb_accuracy = accuracy_score(y_test, nb_pred)

print("Decision Tree Accuracy:", dt_accuracy)

print("Naive Bayes Accuracy:", nb_accuracy)

Decision Tree Accuracy: 1.0

Naive Bayes Accuracy: 1.0

In [43]: iris.head()

Out[43]: Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 1 5.1 3.5 1.4 0.2 Iris-setosa

1 2 4.9 3.0 1.4 0.2 Iris-setosa

2 3 4.7 3.2 1.3 0.2 Iris-setosa

3 4 4.6 3.1 1.5 0.2 Iris-setosa

4 5 5.0 3.6 1.4 0.2 Iris-setosa

Experiment 2: Compare the performance of both classifiers (DT and NB) using 10-fold, leave one out, and 10-fold stratified cross-validation.

In [44]: from sklearn.model_selection import cross_val_score, LeaveOneOut, StratifiedKFold

# Cross-validation methods
cv_methods = ['10-fold', 'Leave One Out', 'Stratified 10-fold']
classifiers = [DecisionTreeClassifier(), GaussianNB()]

for clf in classifiers:

for cv_method in cv_methods:
if cv_method == '10-fold':
scores = cross_val_score(clf, X, y, cv=10)
elif cv_method == 'Leave One Out':
loo = LeaveOneOut()
scores = cross_val_score(clf, X, y, cv=loo)
else:
skf = StratifiedKFold(n_splits=10)
scores = cross_val_score(clf, X, y, cv=skf)
print(f"{cv_method} Cross-validation Accuracy for {type(clf).__name__}: {scores.mean()}")

10-fold Cross-validation Accuracy for DecisionTreeClassifier: 0.9666666666666668

Leave One Out Cross-validation Accuracy for DecisionTreeClassifier: 0.9933333333333333
Stratified 10-fold Cross-validation Accuracy for DecisionTreeClassifier: 0.9333333333333332
10-fold Cross-validation Accuracy for GaussianNB: 0.9866666666666667
Leave One Out Cross-validation Accuracy for GaussianNB: 0.9866666666666667
Stratified 10-fold Cross-validation Accuracy for GaussianNB: 0.9866666666666667

In [45]: from sklearn.preprocessing import LabelEncoder, OneHotEncoder

import pandas as pd

# Assuming 'iris' is a DataFrame version of the Iris dataset

# And 'species' is the column with categorical data to be encoded

# Label Encoding
le = LabelEncoder()
iris['species_encoded'] = le.fit_transform(iris['Species'])

# One Hot Encoding

ohe = OneHotEncoder()
species_ohe = ohe.fit_transform(iris[['Species']]).toarray()
species_ohe_df = pd.DataFrame(species_ohe, columns=[f"species_{i}" for i in range(species_ohe.shape[1])])
iris = pd.concat([iris, species_ohe_df], axis=1)

In [46]: label_encoder = LabelEncoder()

y_train_encoded = label_encoder.fit_transform(y_train)
y_test_encoded = label_encoder.transform(y_test)

# Support Vector Machine Classifier without Standardization

svm_classifier = SVC(kernel='linear')
svm_classifier.fit(X_train, y_train_encoded)
y_pred_no_standardization = svm_classifier.predict(X_test)
accuracy_no_standardization = accuracy_score(y_test_encoded, y_pred_no_standardization)

# Support Vector Machine Classifier with Standardization

scaler = StandardScaler()
X_train_standardized = scaler.fit_transform(X_train)
X_test_standardized = scaler.transform(X_test)

svm_classifier_standardized = SVC(kernel='linear')
svm_classifier_standardized.fit(X_train_standardized, y_train_encoded)
y_pred_standardization = svm_classifier_standardized.predict(X_test_standardized)
accuracy_standardization = accuracy_score(y_test_encoded, y_pred_standardization)

# Support Vector Machine Classifier without Normalization

svm_classifier_no_normalization = SVC(kernel='linear')
svm_classifier_no_normalization.fit(X_train, y_train_encoded)
y_pred_no_normalization = svm_classifier_no_normalization.predict(X_test)
accuracy_no_normalization = accuracy_score(y_test_encoded, y_pred_no_normalization)

# Support Vector Machine Classifier with Normalization

normalizer = MinMaxScaler()
X_train_normalized = normalizer.fit_transform(X_train)
X_test_normalized = normalizer.transform(X_test)
svm_classifier_normalized = SVC(kernel='linear')
svm_classifier_normalized.fit(X_train_normalized, y_train_encoded)
y_pred_normalization = svm_classifier_normalized.predict(X_test_normalized)
accuracy_normalization = accuracy_score(y_test_encoded, y_pred_normalization)

# Print accuracies
print("Accuracy without Standardization:", accuracy_no_standardization)
print("Accuracy with Standardization:", accuracy_standardization)
print("Accuracy without Normalization:", accuracy_no_normalization)
print("Accuracy with Normalization:", accuracy_normalization)

Accuracy without Standardization: 1.0

Accuracy with Standardization: 1.0
Accuracy without Normalization: 1.0
Accuracy with Normalization: 1.0

In [47]: iris.head()

Out[47]: Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species species_encoded species_0 species_1 species_2

0 1 5.1 3.5 1.4 0.2 Iris-setosa 0 1.0 0.0 0.0

1 2 4.9 3.0 1.4 0.2 Iris-setosa 0 1.0 0.0 0.0

2 3 4.7 3.2 1.3 0.2 Iris-setosa 0 1.0 0.0 0.0

3 4 4.6 3.1 1.5 0.2 Iris-setosa 0 1.0 0.0 0.0

4 5 5.0 3.6 1.4 0.2 Iris-setosa 0 1.0 0.0 0.0

Experiment 5: Perform colour classification for the above dataset using SVM, k-NN, DT, NB classifier.

Type - 1 (Considering specific colors)

In [39]: import numpy as np

import pandas as pd
import cv2
import os
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import accuracy_score

# Define the directory containing the image data

DATADIR = r'C:\Users\Daejuswaram Gopinath\Downloads\ColorClassification'

# Define the categories or labels

CATEGORIES = ['Black', 'Blue', 'Brown', 'Green','Violet', 'White']

# Function to load and preprocess images

def load_images_and_labels():
images = []
labels = []
for category in CATEGORIES:
path = os.path.join(DATADIR, category)
class_num = CATEGORIES.index(category)
for img in os.listdir(path):
img_array = cv2.imread(os.path.join(path,img))
img_array = cv2.resize(img_array, (100, 100)) # Resize images to a fixed size
images.append(img_array.flatten())
labels.append(class_num)
return images, labels
# Load and preprocess images
images, labels = load_images_and_labels()

# Convert to numpy arrays

X = np.array(images)
y = np.array(labels)

# Split the dataset into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=1)

# Initialize classifiers
svm_classifier = SVC(kernel='linear')
knn_classifier = KNeighborsClassifier(n_neighbors=5)
dt_classifier = DecisionTreeClassifier()
nb_classifier = GaussianNB()

# Train classifiers
svm_classifier.fit(X_train, y_train)
knn_classifier.fit(X_train, y_train)
dt_classifier.fit(X_train, y_train)
nb_classifier.fit(X_train, y_train)

# Predictions
y_pred_svm = svm_classifier.predict(X_test)
y_pred_knn = knn_classifier.predict(X_test)
y_pred_dt = dt_classifier.predict(X_test)
y_pred_nb = nb_classifier.predict(X_test)

# Calculate accuracies
accuracy_svm = accuracy_score(y_test, y_pred_svm)
accuracy_knn = accuracy_score(y_test, y_pred_knn)
accuracy_dt = accuracy_score(y_test, y_pred_dt)
accuracy_nb = accuracy_score(y_test, y_pred_nb)

# Print accuracies
print("SVM Accuracy:", accuracy_svm)
print("k-NN Accuracy:", accuracy_knn)
print("Decision Tree Accuracy:", accuracy_dt)
print("Naive Bayes Accuracy:", accuracy_nb)

SVM Accuracy: 0.8076923076923077

k-NN Accuracy: 0.5384615384615384
Decision Tree Accuracy: 0.5384615384615384
Naive Bayes Accuracy: 0.8076923076923077

Type - 2 (Considering all colors)

In [40]: import numpy as np

# Define the directory containing the image data

DATADIR = r'C:\Users\Daejuswaram Gopinath\Downloads\ColorClassification'

# Define the categories or labels

CATEGORIES = ['orange','Violet','red','Blue','Green','Black','Brown','White']
# Function to load and preprocess images
def load_images_and_labels():
images = []
labels = []
for category in CATEGORIES:
path = os.path.join(DATADIR, category)
class_num = CATEGORIES.index(category)
for img in os.listdir(path):
img_array = cv2.imread(os.path.join(path,img))
img_array = cv2.resize(img_array, (100, 100)) # Resize images to a fixed size
images.append(img_array.flatten())
labels.append(class_num)
return images, labels

# Load and preprocess images

images, labels = load_images_and_labels()

# Convert to numpy arrays

X = np.array(images)
y = np.array(labels)

# Split the dataset into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=1)

# Initialize classifiers
svm_classifier = SVC(kernel='linear')
knn_classifier = KNeighborsClassifier(n_neighbors=5)
dt_classifier = DecisionTreeClassifier()
nb_classifier = GaussianNB()

# Train classifiers
svm_classifier.fit(X_train, y_train)
knn_classifier.fit(X_train, y_train)
dt_classifier.fit(X_train, y_train)
nb_classifier.fit(X_train, y_train)

# Predictions
y_pred_svm = svm_classifier.predict(X_test)
y_pred_knn = knn_classifier.predict(X_test)
y_pred_dt = dt_classifier.predict(X_test)
y_pred_nb = nb_classifier.predict(X_test)

# Print accuracies
print("SVM Accuracy:", accuracy_svm)
print("k-NN Accuracy:", accuracy_knn)
print("Decision Tree Accuracy:", accuracy_dt)
print("Naive Bayes Accuracy:", accuracy_nb)

SVM Accuracy: 0.6363636363636364

k-NN Accuracy: 0.5454545454545454
Decision Tree Accuracy: 0.6666666666666666
Naive Bayes Accuracy: 0.5757575757575758

In [ ]:

Iris Classifier Accuracy Comparison
No ratings yet
Iris Classifier Accuracy Comparison
5 pages
ML Lab Manual
No ratings yet
ML Lab Manual
6 pages
VAMSHI PR (1) 2 Edit
No ratings yet
VAMSHI PR (1) 2 Edit
16 pages
Code Examples in Space
No ratings yet
Code Examples in Space
13 pages
SVM K NN MLP With Sklearn Jupyter NoteBo
No ratings yet
SVM K NN MLP With Sklearn Jupyter NoteBo
22 pages
Classification Review
No ratings yet
Classification Review
8 pages
Ai/Ml Lab-4: Name: Pratik Jadhav PRN: 20190802050
No ratings yet
Ai/Ml Lab-4: Name: Pratik Jadhav PRN: 20190802050
5 pages
3 Classification
No ratings yet
3 Classification
16 pages
Support Vector Machines for Solar Data
No ratings yet
Support Vector Machines for Solar Data
4 pages
ML 4 SVM
No ratings yet
ML 4 SVM
3 pages
SVM Classification on Iris Dataset
No ratings yet
SVM Classification on Iris Dataset
4 pages
ML Lab6
No ratings yet
ML Lab6
4 pages
Machine Learning Aiml
No ratings yet
Machine Learning Aiml
7 pages
CV 5
No ratings yet
CV 5
4 pages
NaiveBayesClassifier - Jupyter Notebook
No ratings yet
NaiveBayesClassifier - Jupyter Notebook
2 pages
Setup: This Notebook Contains All The Sample Code and Solutions To The Exercises in Chapter 7
No ratings yet
Setup: This Notebook Contains All The Sample Code and Solutions To The Exercises in Chapter 7
23 pages
AML Lab3 2021wb15156
No ratings yet
AML Lab3 2021wb15156
13 pages
SVC Classifier Applications in Python
No ratings yet
SVC Classifier Applications in Python
3 pages
ML Functions
No ratings yet
ML Functions
12 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
6 pages
Assignment 1
No ratings yet
Assignment 1
17 pages
ML Lab
No ratings yet
ML Lab
10 pages
Support Vector Machine
No ratings yet
Support Vector Machine
7 pages
ML Using Python Programs
No ratings yet
ML Using Python Programs
12 pages
PYHTONPRACT
No ratings yet
PYHTONPRACT
4 pages
ML Remaining Jds
No ratings yet
ML Remaining Jds
35 pages
Dsbda 10
No ratings yet
Dsbda 10
5 pages
Linearregression SVM
No ratings yet
Linearregression SVM
3 pages
ML Expt 4
No ratings yet
ML Expt 4
4 pages
ML Lab-1
No ratings yet
ML Lab-1
32 pages
TASK 8: Deploy Support Vector Machine, Apriori Algorithm: BTCS619-18
No ratings yet
TASK 8: Deploy Support Vector Machine, Apriori Algorithm: BTCS619-18
5 pages
AML Lab
No ratings yet
AML Lab
14 pages
ML5 Implementation
No ratings yet
ML5 Implementation
32 pages
Scikit-Learn Python Cheat Sheet
100% (1)
Scikit-Learn Python Cheat Sheet
1 page
1
No ratings yet
1
13 pages
Unit2 ML Programs
No ratings yet
Unit2 ML Programs
7 pages
All in One
No ratings yet
All in One
13 pages
Program 5
No ratings yet
Program 5
3 pages
AIML Project
No ratings yet
AIML Project
4 pages
Setup: This Notebook Contains All The Sample Code and Solutions To The Exercises in Chapter 5
No ratings yet
Setup: This Notebook Contains All The Sample Code and Solutions To The Exercises in Chapter 5
27 pages
Ex 6, EX 7 AIML
No ratings yet
Ex 6, EX 7 AIML
9 pages
Scikit Learn Cross-Validation Guide
No ratings yet
Scikit Learn Cross-Validation Guide
141 pages
Perform The Data Classification Using SVM Classifier - BI Prac 1
No ratings yet
Perform The Data Classification Using SVM Classifier - BI Prac 1
8 pages
Data Mining Practicals
No ratings yet
Data Mining Practicals
22 pages
Programs Lab Bca
No ratings yet
Programs Lab Bca
16 pages
AI ML - Cycle 2 Programs
No ratings yet
AI ML - Cycle 2 Programs
15 pages
Bagging, Random Forest, Gradient Boost, AdaBoost & PCA
No ratings yet
Bagging, Random Forest, Gradient Boost, AdaBoost & PCA
8 pages
Comparison of Classifiers
No ratings yet
Comparison of Classifiers
6 pages
Shobit Sharma (2124399) ML Lab File PDF
No ratings yet
Shobit Sharma (2124399) ML Lab File PDF
19 pages
SVM Classification on Iris Dataset
No ratings yet
SVM Classification on Iris Dataset
7 pages
Scikit-Learn Python Cheat Sheet
100% (1)
Scikit-Learn Python Cheat Sheet
1 page
EX - NO:3: Algorithm
No ratings yet
EX - NO:3: Algorithm
11 pages
Unit 2
No ratings yet
Unit 2
5 pages
Naive Bayes Iris Classifier Guide
No ratings yet
Naive Bayes Iris Classifier Guide
14 pages
ML Batch
No ratings yet
ML Batch
36 pages
ML Codes
No ratings yet
ML Codes
9 pages
Customer Care Executive - ELE - Q4603 - v3.0
No ratings yet
Customer Care Executive - ELE - Q4603 - v3.0
39 pages
BS Iec 61935-2-25-2015
No ratings yet
BS Iec 61935-2-25-2015
14 pages
The Self in Western and Eastern Thought
No ratings yet
The Self in Western and Eastern Thought
18 pages
Test 5
No ratings yet
Test 5
6 pages
B.Tech R20 IQuestion Bank CC
No ratings yet
B.Tech R20 IQuestion Bank CC
3 pages
Acoustics for Interior Design Students
No ratings yet
Acoustics for Interior Design Students
92 pages
Veg Menu Selection PDF
No ratings yet
Veg Menu Selection PDF
1 page
10 Minute English Typing Test - Dhandi Muhammad
No ratings yet
10 Minute English Typing Test - Dhandi Muhammad
2 pages
24 A - Review - On - The - Effect - of - Soil - Compaction - and - Its
No ratings yet
24 A - Review - On - The - Effect - of - Soil - Compaction - and - Its
23 pages
Bacterial Skin Infections Guide
No ratings yet
Bacterial Skin Infections Guide
10 pages
10th Class Test Linear Equation Class Test
No ratings yet
10th Class Test Linear Equation Class Test
1 page
CH 8-The-Prisoner-of-Zenda PDF
No ratings yet
CH 8-The-Prisoner-of-Zenda PDF
8 pages
E-Commerce Practical File Overview
No ratings yet
E-Commerce Practical File Overview
23 pages
Vdocuments - MX - Volk Racing GT Series JRP 18 X 75 Call Call Call 18 X 75 Call Call Call
No ratings yet
Vdocuments - MX - Volk Racing GT Series JRP 18 X 75 Call Call Call 18 X 75 Call Call Call
7 pages
E225 - E229, E10 Pilot Devices: Modular DIN Rail Components
No ratings yet
E225 - E229, E10 Pilot Devices: Modular DIN Rail Components
1 page
MA2002D Course Plan-2
No ratings yet
MA2002D Course Plan-2
2 pages
Chapter 2 Strategy and Project Selction
No ratings yet
Chapter 2 Strategy and Project Selction
34 pages
Animal Breeding Course Overview
No ratings yet
Animal Breeding Course Overview
20 pages
207 13 25 31 Primaax Series PDF
No ratings yet
207 13 25 31 Primaax Series PDF
8 pages
vt202406 DL
No ratings yet
vt202406 DL
108 pages
Bures 1989
No ratings yet
Bures 1989
5 pages
SP12 2y3 PC PP 993 007
100% (1)
SP12 2y3 PC PP 993 007
21 pages
SCH 3401 Electrochemistry 1
No ratings yet
SCH 3401 Electrochemistry 1
3 pages
Casablanca Weather - Google Search 3
No ratings yet
Casablanca Weather - Google Search 3
1 page
2020 Monthly Revenue and Order Analysis
No ratings yet
2020 Monthly Revenue and Order Analysis
1,087 pages
71 - Ansi Agma 2008-D11
No ratings yet
71 - Ansi Agma 2008-D11
49 pages
Series X Product Brochure 2022
No ratings yet
Series X Product Brochure 2022
2 pages
PT English 3
No ratings yet
PT English 3
3 pages
4hi S4CLD2308 BPD en de
No ratings yet
4hi S4CLD2308 BPD en de
111 pages
2.3-2 Settiing-Up Wireless Access Point
100% (2)
2.3-2 Settiing-Up Wireless Access Point
19 pages

Lab - 5 (CB - En.u4ece22115)

Uploaded by

Lab - 5 (CB - En.u4ece22115)

Uploaded by

LAB - 5 (CB.EN.

In [42]: import pandas as pd

# Load IRIS dataset

# Split data into training and testing sets (70-30 split)

# Decision Tree Classifier

# Naive Bayes Classifier

print("Decision Tree Accuracy:", dt_accuracy)

Decision Tree Accuracy: 1.0

Out[43]: Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 1 5.1 3.5 1.4 0.2 Iris-setosa

1 2 4.9 3.0 1.4 0.2 Iris-setosa

2 3 4.7 3.2 1.3 0.2 Iris-setosa

3 4 4.6 3.1 1.5 0.2 Iris-setosa

4 5 5.0 3.6 1.4 0.2 Iris-setosa

In [44]: from sklearn.model_selection import cross_val_score, LeaveOneOut, StratifiedKFold

for clf in classifiers:

10-fold Cross-validation Accuracy for DecisionTreeClassifier: 0.9666666666666668

In [45]: from sklearn.preprocessing import LabelEncoder, OneHotEncoder

# Assuming 'iris' is a DataFrame version of the Iris dataset

# One Hot Encoding

In [46]: label_encoder = LabelEncoder()

# Support Vector Machine Classifier without Standardization

# Support Vector Machine Classifier with Standardization

# Support Vector Machine Classifier without Normalization

# Support Vector Machine Classifier with Normalization

Accuracy without Standardization: 1.0

0 1 5.1 3.5 1.4 0.2 Iris-setosa 0 1.0 0.0 0.0

1 2 4.9 3.0 1.4 0.2 Iris-setosa 0 1.0 0.0 0.0

2 3 4.7 3.2 1.3 0.2 Iris-setosa 0 1.0 0.0 0.0

3 4 4.6 3.1 1.5 0.2 Iris-setosa 0 1.0 0.0 0.0

4 5 5.0 3.6 1.4 0.2 Iris-setosa 0 1.0 0.0 0.0

Type - 1 (Considering specific colors)

In [39]: import numpy as np

# Define the directory containing the image data

# Define the categories or labels

# Function to load and preprocess images

# Convert to numpy arrays

# Split the dataset into training and testing sets

SVM Accuracy: 0.8076923076923077

Type - 2 (Considering all colors)

In [40]: import numpy as np

# Define the directory containing the image data

# Define the categories or labels

# Load and preprocess images

# Convert to numpy arrays

# Split the dataset into training and testing sets

SVM Accuracy: 0.6363636363636364

You might also like