Machine Learning: Lecture 7: Create Your First Project

This document provides instructions for creating a machine learning project to classify iris flowers using the iris dataset, which includes 150 samples described by 4 features. It outlines loading and exploring the iris data, splitting it into training and test sets, building a decision tree classifier model, and evaluating the model's accuracy on both the training and test sets. Additionally, it suggests some homework extensions including applying normalization, comparing other classifier models, and finding the best predictive model.

Uploaded by

Bisnu Sarkar


Machine Learning

Lecture 7: Create Your First Project

COURSE CODE: CSE490
2019
Course Teacher
Dr. Mrinal Kanti Baowaly
Assistant Professor
Department of Computer Science and
Engineering, Bangabandhu Sheikh
Mujibur Rahman Science and
Technology University, Bangladesh.

Email: [email protected]
Iris flower classification
Iris dataset
• 150 samples
• 3 labels/categories: species of Iris (Iris setosa, Iris virginica and Iris versicolor)
• 4 features: sepal length, sepal width, petal length and petal width, in cm
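The figures above can be checked quickly in code. The sketch below is only an illustration: it assumes the copy of the dataset bundled with scikit-learn (`load_iris`) rather than the `IRIS.csv` file used in the lecture, so the column names differ from the CSV.

```python
from sklearn.datasets import load_iris

# Assumption: use scikit-learn's bundled iris data instead of IRIS.csv
iris = load_iris(as_frame=True)
frame = iris.frame  # 4 feature columns plus an integer 'target' column

print(frame.shape)               # rows and columns
print(list(iris.target_names))   # the three species
print(iris.feature_names)        # the four measurements, in cm
```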
Iris dataset instances
Import libraries
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn import tree
from sklearn.metrics import accuracy_score
Load the dataset
iris_data = pd.read_csv('IRIS.csv')
Summarize the dataset
# dimensions (no. of rows & columns)
print(iris_data.shape)
# list of columns/features
print(iris_data.columns)
# peek some data
print(iris_data.head(10))
# statistical summary
print(iris_data.describe())
Specify the target variable and its distribution
# target variable
target = iris_data['species']

# distribution of class labels or categories
# (Series.value_counts() replaces the deprecated top-level pd.value_counts)
print(target.value_counts())

# alternative of finding class distribution

print(iris_data.groupby('species').size())
Split dataset into training and test data
seed = 7
train_data, test_data = train_test_split(iris_data, test_size=0.3,
                                         random_state=seed)
# shape of the datasets
print('\nShape of training data :', train_data.shape)
print('\nShape of testing data :', test_data.shape)
# class distribution of the training data
print(train_data['species'].value_counts())
# class distribution of the test data
print(test_data['species'].value_counts())
Balanced split of the dataset
seed = 7
train_data, test_data = train_test_split(iris_data, test_size=0.3,
random_state=seed, stratify=target)
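To see what `stratify` changes, the class counts of the two subsets can be compared. This is a minimal sketch, assuming scikit-learn's bundled iris data in place of `IRIS.csv`; renaming the `target` column to `species` is done here only to mirror the lecture's column name.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

# Assumption: bundled iris data stands in for IRIS.csv
iris_data = load_iris(as_frame=True).frame.rename(columns={'target': 'species'})
target = iris_data['species']

seed = 7
train_data, test_data = train_test_split(
    iris_data, test_size=0.3, random_state=seed, stratify=target)

# With stratification, every class keeps the same 70/30 proportion
train_counts = train_data['species'].value_counts()
test_counts = test_data['species'].value_counts()
print(train_counts)
print(test_counts)
```

With 50 samples per class and a 30% test split, each class should contribute 35 training and 15 test samples.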
Separate the independent and target variables
# separate the independent and target variables from training data
# (columns= already names the axis, so axis=1 is redundant)
train_x = train_data.drop(columns=['species'])
train_y = train_data['species']

# separate the independent and target variables from test data
test_x = test_data.drop(columns=['species'])
test_y = test_data['species']
Build the model
# create a classifier object/model
model = tree.DecisionTreeClassifier()

# train the model with fit function

model.fit(train_x, train_y)
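Once the tree is fitted, it can help to look at the rules it learned. The sketch below is one possible way to do that with `tree.export_text`; it assumes the bundled iris data instead of `IRIS.csv`, and fixing `random_state` on the classifier is an addition for reproducibility, not part of the lecture code.

```python
from sklearn import tree
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

# Assumption: bundled iris data stands in for IRIS.csv
iris = load_iris()
train_x, test_x, train_y, test_y = train_test_split(
    iris.data, iris.target, test_size=0.3, random_state=7, stratify=iris.target)

# random_state is an added assumption so that repeated runs give the same tree
model = tree.DecisionTreeClassifier(random_state=7)
model.fit(train_x, train_y)

# Plain-text view of the learned decision rules
rules = tree.export_text(model, feature_names=list(iris.feature_names))
print(rules)
print('Depth:', model.get_depth(), 'Leaves:', model.get_n_leaves())
```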
Make predictions
# make predictions on training data
predictions_train = model.predict(train_x)
print('\nTraining Accuracy :', accuracy_score(train_y,
predictions_train))

# make predictions on test data

predictions_test = model.predict(test_x)
print('\nTest Accuracy :', accuracy_score(test_y, predictions_test))
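Accuracy alone does not show which species get confused with each other. As a possible complement, the sketch below prints a confusion matrix for the test predictions. It assumes the bundled iris data instead of `IRIS.csv` and a fixed `random_state` on the classifier; neither comes from the lecture.

```python
from sklearn import tree
from sklearn.datasets import load_iris
from sklearn.metrics import accuracy_score, confusion_matrix
from sklearn.model_selection import train_test_split

# Assumption: bundled iris data stands in for IRIS.csv
iris = load_iris()
train_x, test_x, train_y, test_y = train_test_split(
    iris.data, iris.target, test_size=0.3, random_state=7, stratify=iris.target)

model = tree.DecisionTreeClassifier(random_state=7)
model.fit(train_x, train_y)
predictions_test = model.predict(test_x)

# Rows are true classes, columns are predicted classes
cm = confusion_matrix(test_y, predictions_test)
test_accuracy = accuracy_score(test_y, predictions_test)
print(cm)
print('Test Accuracy :', test_accuracy)
```

The diagonal of the matrix counts correct predictions, so accuracy equals the diagonal sum divided by the total number of test samples.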
Homework for the lab
Apply normalization or standardization
Apply different classifiers and compare their performances
• Logistic Regression (LR)
• K-Nearest Neighbors (KNN)
• Support Vector Machines (SVM)
Find the best model for the prediction task
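One way to approach these tasks is sketched below. It is only a starting point, not a reference solution: it assumes the bundled iris data instead of `IRIS.csv`, uses `StandardScaler` for standardization, and keeps default hyperparameters apart from a raised `max_iter` for logistic regression.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Assumption: bundled iris data stands in for IRIS.csv
iris = load_iris()
train_x, test_x, train_y, test_y = train_test_split(
    iris.data, iris.target, test_size=0.3, random_state=7, stratify=iris.target)

candidates = {
    'LR': LogisticRegression(max_iter=1000),
    'KNN': KNeighborsClassifier(),
    'SVM': SVC(),
}

results = {}
for name, clf in candidates.items():
    # The scaler is fitted on the training data only, inside the pipeline
    pipeline = make_pipeline(StandardScaler(), clf)
    pipeline.fit(train_x, train_y)
    results[name] = accuracy_score(test_y, pipeline.predict(test_x))

best_name = max(results, key=results.get)
print(results)
print('Best model on this split:', best_name)
```

Putting the scaler inside the pipeline keeps test-set statistics out of the fitting step. A single split gives a noisy comparison, so cross-validation would be a more reliable way to pick the best model.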
Some example projects
Iris classification [Link1, Link2]
Machine Learning-Let’s Get Started [Link]
Your First Machine Learning Project in Python Step-By-Step [Link]
24 Data Science Projects To Boost Your Knowledge and Skills [Link]
6 Complete Machine Learning Projects [Link]
