Machine Learning
Lecture 8-9: Classification and Decision Trees
COURSE CODE: CSE451
2023
Course Teacher
Dr. Mrinal Kanti Baowaly
Associate Professor
Department of Computer Science and
Engineering, Bangabandhu Sheikh
Mujibur Rahman Science and
Technology University, Bangladesh.
Email: [email protected]
Classification: Definition
Classification is the task of learning a target function f that maps
each input x to one of the predefined class labels y.
• x: attribute, predictor, independent variable, input
• y: class, category, response, dependent variable, target variable,
output
The target function f is also known informally as a classification
model.
Examples of Classification Task
Email Spam filtering
Language detection
Search for similar documents
Sentiment analysis
Recognition of handwritten
characters and numbers
Fraud detection etc.
Types of Classification
Binary Classification: Classifying instances into one of two class
labels/categories
Multiclass Classification: Classifying instances into one of three or
more class labels/categories
Multi-Label Classification: Multiple class labels or categories are to
be predicted for each instance
Classification Techniques / Algorithms
Base Classifiers
◦ Decision Tree based Methods
◦ Rule-based Methods
◦ Nearest-neighbor
◦ Neural Networks
◦ Deep Learning
◦ Naïve Bayes and Bayesian Belief Networks
◦ Support Vector Machines
Ensemble Classifiers
◦ Boosting, Bagging, Random Forests
Decision Tree Classification
A decision tree is a type of supervised learning algorithm that is mostly used for classification problems.
It works for both categorical and continuous input and output variables.
The technique splits the population or data set into two or more homogeneous sets (sub-populations) based on the most significant splitter/differentiator among the input variables.
An Example of Decision Tree
Training data:
ID  Home Owner  Marital Status  Annual Income  Defaulted Borrower
1   Yes         Single          125K           No
2   No          Married         100K           No
3   No          Single          70K            No
4   Yes         Married         120K           No
5   No          Divorced        95K            Yes
6   No          Married         60K            No
7   Yes         Divorced        220K           No
8   No          Single          85K            Yes
9   No          Married         75K            No
10  No          Single          90K            Yes
One tree that fits this data:
• Root node Home Owner: Yes → NO; No → Marital Status
• Marital Status: Married → NO; Single, Divorced → Annual Income
• Annual Income: < 80K → NO; ≥ 80K → YES
There could be more than one tree that fits the same data (for example, another tree rooted at Marital Status)!
Apply Model to Test Data
Test data: Home Owner = No, Marital Status = Married, Annual Income = 80K, Defaulted Borrower = ?
Start from the root of the tree and follow the branches that match the test record:
1. Home Owner = No → follow the "No" branch to the Marital Status node.
2. Marital Status = Married → follow the "Married" branch to a leaf node.
3. The leaf predicts NO, so assign Defaulted Borrower = "No" (a code sketch of this traversal follows).
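A minimal sketch of this traversal, with the example tree hard-coded as nested if-statements (the dict-based record format and the classify helper are illustrative assumptions, not something from the slides):

```python
def classify(record):
    """Walk the example tree: Home Owner -> Marital Status -> Annual Income."""
    if record["Home Owner"] == "Yes":
        return "No"                                   # leaf: Defaulted = No
    if record["Marital Status"] == "Married":
        return "No"                                   # leaf: Defaulted = No
    # Single or Divorced: test Annual Income against the 80K threshold
    return "No" if record["Annual Income"] < 80 else "Yes"

test = {"Home Owner": "No", "Marital Status": "Married", "Annual Income": 80}
print(classify(test))                                 # -> "No"
```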
What is a Decision Tree?
A decision tree is a tree in which each internal node represents a feature/attribute and acts as a decision point, each link/branch represents a decision/rule, and each leaf/terminal node represents an outcome (a categorical or continuous value). The topmost decision node in a decision tree is known as the root node.
Types of Decision Trees
The type of decision tree depends on the type of the target variable:
1. Categorical/Classification decision tree: the target variable is categorical
2. Continuous/Regression decision tree: the target variable is continuous
How to construct a Decision Tree?
A decision tree is constructed in a recursive fashion by partitioning the training records into successively purer subsets.
The basic idea behind any decision tree algorithm is as follows (a code sketch follows the list):
1. Select the best attribute using an Attribute Selection Measure (e.g. information gain or Gini impurity) to split the records.
2. Make that attribute a decision node and break the dataset into smaller subsets.
3. Build the tree by repeating this process recursively for each subset until one of the following conditions is met:
• All the records belong to the same class label (make it a leaf node).
• The maximum tree depth or the minimum number of records per leaf is reached.
• There are no more attributes to split on.
• There are no more records.
4. Assign a class label to each leaf node based on the majority class of the records in that node.
5. Prune the tree to avoid overfitting.
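A simplified, self-contained sketch of this recursive procedure (an illustration under stated assumptions, not any particular textbook algorithm): records are plain dicts with a "label" key, attributes are categorical, node purity is scored with the p² + q² Gini form used later in these slides, and pruning (step 5) is omitted.

```python
from collections import Counter

def gini_score(labels):
    """Purity of a node in the p^2 + q^2 form (higher = purer)."""
    n = len(labels)
    return sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(records, attributes):
    """Step 1: pick the attribute whose partition has the highest weighted purity."""
    best_attr, best_score, best_parts = None, -1.0, None
    for attr in attributes:
        parts = {}
        for r in records:
            parts.setdefault(r[attr], []).append(r)
        score = sum(len(sub) / len(records) * gini_score([r["label"] for r in sub])
                    for sub in parts.values())
        if score > best_score:
            best_attr, best_score, best_parts = attr, score, parts
    return best_attr, best_parts

def build_tree(records, attributes, max_depth=5, min_records=2, depth=0):
    """Steps 2-4: make a decision node, split, and recurse until a stop condition."""
    labels = [r["label"] for r in records]
    majority = Counter(labels).most_common(1)[0][0]
    if (len(set(labels)) == 1 or not attributes
            or depth >= max_depth or len(records) < min_records):
        return {"label": majority}                      # leaf node (majority class)
    attr, parts = best_split(records, attributes)
    children = {value: build_tree(subset, [a for a in attributes if a != attr],
                                  max_depth, min_records, depth + 1)
                for value, subset in parts.items()}
    return {"attribute": attr, "children": children, "label": majority}
```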
Decision Tree Algorithms
Many Algorithms:
• ID3 (Iterative Dichotomiser 3) - An early Decision Tree algorithm that uses
information gain to determine the best split. It can only handle categorical
variables and can lead to overfitting.
• C4.5 - An improvement over ID3 that uses gain ratio as the splitting criterion, handles continuous
variables, and addresses overfitting through pruning.
• CART (Classification and Regression Trees) - A Decision Tree algorithm for both
classification and regression problems that uses Gini index for classification or
mean squared error for regression to determine the best split.
• C5.0 - A further improvement of C4.5 that uses boosting to improve accuracy and
handle overfitting.
• CHAID (Chi-squared Automatic Interaction Detection) - A Decision Tree
algorithm that uses the chi-squared statistic to determine the best split for
categorical variables.
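As a usage illustration, scikit-learn's DecisionTreeClassifier (a CART-style implementation) can be fit on the Home Owner / Marital Status / Annual Income data from the earlier example; the integer encoding of the categorical attributes below is an assumption made only for this sketch.

```python
# A minimal sketch (assumes scikit-learn is installed). DecisionTreeClassifier
# implements an optimized CART-style tree; criterion may be "gini" or "entropy".
from sklearn.tree import DecisionTreeClassifier, export_text

# The training table from the earlier slide, with categoricals encoded as integers
# purely for illustration (Home Owner: No=0/Yes=1; Marital Status: Single=0/Married=1/Divorced=2).
X = [[1, 0, 125], [0, 1, 100], [0, 0, 70], [1, 1, 120], [0, 2, 95],
     [0, 1, 60], [1, 2, 220], [0, 0, 85], [0, 1, 75], [0, 0, 90]]
y = ["No", "No", "No", "No", "Yes", "No", "No", "Yes", "No", "Yes"]

clf = DecisionTreeClassifier(criterion="gini", max_depth=3, random_state=0)
clf.fit(X, y)
print(export_text(clf, feature_names=["HomeOwner", "MaritalStatus", "Income"]))
print(clf.predict([[0, 1, 80]]))   # the test record used in the earlier slides
```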
An Example of Attribute Selection.
Tennis Weather: Can I play tennis today?
Outlook Temperature Humidity Windy Play
sunny hot high FALSE no
sunny hot high TRUE no
overcast hot high FALSE yes
rainy mild high FALSE yes
rainy cool normal FALSE yes
rainy cool normal TRUE no
overcast cool normal TRUE yes
sunny mild high FALSE no
sunny cool normal FALSE yes
rainy mild normal FALSE yes
sunny mild normal TRUE yes
overcast mild high TRUE yes
overcast hot normal FALSE yes
rainy mild high TRUE no
Which attribute to select?
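One way to answer this is to compute the information gain (introduced later in this lecture) of each candidate attribute on the table above; a minimal sketch for the Outlook attribute, with the Play and Outlook columns copied from the table in row order:

```python
import math
from collections import Counter

def entropy(labels):
    n = len(labels)
    return -sum(c / n * math.log2(c / n) for c in Counter(labels).values())

play = ["no", "no", "yes", "yes", "yes", "no", "yes",
        "no", "yes", "yes", "yes", "yes", "yes", "no"]
outlook = ["sunny", "sunny", "overcast", "rainy", "rainy", "rainy", "overcast",
           "sunny", "sunny", "rainy", "sunny", "overcast", "overcast", "rainy"]

# Group the Play labels by Outlook value and compare weighted entropy with the parent
groups = {}
for o, p in zip(outlook, play):
    groups.setdefault(o, []).append(p)
weighted = sum(len(g) / len(play) * entropy(g) for g in groups.values())
print(round(entropy(play) - weighted, 3))   # information gain for Outlook ~ 0.247
```

Repeating this for Temperature, Humidity and Windy gives smaller gains, which is why the Play Tennis tree on the next slide is rooted at Outlook.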
Decision Tree for Play Tennis
How to determine the Best Split
Attribute Selection Measures (ASM):
• Select the attribute to split the training records on that most increases the homogeneity of the resulting sub-nodes with respect to the target variable.
• In other words, we split so that the purity of the nodes increases with respect to the target variable.
Two popular methods:
• Gini index: used by CART
• Information gain: used by ID3 and C4.5
Study the details in the learning materials listed at the end of this lecture.
Gini Index
If we select two items from a population at random, they must be of the same class; the probability of this is 1 if the population is pure.
It works with a categorical target variable ("Success" or "Failure").
It performs only binary splits.
The higher the value of Gini, the higher the homogeneity.
CART (Classification and Regression Tree) uses the Gini method to create binary splits.
Steps to Calculate Gini for a Split
1. Calculate Gini for each sub-node using the formula $p^2 + q^2$, where p = probability of success and q = probability of failure.
2. Calculate Gini for the split as the weighted Gini score of the sub-nodes of that split (a code sketch follows).
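A direct translation of these two steps into code (a small sketch, not library code; each node is described only by its size and its success probability):

```python
def gini_node(p):
    """Step 1: Gini of a sub-node in the p^2 + q^2 form, where p = P(success)."""
    q = 1 - p
    return p ** 2 + q ** 2

def gini_split(nodes):
    """Step 2: weighted Gini of a split; nodes is a list of (size, p_success) pairs."""
    total = sum(size for size, _ in nodes)
    return sum(size / total * gini_node(p) for size, p in nodes)
```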
Gini Index: Example
We have a sample of 30 students with two input variables:
• Variable-1: Gender (Boy/Girl)
• Variable-2: Class (IX/X)
15 of the 30 students play cricket. We want to create a model to predict who will play cricket and to identify which variable gives the best (most homogeneous) split.
The split on Gender gives a Female node of 10 students (2 play cricket) and a Male node of 20 students (13 play); the split on Class gives a Class IX node of 14 students (6 play) and a Class X node of 16 students (9 play). As the next slides show, Gender identifies the most homogeneous sets.
Gini Index: Split on Gender
1. Gini for sub-node Female: $0.2^2 + 0.8^2 = 0.68$
2. Gini for sub-node Male: $0.65^2 + 0.35^2 = 0.55$
3. Weighted Gini for the split on Gender: $\frac{10}{30} \times 0.68 + \frac{20}{30} \times 0.55 = 0.59$
Gini Index: Split on Class
1. Gini for sub-node Class IX: $0.43^2 + 0.57^2 = 0.51$
2. Gini for sub-node Class X: $0.56^2 + 0.44^2 = 0.51$
3. Weighted Gini for the split on Class: $\frac{14}{30} \times 0.51 + \frac{16}{30} \times 0.51 = 0.51$
The Gini score for the split on Gender (0.59) is higher than for the split on Class (0.51); hence, the node split will take place on Gender (a quick numerical check follows).
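The slide's numbers can be reproduced with the gini_node/gini_split helpers sketched after the steps above; the node sizes and play proportions are the ones from the example.

```python
# Female node: 10 students, 20% play; Male node: 20 students, 65% play
split_gender = gini_split([(10, 0.20), (20, 0.65)])
# Class IX node: 14 students, ~43% play; Class X node: 16 students, ~56% play
split_class = gini_split([(14, 0.43), (16, 0.56)])
print(round(split_gender, 2), round(split_class, 2))   # 0.59 0.51 -> split on Gender
```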
Information Gain
Consider three nodes A, B and C containing different mixtures of classes, and think about which node can be described most easily.
It is node C, because it requires less information to describe, as all of its values are similar.
In other words, we can say that C is a pure/homogeneous node.
The degree of disorganization in a system is known as entropy.
Entropy is zero when the sample is completely homogeneous, and it is 1 when the sample is equally divided (50%-50%).
Steps to Calculate Entropy for a Split
1. Formula to calculate entropy: $-p \log_2 p - q \log_2 q$
2. Calculate the entropy of the parent node.
3. Calculate the entropy of each individual node of the split.
4. Calculate the weighted average entropy of all sub-nodes of the split (a code sketch follows).
***The lower the entropy, the better.
We can derive information gain from entropy as 1 − Entropy (since information gain = parent entropy − weighted entropy of the split, and the parent entropy is 1 in the example that follows).
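A direct translation of these steps into code (a small sketch for two-class nodes, each described by its size and its success probability):

```python
import math

def entropy(p):
    """Entropy -p*log2(p) - q*log2(q) of a two-class node, where p = P(success)."""
    q = 1 - p
    if p == 0 or q == 0:
        return 0.0               # a completely homogeneous node has zero entropy
    return -p * math.log2(p) - q * math.log2(q)

def weighted_entropy(nodes):
    """Weighted average entropy of a split; nodes is a list of (size, p_success) pairs."""
    total = sum(size for size, _ in nodes)
    return sum(size / total * entropy(p) for size, p in nodes)
```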
Information Gain: An Example
1. Entropy of the parent node: $-\frac{15}{30}\log_2\frac{15}{30} - \frac{15}{30}\log_2\frac{15}{30} = 1$
2. Entropy for the split on Gender:
Entropy of the Female node: $-\frac{2}{10}\log_2\frac{2}{10} - \frac{8}{10}\log_2\frac{8}{10} = 0.72$
Entropy of the Male node: $-\frac{13}{20}\log_2\frac{13}{20} - \frac{7}{20}\log_2\frac{7}{20} = 0.93$
Weighted entropy of the split on Gender: $\frac{10}{30} \times 0.72 + \frac{20}{30} \times 0.93 = 0.86$
3. Entropy for the split on Class:
Entropy of the Class IX node: $-\frac{6}{14}\log_2\frac{6}{14} - \frac{8}{14}\log_2\frac{8}{14} = 0.99$
Entropy of the Class X node: $-\frac{9}{16}\log_2\frac{9}{16} - \frac{7}{16}\log_2\frac{7}{16} = 0.99$
Weighted entropy of the split on Class: $\frac{14}{30} \times 0.99 + \frac{16}{30} \times 0.99 = 0.99$
The weighted entropy for the split on Gender is the lowest, so the tree will split on Gender. Deriving information gain as 1 − Entropy, the gain is about 0.14 for Gender versus about 0.01 for Class (a quick check follows).
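The slide's numbers can be reproduced with the entropy/weighted_entropy helpers sketched after the steps above:

```python
print(round(entropy(15 / 30), 2))                                  # parent node  -> 1.0
print(round(weighted_entropy([(10, 2 / 10), (20, 13 / 20)]), 2))   # Gender split -> 0.86
print(round(weighted_entropy([(14, 6 / 14), (16, 9 / 16)]), 2))    # Class split  -> 0.99
# Information gain (1 - weighted entropy here): ~0.14 for Gender vs ~0.01 for Class
```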
Adv. & Disadv. of Decision Trees
Advantages
Easy to Understand
Less data cleaning required
Can handle both numerical and categorical variables
Useful in Data exploration
Disadvantages
May contain lots of layers, which makes it complex
May have an overfitting issue
Some Learning Materials
AnalyticsVidhya: A Complete Tutorial on Tree Based Modeling from
Scratch (in R & Python)
JavaTPoint: Decision Trees Algorithms
DataCamp: Decision Tree Classification in Python
End of Lecture 8-9