Multiclass classification
• What is multiclass classification?
• Which classifiers do we use in multiclass
classification?
• How and when do we use these classifiers?
• Are multiclass and multi-label classification
similar?
Multiclass classification
• Classification involving more than two classes
• Each data point can belong to exactly one class
Are multiclass and multi-label
classification similar?
• No. In multiclass classification each instance belongs to
exactly one class, whereas in multi-label classification an
instance can carry several labels at once.
• There are mainly two types of multiclass
classification techniques:
– One vs. All (one-vs-rest)
– One vs. One
Binary Classification
• Only two classes are present in the dataset.
• It requires only one classifier model.
• The confusion matrix is easy to derive and understand.
• Examples: checking whether an email is spam, predicting gender based on height and weight.
Multi-class Classification
• Multiple class labels are present in the dataset.
• The number of classifier models depends on the classification technique we apply.
• One vs. All: for N classes, N binary classifier models
• One vs. One: for N classes, N*(N-1)/2 binary classifier models
• The confusion matrix is easy to derive but more complex to interpret.
• Example: checking whether a fruit is an apple, a banana, or an orange.
Method 1. OvR — One vs Rest
• OvR stands for “One vs Rest” and, as the name suggests, is a method to
evaluate multiclass models by comparing each class against all the
others.
• In this scenario, we take one class and treat it as our “positive” class,
while all the others (the rest) are treated as the “negative” class.
• By doing this, we reduce the multiclass classification output to a
binary one, so all the known binary classification metrics can be
used to evaluate this scenario.
• We must repeat this for each class present in the data, so for a 3-class
dataset we get 3 different OvR scores. In the end, we can average them
(simple or weighted average) to obtain a final OvR model score.
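As a rough sketch, the averaging step might look like this in Python (the per-class OvR scores and the class counts below are made-up example values, not from the slides):

```python
import numpy as np

# Hypothetical per-class one-vs-rest scores for a 3-class problem
# (each score is the binary metric obtained by treating that class
# as "positive" and the rest as "negative").
ovr_scores = np.array([0.90, 0.75, 0.80])   # assumed example values
class_counts = np.array([20, 30, 50])       # assumed class frequencies

simple_avg = ovr_scores.mean()                          # simple (macro) average
weighted_avg = np.average(ovr_scores, weights=class_counts)  # weighted average

print(round(simple_avg, 3))    # → 0.817
print(round(weighted_avg, 3))  # → 0.805
```

The weighted average gives larger classes more influence on the final score.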
Method 1. One-vs-Rest (OvR) or
One-vs-All (OvA)
• In One-vs-All classification, for an N-class dataset, we have to
generate N binary classifier models.
• The number of class labels present in the dataset and the number of
generated binary classifiers must be the same.
For example, suppose we have three classes:
type 1 for Green,
type 2 for Blue, and
type 3 for Red.
Method 1. One-vs-Rest (OvR) or
One-vs-All (OvA)
• Learn one classifier at a time
• One-vs-rest
• Given m classes, train m classifiers: one for each class
• Classifier i: treat tuples in class i as positive and all others as
negative
• To classify a tuple X, choose the classifier with the maximum
output value
• Uses a “winner-takes-all” strategy.
• Generate the same number of classifiers as there
are class labels in the dataset, so we have to
create three classifiers here, one for each of the
three classes.
– Classifier 1:- [Green] vs [Red, Blue]
– Classifier 2:- [Blue] vs [Green, Red]
– Classifier 3:- [Red] vs [Blue, Green]
• Now, to train these three classifiers, we need
to create three training datasets. Let’s say
our primary dataset is as follows:
• You can see that there are three class labels, Green, Blue, and Red, present in the
dataset. Now we have to create a training dataset for each class.
• Here, we created the training datasets by putting +1 in the class column for each
row whose feature values belong to that particular class. For the rest of the
rows, we put -1 in the class column.
Training dataset for the Green class
Consider the primary dataset. In the first row, we have the feature values x1, x2, x3, and the
corresponding class value is G, which means these feature values belong to the Green class. So we put
+1 in the class column for the Green correspondence. We then apply the same to the x10, x11, x12 input
training rows.
For the remaining rows, whose feature values do not correspond to the Green class, we put -1
in their class column.
• Training datasets for the Blue class and the Red class are built the same way.
Once we create a training dataset for each classifier, we provide it to our
classifier model and train the model by applying a learning algorithm.
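A minimal sketch of the +1/-1 relabelling described above, building one training dataset per class (the feature values and labels are made up for illustration):

```python
import numpy as np

# Toy primary dataset: feature rows and their class labels
# (G = Green, B = Blue, R = Red; values are illustrative).
X = np.array([[1.0, 2.0], [1.2, 1.8], [4.0, 0.5],
              [4.2, 0.7], [0.5, 5.0], [0.7, 5.2]])
y = np.array(["G", "G", "B", "B", "R", "R"])

training_sets = {}
for cls in np.unique(y):
    # +1 for rows of this class, -1 for the rest (as on the slides)
    y_binary = np.where(y == cls, 1, -1)
    training_sets[cls] = (X, y_binary)

print(sorted(training_sets))           # one relabelled dataset per class
print(training_sets["G"][1].tolist())  # → [1, 1, -1, -1, -1, -1]
```

Each of the three relabelled datasets would then be used to train one binary classifier.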
By comparing the probability scores, we
predict the class with the maximum
probability score as the result.
Example:
• Consider three test feature values, y1, y2, and y3, respectively.
• We pass the test data to the classifier models.
• Let's say we got the outcomes:
• Green class classifier -> positive with a probability score of 0.9
• Blue class classifier -> positive with a probability score of 0.4
• Red class classifier -> negative with a probability score of 0.5
• Hence, based on the positive responses and the decisive probability
scores, we can say that our test input belongs to the Green class.
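Picking the winner from such probability scores is just an argmax; a minimal sketch using the hypothetical scores from the example above:

```python
# Hypothetical per-class probability scores from the three
# one-vs-rest classifiers (values from the example above).
scores = {"Green": 0.9, "Blue": 0.4, "Red": 0.5}

# Winner-takes-all: pick the class with the maximum score.
predicted = max(scores, key=scores.get)
print(predicted)  # → Green
```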
• The benefit of the OvA scheme is that we only have
to train m classifiers.
• However, we have to deal with highly imbalanced
training data for each binary classifier.
[Figure: a balanced dataset vs. an imbalanced dataset]
Classification of Class-Imbalanced Data Sets
• The class-imbalance problem:
• rare positive examples but numerous negative ones, e.g.,
medical diagnosis, fraud detection, fault identification.
• Solutions:
– Oversampling: re-sampling data from the positive
class
– Undersampling: randomly eliminating tuples from the
negative class
– Synthesizing new data points for the minority class
1-2. Illustration of Oversampling and Undersampling
• Oversampling randomly replicates minority instances to increase their population.
• Undersampling randomly downsamples the majority class.
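A minimal sketch of both ideas on a toy label array (the 90/10 class counts are assumed for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy imbalanced labels: 90 negatives, 10 positives (assumed counts).
y = np.array([0] * 90 + [1] * 10)
minority_idx = np.where(y == 1)[0]
majority_idx = np.where(y == 0)[0]

# Oversampling: replicate minority indices (with replacement)
# until they match the majority count.
over_idx = rng.choice(minority_idx, size=len(majority_idx), replace=True)

# Undersampling: randomly keep only as many majority indices
# as there are minority instances.
under_idx = rng.choice(majority_idx, size=len(minority_idx), replace=False)

print(len(over_idx), len(under_idx))  # → 90 10
```

In practice the sampled indices would be used to select rows of the feature matrix as well.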
3. Synthesizing new examples (e.g., SMOTE)
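A simplified sketch of the idea behind synthesis methods such as SMOTE: create new minority-class points by interpolating between existing minority samples. This illustrates only the interpolation step, not the full SMOTE algorithm, and the sample points are made up:

```python
import numpy as np

rng = np.random.default_rng(1)

# Made-up minority-class samples in a 2-D feature space.
minority = np.array([[1.0, 1.0], [1.5, 2.0], [2.0, 1.2]])

def synthesize(points, n_new, rng):
    """Create n_new synthetic points on segments between sample pairs."""
    new_points = []
    for _ in range(n_new):
        i, j = rng.choice(len(points), size=2, replace=False)
        alpha = rng.random()  # random position along the segment
        new_points.append(points[i] + alpha * (points[j] - points[i]))
    return np.array(new_points)

synthetic = synthesize(minority, n_new=5, rng=rng)
print(synthetic.shape)  # → (5, 2)
```

Full SMOTE restricts interpolation to each point's k nearest minority neighbours; the imbalanced-learn library provides a complete implementation.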
Method 2. One-vs-One (OvO)
• OvO stands for “One vs One” and is very similar to OvR,
but instead of comparing each class with the rest, we
compare all possible two-class combinations in the
dataset.
• Let’s say we have a 3-class scenario and we choose the
combination “Class1 vs Class2” as the first one. The first
step is to get a copy of the dataset that contains only these
two classes and discard all the others.
Method 2. One-vs-One (OvO)
• Learn a classifier for each pair of classes
• Given m classes, construct m(m-1)/2 binary classifiers
• Each classifier is trained using the tuples of its two classes
• To classify a tuple x, each classifier votes; x is assigned to
the class with the maximal vote
• One classifier distinguishes each pair of classes i and j. Let f_ij
be the classifier where examples of class i are positive and
examples of class j are negative.
• In One-vs-One classification, for an N-class dataset, we have to generate
N*(N-1)/2 binary classifier models. Using this classification approach, we
split the primary dataset into one dataset for each pair of classes.
• Suppose we have a classification problem with three classes: Green, Blue, and Red (N = 3).
• We divide this problem into N*(N-1)/2 = 3 binary classifier problems:
• Classifier 1: Green vs. Blue
• Classifier 2: Green vs. Red
• Classifier 3: Blue vs. Red
• Each binary classifier predicts one class label. When we input the test data to the
classifiers, the class with the majority of the votes is the final result.
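A minimal sketch of the majority-vote step, assuming hypothetical outcomes for the three pairwise classifiers on one test point:

```python
from itertools import combinations

classes = ["Green", "Blue", "Red"]

# Hypothetical pairwise decisions: each binary classifier outputs the
# winner of its pair for one test point (these outcomes are made up).
pair_winners = {
    ("Green", "Blue"): "Green",
    ("Green", "Red"): "Green",
    ("Blue", "Red"): "Red",
}

# N*(N-1)/2 classifiers, each casting one vote.
votes = {c: 0 for c in classes}
for pair in combinations(classes, 2):
    votes[pair_winners[pair]] += 1

predicted = max(votes, key=votes.get)
print(votes, predicted)  # → {'Green': 2, 'Blue': 0, 'Red': 1} Green
```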
Assessing Multi-class classification
Performance
• If we have k classes, performance of a classifier can be
assessed using a k-by-k contingency table.
• Classifier’s accuracy: sum of the main diagonal of the
contingency table, divided by the number of test instances.
• Ex: For the given confusion matrix, calculate
the accuracy, precision, and recall.
• Accuracy: (15 + 15 + 45)/100 = 0.75
• Per-class precision: 15/24 = 0.63 for the first class, 15/20 = 0.75
for the second class, 45/56 = 0.80 for the third class
• Per-class recall: 15/20 = 0.75 for the first class, 15/30 = 0.50 for the
second class, 45/50 = 0.90 for the third class
• Average these numbers to obtain single precision and recall
values for the whole classifier,
• or take a weighted average that accounts for the proportion of each
class.
• For instance, the weighted average precision is
0.20·0.63 + 0.30·0.75 + 0.50·0.80 = 0.75.
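The worked example can be reproduced in code. Note the slides give only the diagonal and the row/column totals, so the off-diagonal split in the matrix below is an assumption consistent with those totals:

```python
import numpy as np

# Confusion matrix (rows = true class, columns = predicted class).
# Diagonal and marginals match the slides; off-diagonals are assumed.
cm = np.array([[15, 3, 2],
               [6, 15, 9],
               [3, 2, 45]])

diag = np.diag(cm)
precision = diag / cm.sum(axis=0)  # per class: TP / column total
recall = diag / cm.sum(axis=1)     # per class: TP / row total
accuracy = diag.sum() / cm.sum()

weights = cm.sum(axis=1) / cm.sum()           # class proportions
weighted_precision = np.average(precision, weights=weights)

print(round(accuracy, 2))            # → 0.75
print(recall.tolist())               # → [0.75, 0.5, 0.9]
print(round(weighted_precision, 2))  # → 0.75
```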
Problem Solving
Performance of multi-class classifiers
[Figure: worked exercise identifying the true positive, false positive, true negative, and false negative regions for classes A, B, and D in a multi-class confusion matrix, and computing the accuracy]
Precision-Recall Curves
• Precision-recall is a useful measure of success
for prediction when the classes are
imbalanced.
• Precision is a measure of the ability of a
classification model to identify only the
relevant data points, while recall is a measure
of the ability of a model to find all the relevant
cases within a data set.
• The precision-recall curve shows the trade-off
between precision and recall for different
thresholds.
• A high area under the curve represents both
high recall and high precision, where high
precision relates to a low false positive rate,
and high recall relates to a low false negative
rate.
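A minimal sketch of how precision-recall pairs arise at different thresholds (the labels and scores are made-up illustrative values; in practice scikit-learn's `precision_recall_curve` computes this directly):

```python
import numpy as np

# Made-up binary labels and classifier scores for illustration.
y_true = np.array([0, 0, 1, 1, 1, 0, 1, 0])
y_score = np.array([0.1, 0.4, 0.35, 0.8, 0.7, 0.2, 0.9, 0.3])

points = []
for t in np.sort(np.unique(y_score)):
    pred = (y_score >= t).astype(int)       # positive above the threshold
    tp = np.sum((pred == 1) & (y_true == 1))
    fp = np.sum((pred == 1) & (y_true == 0))
    fn = np.sum((pred == 0) & (y_true == 1))
    points.append((float(t), tp / (tp + fp), tp / (tp + fn)))

for t, p, r in points:
    print(f"threshold={t:.2f}  precision={p:.2f}  recall={r:.2f}")
```

Raising the threshold trades recall for precision, which is exactly the trade-off the curve visualizes.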
The F1 score is a good metric when data is imbalanced
• The F1 score is the harmonic mean of precision and recall:
F1 = 2 · (precision · recall) / (precision + recall)
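A one-line illustration of the harmonic mean, with assumed example values for precision and recall:

```python
# Assumed example values for precision and recall.
precision, recall = 0.75, 0.60

# F1 is the harmonic mean of the two.
f1 = 2 * precision * recall / (precision + recall)
print(round(f1, 3))  # → 0.667
```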