Confusion Matrix & Performance Measurement Metrics
Al Amin Biswas
Lecturer, CSE, DIU
Confusion Matrix
A confusion matrix for two classes (+, -) is shown below.

                         Predicted Class
                         +            -
    Actual class    +    TP (f++)     FN (f+-)
                    -    FP (f-+)     TN (f--)
There are four quadrants in the confusion matrix, which are symbolized as
below.
True Positive (TP: f++) : The number of instances that were positive (+) and
correctly classified as positive (+).
False Negative (FN: f+-): The number of instances that were positive (+) and
incorrectly classified as negative (-).
False Positive (FP: f-+): The number of instances that were negative (-) and
incorrectly classified as positive (+).
True Negative (TN: f--): The number of instances that were negative (-)
and correctly classified as negative (-).
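
As a quick illustration, the four counts can be obtained by comparing each predicted label against the corresponding actual label. The following is a minimal Python sketch, assuming labels are encoded as 1 for positive (+) and 0 for negative (-); the names y_true, y_pred, and confusion_counts are illustrative, not part of the slides.

# Count the four confusion-matrix quadrants from actual and predicted labels.
# Assumption: 1 encodes the positive (+) class, 0 the negative (-) class.
def confusion_counts(y_true, y_pred):
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)  # f++
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)  # f+-
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)  # f-+
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)  # f--
    return tp, fn, fp, tn

y_true = [1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 1, 0]
print(confusion_counts(y_true, y_pred))  # (2, 1, 1, 2)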
Confusion Matrix
Note:
Np = TP (f++) + FN (f+-) is the total number of positive instances.
Nn = FP (f-+) + TN (f--) is the total number of negative instances.
N = Np + Nn is the total number of instances.
(TP + TN) is the number of correct classifications.
(FP + FN) is the number of classification errors.
Confusion Matrix Example
For example,

                         Predicted Class
                         +            -
    Actual class    +    52 (TP)      18 (FN)
                    -    21 (FP)      123 (TN)
Calculate the performance evaluation metrics
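
As a first step, the totals defined in the earlier Note can be written out for these counts. Below is a small Python sketch of that arithmetic (the individual metrics themselves are defined on the slides that follow).

# Totals for the example confusion matrix.
TP, FN, FP, TN = 52, 18, 21, 123

Np = TP + FN         # total positive instances = 70
Nn = FP + TN         # total negative instances = 144
N = Np + Nn          # total instances = 214

correct = TP + TN    # correct classifications = 175
errors = FP + FN     # classification errors = 39
print(Np, Nn, N, correct, errors)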
Accuracy
It is defined as the fraction of examples that are correctly classified by the classifier out of the total number of instances.
Accuracy = (TP + TN) / (TP + TN + FP + FN) = (TP + TN) / N
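
For the example counts above, this works out as follows (continuing the earlier sketch).

# Accuracy for the example confusion matrix.
TP, FN, FP, TN = 52, 18, 21, 123
accuracy = (TP + TN) / (TP + TN + FP + FN)   # 175 / 214
print(round(accuracy, 4))                    # 0.8178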
Performance Evaluation Metrics
We now define a number of metrics for the measurement of a classifier.
In our discussion, we shall make the assumption that there are only two classes: + (positive) and - (negative).
True Positive Rate (TPR): It is defined as the fraction of the positive examples predicted correctly by the classifier.
TPR = TP / (TP + FN)
This metric is also known as Recall, Sensitivity, or Hit Rate.
False Positive Rate (FPR): It is defined as the fraction of negative examples classified as positive by the classifier.
FPR = FP / (FP + TN)
Performance Evaluation Metrics
False Negative Rate (FNR): It is defined as the fraction of positive examples classified as negative by the classifier.
FNR = FN / (FN + TP)
True Negative Rate (TNR): It is defined as the fraction of negative examples classified correctly by the classifier.
TNR = TN / (TN + FP)
This metric is also known as Specificity.
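
Applied to the example confusion matrix, the four rates can be computed and cross-checked as in the sketch below; by definition TPR + FNR = 1 and FPR + TNR = 1.

# The four rates for the example counts.
TP, FN, FP, TN = 52, 18, 21, 123

TPR = TP / (TP + FN)    # 52/70   ≈ 0.743 (Recall / Sensitivity)
FNR = FN / (FN + TP)    # 18/70   ≈ 0.257
FPR = FP / (FP + TN)    # 21/144  ≈ 0.146
TNR = TN / (TN + FP)    # 123/144 ≈ 0.854 (Specificity)

assert abs(TPR + FNR - 1) < 1e-9 and abs(FPR + TNR - 1) < 1e-9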
Performance Evaluation Metrics
Precision and Recall are defined as follows.
Precision (p) = TP / (TP + FP)
Recall (r) = TP / (TP + FN)
Performance Evaluation Metrics
F1 Score (F1): Recall (r) and Precision (p) are two widely used metrics employed in analysis.
The F1 score is defined in terms of r (Recall) and p (Precision) as follows.
F1 = 2rp / (r + p) = 2TP / (2TP + FP + FN)
Note:
F1 represents the harmonic mean between Recall and Precision.
A high value of the F1 score ensures that both Precision and Recall are reasonably high.
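
For the example confusion matrix, Precision, Recall, and F1 work out as below; the harmonic-mean form and the count-based form of F1 agree, as this short Python sketch shows.

# Precision, Recall, and F1 for the example counts.
TP, FN, FP, TN = 52, 18, 21, 123

p = TP / (TP + FP)                          # 52/73  ≈ 0.712
r = TP / (TP + FN)                          # 52/70  ≈ 0.743
f1_harmonic = 2 * p * r / (p + r)           # harmonic mean of p and r
f1_counts = 2 * TP / (2 * TP + FP + FN)     # 104/143 ≈ 0.727

assert abs(f1_harmonic - f1_counts) < 1e-9
print(round(p, 3), round(r, 3), round(f1_harmonic, 3))  # 0.712 0.743 0.727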
Analysis with Performance Measurement Metrics
Based on the various performance metrics, we can characterize a classifier.
We do it in terms of TPR, FPR, Precision, Recall, and Accuracy.
Case 1: Perfect Classifier
When every instance is correctly classified, it is called the perfect classifier. In this case, TP = P, TN = N, and the confusion matrix is

                         Predicted Class
                         +      -
    Actual class    +    P      0
                    -    0      N

TPR = TP / (TP + FN) = P / P = 1
FPR = FP / (FP + TN) = 0 / N = 0
Precision = TP / (TP + FP) = P / P = 1
F1 Score = 2TP / (2TP + FP + FN) = 1
Accuracy = (TP + TN) / (P + N) = 1
Analysis with Performance Measurement Metrics
Case 2: Worst Classifier
When every instance is wrongly classified, it is called the worst classifier. In this case, TP = 0, TN = 0, and the confusion matrix is

                         Predicted Class
                         +      -
    Actual class    +    0      P
                    -    N      0

TPR = TP / (TP + FN) = 0 / P = 0
FPR = FP / (FP + TN) = N / N = 1
Precision = TP / (TP + FP) = 0 / N = 0
F1 Score = Not applicable, as Recall + Precision = 0
Accuracy = (TP + TN) / (P + N) = 0
Analysis with Performance Measurement Metrics
Case 3: Ultra-Liberal Classifier
This classifier always predicts the + class for every instance, so every positive instance is classified correctly. Here, the False Negative (FN) and True Negative (TN) counts are zero. The confusion matrix is

                         Predicted Class
                         +      -
    Actual class    +    P      0
                    -    N      0

TPR = TP / (TP + FN) = P / P = 1
FPR = FP / (FP + TN) = N / N = 1
Precision = TP / (TP + FP) = P / (P + N)
F1 Score = 2TP / (2TP + FP + FN) = 2P / (2P + N)
Accuracy = (TP + TN) / (P + N) = P / (P + N)
Analysis with Performance Measurement Metrics
Case 4: Ultra-Conservative Classifier
This classifier always predicts the - class for every instance, so every negative instance is classified correctly. Here, the True Positive (TP) and False Positive (FP) counts are zero. The confusion matrix is

                         Predicted Class
                         +      -
    Actual class    +    0      P
                    -    0      N

TPR = TP / (TP + FN) = 0 / P = 0
FPR = FP / (FP + TN) = 0 / N = 0
Precision = Not defined, as TP + FP = 0
F1 Score = Not applicable, as Precision is not defined
Accuracy = (TP + TN) / (P + N) = N / (P + N)
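
A small sketch that reproduces Cases 1 to 4 is given below. It assumes illustrative class totals P = 70 and N = 144 (taken from the earlier example) and returns None for any metric whose denominator is zero, mirroring the "not applicable" and "not defined" entries above; the helper name metrics is not from the slides.

# Evaluate the four idealized classifiers from Cases 1-4.
def metrics(tp, fn, fp, tn):
    def ratio(num, den):
        return num / den if den else None   # None where a denominator is zero
    tpr = ratio(tp, tp + fn)
    fpr = ratio(fp, fp + tn)
    prec = ratio(tp, tp + fp)
    # F1 as the harmonic mean of Precision and Recall (TPR)
    f1 = ratio(2 * prec * tpr, prec + tpr) if prec is not None and tpr is not None else None
    acc = (tp + tn) / (tp + fn + fp + tn)
    return tpr, fpr, prec, f1, acc

P, N = 70, 144   # actual positives and negatives, as in the earlier example
cases = {
    "Case 1: perfect":            (P, 0, 0, N),   # (TP, FN, FP, TN)
    "Case 2: worst":              (0, P, N, 0),
    "Case 3: ultra-liberal":      (P, 0, N, 0),
    "Case 4: ultra-conservative": (0, P, 0, N),
}
for name, counts in cases.items():
    print(name, metrics(*counts))

The printed values match the analysis above: the perfect classifier scores 1 on every metric; the worst classifier has TPR = Precision = Accuracy = 0 and FPR = 1, with F1 undefined; the ultra-liberal classifier has TPR = FPR = 1 with Accuracy = P / (P + N); and the ultra-conservative classifier has TPR = FPR = 0 with Accuracy = N / (P + N).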