Performance Metrics in Machine Learning
Evaluating the performance of a machine learning model is one of the most important steps in
building an effective ML model. To evaluate the performance or quality of the model, different
metrics are used, and these metrics are known as performance metrics or evaluation metrics.
Performance metrics help us understand how well the model has performed on the given data,
and they also guide hyperparameter tuning to improve the model's performance. Every ML model
aims to generalize well to unseen/new data, and performance metrics help determine how well
the model generalizes to a new dataset.
1. Performance Metrics for Classification
In a classification problem, the category or class of the data is identified based on training data.
The model learns from the given dataset and then classifies the new data into classes or groups
based on the training. It predicts class labels as the output, such as Yes or No, 0 or 1, Spam or
Not Spam, etc. To evaluate the performance of a classification model, different metrics are
used, and some of them are as follows:
Accuracy
Confusion Matrix
Precision
Recall
F-Score
AUC-ROC (Area Under the ROC Curve)
I. Accuracy
The accuracy metric is one of the simplest classification metrics to implement, and it is defined
as the ratio of the number of correct predictions to the total number of predictions.
It can be formulated as:
Accuracy = Number of correct predictions / Total number of predictions
When to Use Accuracy?
It is good to use the Accuracy metric when the target variable classes in the data are approximately
balanced. For example, if 60% of the images in a fruit dataset are of apples and 40% are of
mangoes, the classes are roughly balanced, so accuracy gives a fair picture of how often the
model correctly tells apples from mangoes.
When not to use Accuracy?
It is recommended not to use the Accuracy measure when the target variable mostly belongs
to one class. For example, suppose there is a disease-prediction model in which, out of
100 people, only five people have the disease and 95 don't. In this case, if the model simply
predicts that no one has the disease (a useless model), the accuracy will still be 95%, which is
misleading. A sketch of this pitfall is shown below.
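As a minimal sketch (assuming scikit-learn, which the text itself doesn't name, and made-up labels that mirror the disease example above), accuracy can be computed like this:

```python
from sklearn.metrics import accuracy_score

# Hypothetical labels: 5 patients with the disease (1), 95 without (0)
y_true = [1] * 5 + [0] * 95
# A model that blindly predicts "no disease" for everyone
y_pred = [0] * 100

# Accuracy = correct predictions / total predictions
print(accuracy_score(y_true, y_pred))  # 0.95, even though every diseased patient is missed
```

This is exactly why accuracy alone is misleading on imbalanced data.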
II. Confusion Matrix
A confusion matrix is a tabular representation of prediction outcomes of any binary classifier,
which is used to describe the performance of the classification model on a set of test data when
true values are known.
The confusion matrix is simple to implement, but the terminologies used in this matrix might be
confusing for beginners.
A typical confusion matrix for a binary classifier has the layout shown below (it can be extended
to classifiers with more than two classes):

                Predicted: No          Predicted: Yes
Actual: No      True Negative (TN)     False Positive (FP)
Actual: Yes     False Negative (FN)    True Positive (TP)
We can determine the following from the matrix:
In the matrix, the columns hold the predicted values and the rows hold the actual values. Both
the actual and the predicted outcomes take two possible classes, Yes or No. So, if we are
predicting the presence of a disease in a patient, a prediction of Yes means the patient has the
disease, and No means the patient doesn't have the disease.
For example, suppose the matrix summarizes 165 predictions, out of which the model predicted
Yes 110 times and No 55 times. In reality, there are 60 cases in which the patients don't have
the disease and 105 cases in which they do.
True Positive (TP) signifies how many positive class samples your model
predicted correctly.
True Negative (TN) signifies how many negative class samples your model
predicted correctly.
False Positive (FP) signifies how many negative class samples your model
incorrectly predicted as positive. In statistical nomenclature this represents a
Type-I error (its exact position in the confusion matrix depends on how the
null hypothesis is framed).
False Negative (FN) signifies how many positive class samples your model
incorrectly predicted as negative. This corresponds to a Type-II error.
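A minimal sketch of computing a confusion matrix with scikit-learn (the labels below are made up for illustration; scikit-learn is an assumption, not something the text names):

```python
from sklearn.metrics import confusion_matrix

# Hypothetical actual and predicted labels (1 = disease, 0 = no disease)
y_true = [1, 0, 1, 1, 0, 1, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 0, 1, 1, 0, 1, 0]

# For binary labels [0, 1], rows are actual classes and columns are predicted classes;
# ravel() flattens the 2x2 matrix into TN, FP, FN, TP
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print(f"TN={tn}, FP={fp}, FN={fn}, TP={tp}")
```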
III. Precision
The precision metric is used to overcome the limitation of Accuracy. Precision
determines the proportion of positive predictions that were actually correct. It
is calculated as the number of true positives divided by the total number of
positive predictions (true positives plus false positives):
Precision = TP / (TP + FP)
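A small sketch (again assuming scikit-learn and made-up labels) showing the same calculation in code:

```python
from sklearn.metrics import precision_score

# Hypothetical binary labels (1 = positive class)
y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 1, 1, 0, 0, 1, 1, 0]

# Precision = TP / (TP + FP)
print(precision_score(y_true, y_pred))
```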
IV. Recall or Sensitivity
It is similar to the Precision metric; however, it aims to calculate the
proportion of actual positives that were identified correctly. It is calculated
as the number of true positives divided by the total number of actual
positives, i.e., those correctly predicted as positive plus those incorrectly
predicted as negative (true positives plus false negatives).
The formula for calculating Recall is given below:
Recall = TP / (TP + FN)
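And a matching sketch for recall, using the same hypothetical labels as above:

```python
from sklearn.metrics import recall_score

# Hypothetical binary labels (1 = positive class)
y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 1, 1, 0, 0, 1, 1, 0]

# Recall = TP / (TP + FN)
print(recall_score(y_true, y_pred))
```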
V. F-Score
F-score or F1 Score is a metric to evaluate a binary classification model on
the basis of the predictions made for the positive class. It is calculated from
Precision and Recall and condenses both into a single score: the F1 Score is
the harmonic mean of precision and recall, assigning equal weight to each of
them.
The formula for calculating the F1 score is given below:
F1 Score = 2 × (Precision × Recall) / (Precision + Recall)
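A short sketch (same assumptions as in the earlier snippets) computing the F1 score directly:

```python
from sklearn.metrics import f1_score

# Hypothetical binary labels (1 = positive class)
y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 1, 1, 0, 0, 1, 1, 0]

# F1 = 2 * (precision * recall) / (precision + recall)
print(f1_score(y_true, y_pred))
```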
VI. AUC-ROC
Sometimes we need to visualize the performance of the classification model
on charts; then, we can use the AUC-ROC curve. It is one of the popular and
important metrics for evaluating the performance of the classification model.
First, let's understand the ROC (Receiver Operating Characteristic) curve.
The ROC curve is a graph that shows the performance of a classification
model at different threshold levels. The curve is plotted between two
parameters, which are:
o True Positive Rate
o False Positive Rate
TPR or True Positive Rate is a synonym for Recall and can be calculated as:
TPR = TP / (TP + FN)
FPR or False Positive Rate can be calculated as:
FPR = FP / (FP + TN)
AUC (Area Under the Curve) measures the area under the ROC curve and summarizes, in a single
number between 0 and 1, how well the model separates the two classes across all thresholds.
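A minimal sketch (assuming scikit-learn and made-up scores) of computing the ROC curve points and the AUC:

```python
from sklearn.metrics import roc_curve, roc_auc_score

# Hypothetical true labels and predicted probabilities for the positive class
y_true = [0, 0, 1, 1, 0, 1, 0, 1]
y_scores = [0.1, 0.4, 0.35, 0.8, 0.2, 0.9, 0.6, 0.7]

# FPR and TPR at each candidate threshold (the two axes of the ROC curve)
fpr, tpr, thresholds = roc_curve(y_true, y_scores)
print(fpr, tpr)

# AUC summarizes the whole curve as a single number between 0 and 1
print(roc_auc_score(y_true, y_scores))
```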
2. Performance Metrics for Regression
Regression is a supervised learning technique that aims to find the
relationships between the dependent and independent variables. A
predictive regression model predicts a numeric or continuous value. The metrics
used for regression are different from the classification metrics.
Mean Absolute Error
Mean Squared Error
R2 Score
I. Mean Absolute Error (MAE)
Mean Absolute Error or MAE is one of the simplest metrics; it measures the
average absolute difference between the actual and predicted values, where
absolute means the sign of the difference is ignored (the difference is always
taken as positive).
To understand MAE, let's take an example of Linear Regression, where the
model draws a best fit line between dependent and independent variables.
To measure the MAE or error in prediction, we need to calculate the
difference between actual values and predicted values.
The below formula is used to calculate MAE:
MAE = (1/N) × Σ |Y − Y'|
Here,
Y is the Actual outcome, Y' is the predicted outcome, and N is the total number of data points.
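A quick sketch (assuming scikit-learn and a few made-up values) of the same calculation:

```python
from sklearn.metrics import mean_absolute_error

# Hypothetical actual (Y) and predicted (Y') values
y_true = [3.0, -0.5, 2.0, 7.0]
y_pred = [2.5, 0.0, 2.0, 8.0]

# MAE = (1/N) * sum(|Y - Y'|)
print(mean_absolute_error(y_true, y_pred))  # 0.5
```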
II. Mean Squared Error
Mean Squared Error or MSE is one of the most suitable metrics for regression evaluation. It
measures the average of the squared differences between the predicted values and the actual
values.
Since the errors are squared in MSE, the metric can never be negative; it is zero only when
every prediction exactly matches the actual value.
The formula for calculating MSE is given below:
MSE = (1/N) × Σ (Y − Y')²
Here,
Y is the Actual outcome, Y' is the predicted outcome, and N is the total number of data points.
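The same sketch for MSE, with the same hypothetical values:

```python
from sklearn.metrics import mean_squared_error

# Hypothetical actual (Y) and predicted (Y') values
y_true = [3.0, -0.5, 2.0, 7.0]
y_pred = [2.5, 0.0, 2.0, 8.0]

# MSE = (1/N) * sum((Y - Y')^2)
print(mean_squared_error(y_true, y_pred))  # 0.375
```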
III. R Squared Score
R squared, also known as the Coefficient of Determination, is another popular metric used for
regression model evaluation. It measures the proportion of the variance in the dependent
variable that is explained by the model.
The R squared score is always less than or equal to 1, regardless of how large or small the
target values are; a score close to 1 indicates a good fit.
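And a final sketch for the R squared score, again with the same hypothetical values:

```python
from sklearn.metrics import r2_score

# Hypothetical actual (Y) and predicted (Y') values
y_true = [3.0, -0.5, 2.0, 7.0]
y_pred = [2.5, 0.0, 2.0, 8.0]

# R^2 = 1 - (sum of squared residuals / total sum of squares)
print(r2_score(y_true, y_pred))  # close to 1 here, indicating a good fit
```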