What is evaluation?
Model evaluation is the process of using different evaluation metrics to understand a
machine learning model’s performance. An AI model improves with constructive
feedback: you build a model, get feedback from the metrics, make improvements, and
continue until you achieve the desired accuracy.
What is model evaluation?
The evaluation process uses different evaluation metrics to understand a machine
learning model’s performance, strengths, and weaknesses. Evaluation means judging how
reliable an AI model is by feeding a test dataset into the model and comparing its
outputs with the actual answers. Different evaluation techniques can be used, depending
on the type and purpose of the model.
Why do we need model evaluation?
Evaluation methods help us assess and choose the best model during the modeling
process. Model evaluation is like giving your AI model a report card: it helps you
understand its strengths, weaknesses, and suitability for the task at hand. This
feedback loop is essential for building trustworthy and reliable AI systems.
Splitting the dataset for evaluation
The train-test split is a technique for evaluating the performance of a machine learning
algorithm. It can be used for any supervised learning algorithm. The technique divides
the dataset into a training set and a testing set.
Why do we need a train-test split?
The training dataset is used to make the model learn; the input elements of the test
dataset are then provided to the trained model. The model makes predictions, and the
predicted values are compared to the expected values. The objective is to estimate the
performance of the machine learning model on new data: data not used to train the
model.
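Below is a minimal sketch of a train-test split, assuming the scikit-learn library; the example dataset, the 80/20 split ratio, and the choice of model are illustrative assumptions, not part of the original text.

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

# Example dataset (assumed for illustration).
X, y = load_iris(return_X_y=True)

# Hold out 20% of the data for testing; the model never sees it during training.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Any supervised learning algorithm could be used here.
model = LogisticRegression(max_iter=1000)
model.fit(X_train, y_train)          # learn only from the training set

# Compare predictions on the unseen test set with the expected values.
predictions = model.predict(X_test)
print("Test accuracy:", accuracy_score(y_test, predictions))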
Accuracy and Error
In AI model evaluation, accuracy and error are key metrics that help us understand
how well a model performs and identify areas for improvement. Higher accuracy means a
better model, while lower error indicates fewer mistakes.
Accuracy – Accuracy is an evaluation metric that measures how many predictions a model
gets right. Accuracy and model performance are directly proportional: the better the
model performs, the more accurate its predictions.
Error – Error can be described as a prediction that is inaccurate or wrong. In Machine
Learning, error measures how far the model’s predictions are from the actual values on
new, unseen data. Based on the error, we choose the machine learning model that
performs best for a particular dataset.
How to find the accuracy of an AI model
To find the accuracy of an AI model, we first calculate the percentage of correct
predictions made by the model on the testing dataset. The formulas to find the accuracy are:
Error = |Actual – Predicted|
Error Rate = Error / Actual value
Accuracy = 1 – Error Rate
Accuracy in percentage = Accuracy * 100
Given values:
Predicted House Price = 391k
Actual House Price = 402k
Step 1: Calculate Absolute Error
Error: 402k−391k = 11k
Step 2: Calculate Error Rate
Error Rate: 11k / 402k = 0.0274
Step 3: Calculate Accuracy
Accuracy: 1 – 0.0274 = 0.9726
Step 4: Convert to Percentage
Accuracy in percentage: 0.9726 × 100 ≈ 97.3%
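The same calculation can be written as a short Python sketch; the values are the house prices from the example above.

actual = 402       # actual house price (in thousands)
predicted = 391    # predicted house price (in thousands)

error = abs(actual - predicted)   # Step 1: absolute error = 11
error_rate = error / actual       # Step 2: 11 / 402 ≈ 0.0274
accuracy = 1 - error_rate         # Step 3: ≈ 0.9726
print(f"Accuracy: {accuracy * 100:.1f}%")   # Step 4: ≈ 97.3%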
Evaluation metrics for classification
What is Classification?
In artificial intelligence, classification is a technique that organizes data into categories.
It is a type of machine learning that uses algorithms to sort data into predefined classes.
Classification metrics
Classification metrics are used to evaluate the performance of a classification model in
machine learning; in other words, they are performance measures used to judge the
effectiveness of the model.
Popular metrics used for classification models:
Confusion matrix
Classification accuracy
Precision
Recall
F1 Score
1. What is a confusion matrix?
The confusion matrix is a handy presentation of the accuracy of a model with two or
more classes. The comparison between prediction and reality is recorded in what we call
the confusion matrix, which allows us to understand the prediction results.
It consists of four values:
True Positive (TP): Correctly predicted positive cases.
False Negative (FN): Model predicted negative, but it was actually positive.
False Positive (FP): Model predicted positive, but it was actually negative.
True Negative (TN): Correctly predicted negative cases.
Prediction and Reality can be easily mapped together with the help of this confusion
matrix.
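As a hedged illustration, the sketch below builds a confusion matrix with scikit-learn; the actual and predicted labels are made up purely for this example (1 = positive, 0 = negative).

from sklearn.metrics import confusion_matrix

# Made-up labels for illustration only.
actual    = [1, 0, 1, 1, 0, 1, 0, 0]
predicted = [1, 0, 0, 1, 0, 1, 1, 0]

# With labels=[1, 0], rows are the actual classes and columns the predicted classes.
cm = confusion_matrix(actual, predicted, labels=[1, 0])
tp, fn = cm[0]   # actual positive: predicted positive (TP) or negative (FN)
fp, tn = cm[1]   # actual negative: predicted positive (FP) or negative (TN)
print(f"TP={tp}, FN={fn}, FP={fp}, TN={tn}")   # TP=3, FN=1, FP=1, TN=3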
2. Classification accuracy
Classification accuracy counts the total number of correct predictions made by a model,
i.e. how many of the model’s predictions were accurate. Accuracy considers both the
True Positives and the True Negatives:
Accuracy = (TP + TN) / Total observations
Here, total observations cover all the possible cases of prediction: True Positive (TP),
True Negative (TN), False Positive (FP) and False Negative (FN).
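For example, using some hypothetical counts (TP = 3, TN = 3, FP = 1, FN = 1), the calculation in Python would be:

tp, tn, fp, fn = 3, 3, 1, 1   # hypothetical counts, for illustration only
accuracy = (tp + tn) / (tp + tn + fp + fn)
print(f"Classification accuracy = {accuracy:.2f}")   # 6 / 8 = 0.75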
3. Precision
Precision is defined as the percentage of true positive cases out of all the cases where
the prediction is positive. That is, it takes into account the True Positives and the
False Positives:
Precision = TP / (TP + FP)
4. Recall
Recall can be described as the percentage of actual positive cases that the model
correctly detected. It heavily weighs the scenarios where a fire actually existed in
reality, whether or not the machine recognized it. That is, it takes into account both
the False Negatives (there was a forest fire but the model did not predict it) and the
True Positives (there was a forest fire in reality and the model predicted a forest fire):
Recall = TP / (TP + FN)
5. F1 Score
The F1 score can be defined as a measure of the balance between precision and recall;
it provides a way to combine precision and recall into a single measure that captures
both properties:
F1 Score = 2 * Precision * Recall / (Precision + Recall)
Take a look at the formula and think: when can we get a perfect F1 score?
An ideal situation would be when we have a value of 1 (that is, 100%) for both Precision
and Recall. In that case, the F1 score would also be an ideal 1 (100%), which is known as
the perfect F1 score. As the values of both Precision and Recall range from 0 to 1, the
F1 score also ranges from 0 to 1.
Let us explore the variations we can have in the F1 Score:
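The short Python sketch below computes the F1 score for a few assumed precision/recall combinations (the values are chosen purely for illustration) to show how the score behaves.

def f1_score(precision, recall):
    # F1 = 2 * P * R / (P + R); taken as 0 when both are 0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Assumed precision/recall combinations, for illustration only.
for p, r in [(1.0, 1.0), (0.9, 0.9), (1.0, 0.1), (0.5, 0.5)]:
    print(f"Precision={p}, Recall={r} -> F1={f1_score(p, r):.2f}")

# Output: 1.00, 0.90, 0.18, 0.50. The F1 score is high only when
# both Precision and Recall are high; one low value pulls it down.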
Let us look at one question.
Draw the confusion matrix for the following data:
1. The number of True Positives (TP) = 100
2. The number of True Negatives (TN) = 47
3. The number of False Positives (FP) = 62
4. The number of False Negatives (FN) = 290
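Using one common layout (reality along the rows and prediction along the columns), the confusion matrix would be:

                   Predicted: Yes    Predicted: No
Reality: Yes       TP = 100          FN = 290
Reality: No        FP = 62           TN = 47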
Question: An AI model made the following sales predictions for a new mobile phone that
was recently launched:
Answer:
(i) The total number of wrong predictions made by the model is the sum of the False
Positives and the False Negatives.
FP + FN = 0 + 100 = 100
(ii) Before calculating, we will first recall the formulas for precision, recall, and F1 score.
Precision = TP / (TP + FP)
= 900 / (900 + 0)
= 900 / 900
= 1.0
Recall = TP / (TP + FN)
= 900 / (900 + 100)
= 900 / 1000
= 0.9
F1 Score = 2 * Precision * Recall / (Precision + Recall)
= 2 * 1.0 * 0.9 / (1.0 + 0.9)
= 1.8 / 1.9
= 0.947
So the F1 score of the model is 0.947, i.e. 94.7%.
Question: An AI model made the following sales predictions for a new mobile phone that
was recently launched:
Answer:
(i) The total number of wrong predictions made by the model is the sum of the False
Positives and the False Negatives.
FP + FN = 40 + 12 = 52
(ii) Before calculating, we will first recall the formulas for precision, recall, and F1 score.
Precision = TP / (TP + FP)
= 50 / (50 + 40)
= 50 / 90
= 0.56
Recall = TP / (TP + FN)
= 50 / (50 + 12)
= 50 / 62
= 0.81
F1 Score = 2 * Precision * Recall / (Precision + Recall)
= 2 * 0.56 * 0.81 / (0.56 + 0.81)
= 0.907 / 1.37
≈ 0.66
So the F1 score of the model is approximately 0.66, i.e. 66%.
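As a check, the same metrics can be computed with a small Python sketch using the counts from this question (TP = 50, FP = 40, FN = 12):

tp, fp, fn = 50, 40, 12   # counts taken from the question above

precision = tp / (tp + fp)                          # 50 / 90 ≈ 0.56
recall = tp / (tp + fn)                             # 50 / 62 ≈ 0.81
f1 = 2 * precision * recall / (precision + recall)  # ≈ 0.66

print(f"Precision = {precision:.2f}")
print(f"Recall    = {recall:.2f}")
print(f"F1 Score  = {f1:.2f}")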
Which metric is appropriate to evaluate the AI model?
Let us compare the metrics to see which one is most appropriate for evaluating a model.
The choice depends on the use case: recall matters more when missing a positive case is
costly (for example, failing to detect a forest fire), while precision matters more when
false alarms are costly; the F1 score is useful when we want a balance between the two.
Ethical concerns around model evaluation
Ethical concerns around model evaluation primarily focus on three aspects: bias,
transparency, and accuracy. Nowadays, we are moving from the Information era to the
Artificial Intelligence era: we no longer use raw data or information alone, but the
intelligence derived from that data, to build solutions. We need to keep ethical
practices in mind while developing solutions using AI. Let us understand some of these
ethical concerns in detail.
Bias – Bias occurs when a model generates unfair or discriminatory results. This
can happen due to the model favoring certain groups, or due to the algorithm itself.
For example, if an AI application such as Amazon’s favors male candidates only, then
most of its suggestions will be made only for male candidates, which is unfair and
will ultimately decrease the company’s profit.
Transparency – The AI decision-making process should be transparent, so that people
can easily understand and interpret the results. If there is a lack of transparency,
people will not trust the model. For example, if a person applies for a loan and the
AI model denies the application, then the AI system should make clear to the applicant
why the loan application was rejected.
Accuracy – The AI model should predict the correct result. An accurate model gives
error-free and reliable results. For example, in medicine, an AI model should diagnose
and generate accurate predictions; otherwise, a wrong diagnosis can lead to serious
harm to patients.