CH 3 Evaluating Models


EVALUATION

Model evaluation is the process of using different evaluation metrics to understand a machine learning model's performance.

An AI model gets better with constructive feedback: you build a model, get feedback from metrics, make improvements, and repeat the cycle until you achieve a desirable accuracy.

NEED FOR MODEL EVALUATION

● Model evaluation is like giving your AI model a report card: it helps you understand its strengths, weaknesses, and suitability for the task.
● This feedback loop is essential for building trustworthy and reliable AI systems.

WHAT IS TRAIN-TEST SPLIT?

▪ The train-test split is a technique for evaluating the performance of a machine learning algorithm.
▪ It can be used for any supervised learning algorithm.
▪ The procedure involves taking a dataset and dividing it into two subsets: the training dataset and the testing dataset.
▪ The train-test procedure is appropriate when a sufficiently large dataset is available.

WHY DO WE NEED TO DO TRAIN-TEST SPLIT?


▪ The train dataset is used to make the model learn.
▪ The input elements of the test dataset are provided to the trained model. The model makes predictions, and the predicted values are compared to the expected values.
▪ The objective is to estimate the performance of the machine learning model on new data: data not used to train the model (see the sketch below).
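As an illustration, here is a minimal Python sketch of a train-test split using scikit-learn's train_test_split. The toy data and the 80/20 split ratio are assumptions for demonstration, not fixed rules:

from sklearn.model_selection import train_test_split

# Hypothetical toy dataset: X holds input features, y holds class labels
X = [[1], [2], [3], [4], [5], [6], [7], [8], [9], [10]]
y = [0, 0, 0, 0, 0, 1, 1, 1, 1, 1]

# Reserve 20% of the rows for testing; the model never sees them during training
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

print(len(X_train), "training examples,", len(X_test), "testing examples")

The model would then be fitted on X_train/y_train and scored on X_test/y_test, so the reported performance reflects data the model has not seen.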

ACCURACY : Accuracy is an evaluation metric that measures the proportion of predictions a model gets right.

The accuracy of the model and its performance are directly proportional, so a better-performing model will have more accurate
predictions. The goal is to maximize accuracy.

ERROR : Error is the extent to which a model's prediction is inaccurate or wrong.

In machine learning, error is used to see how accurately a model can predict new, unseen data.

Error refers to the difference between a model's prediction and the actual outcome, and it quantifies how often the model
makes mistakes. The goal is to minimize error.
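A hypothetical worked example: if a classifier answers 85 of 100 test questions correctly, its accuracy is 85/100 = 0.85 (85%) and its error rate is 15/100 = 0.15 (15%). For classification, error rate = 1 − accuracy, so minimizing one maximizes the other.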

WHAT IS CLASSIFICATION? : Classification is a problem where a specific class label is the result to be predicted from a given input. Example: a vegetable-grocery classifier model that predicts whether an item is a vegetable or a grocery item.

CLASSIFICATION METRICS (4)

1) Confusion Matrix:
The confusion matrix is a table used to visualize the performance of a classification model. It helps you understand where the model is succeeding and where it is making errors, and it is a handy summary of a model's correct and incorrect predictions. Its four cells are described below (see the sketch after the list).

● True Positive (TP): The model correctly predicted the positive class. For example, a spam filter correctly identifies a spam email as "spam."
● True Negative (TN): The model correctly predicted the negative class. For example, the spam filter correctly identifies a legitimate email as "not spam."
● False Positive (FP): The model incorrectly predicted the positive class. This is also known as a Type I Error. For example, the spam filter incorrectly flags a legitimate email as "spam" (a "false alarm").
● False Negative (FN): The model incorrectly predicted the negative class. This is also known as a Type II Error. For example, the spam filter incorrectly flags a spam email as "not spam" (a "missed case").
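A minimal sketch of computing these four counts with scikit-learn's confusion_matrix, using invented spam-filter labels (1 = spam is the positive class, 0 = not spam; the data below is hypothetical):

from sklearn.metrics import confusion_matrix

y_true = [1, 1, 1, 0, 0, 0, 0, 1]  # what the emails actually are
y_pred = [1, 0, 1, 0, 0, 1, 0, 1]  # what the model predicted

# scikit-learn lays the matrix out as [[TN, FP], [FN, TP]]
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print("TP:", tp, "TN:", tn, "FP:", fp, "FN:", fn)  # TP: 3 TN: 3 FP: 1 FN: 1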
2) Accuracy :
Accuracy is the most intuitive and widely used classification metric. It measures the proportion of correct predictions (both true
positives and true negatives) out of the total number of predictions made by the model.

Formula: Accuracy = (TP + TN) / (TP + TN + FP + FN)

Key limitation: accuracy is suitable only when there is a roughly equal number of observations in each class. When dealing with unbalanced datasets, where the number of observations in each class is not equal, other classification metrics like precision, recall, and F1 score are recommended.
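A hypothetical worked example (these spam-filter counts are invented for illustration and are reused for the metrics below): with TP = 40, TN = 45, FP = 5 and FN = 10, Accuracy = (40 + 45) / (40 + 45 + 5 + 10) = 85/100 = 0.85, i.e. 85% of the 100 emails were classified correctly.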

3) Precision :
It is the ratio of the number of correctly classified positive examples (TP) to the total number of examples predicted as positive (TP + FP).
It focuses on the quality of positive predictions. It is generally used for unbalanced datasets.

Formula: Precision = TP / (TP + FP)

Example: In a medical test for a rare disease, high precision is crucial. A high precision means that when the test says a patient
has the disease, they are very likely to actually have it, avoiding unnecessary stress and treatment for healthy people.
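Using the same hypothetical counts as above (TP = 40, FP = 5): Precision = 40 / (40 + 5) = 40/45 ≈ 0.89, meaning about 89% of the items the model labelled positive really were positive.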

4) Recall :
It is the measure of how well the model identifies true positives: the ratio of correctly classified positive examples (TP) to the total number of actual positive examples (TP + FN). It focuses on the model's ability to find all positive instances.
It is generally used for unbalanced datasets.

Formula: Recall = TP / (TP + FN)

Example: In a system for detecting fraud, high recall is critical. A high recall means the system is able to catch most of the
fraudulent transactions, even if it flags a few legitimate ones by mistake.
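Using the same hypothetical counts (TP = 40, FN = 10): Recall = 40 / (40 + 10) = 40/50 = 0.80, meaning the model found 80% of all actual positives.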

◉ F1 Score :
The F1 Score is the harmonic mean of Precision and Recall. It provides a single score that balances both metrics.

Why F1 is often a better selection among evaluation metrics: when the data is unbalanced and we cannot decide whether false positives or false negatives matter more, the F1 score is the suitable metric.

Formula: F1 Score = 2 * (Precision * Recall) / (Precision + Recall)

Example: An email spam filter needs to be both precise (not flagging legitimate emails as spam) and have high recall (not
letting spam into the inbox). The F1 score helps evaluate if the model is doing a good job on both fronts.
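Completing the hypothetical example (Precision ≈ 0.89, Recall = 0.80): F1 Score = 2 * (0.89 * 0.80) / (0.89 + 0.80) ≈ 0.84, a single score reflecting that the model does reasonably well on both fronts.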

ETHICAL CONCERNS IN MODEL EVALUATION :

1) Bias :
Ensure that the chosen evaluation metrics do not result in any kind of bias.

2) Transparency :
Give an honest explanation of how the chosen evaluation metrics work and produce results, without keeping any information hidden.

3) Accountability :
Take responsibility for your choice of metrics and evaluation methodology in case any user faces a disadvantage because of it.
