Evaluation
Underfitting and Overfitting
Overfitting
Underfitting: when model is too simple, both training and test errors are large
Overfitting: when model is too complex, training error is small but test error is large
Overfitting due to Noise
The decision boundary is distorted by a noise point
Overfitting due to Insufficient Examples
Lack of data points in the lower half of the diagram makes it difficult to correctly
predict the class labels of that region
– An insufficient number of training records in the region causes the decision tree
to predict the test examples using other training records that are irrelevant to
the classification task
Underfitting and Overfitting (Example)
Two-class problem:
+ class: 5200 instances
  • 5000 instances generated from a Gaussian centered at (10,10)
  • 200 noisy instances added
o class: 5200 instances
  • Generated from a uniform distribution
10% of the data used for training and 90% used for testing
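A minimal sketch of how a synthetic data set like this could be generated and split, assuming NumPy and scikit-learn; the spread of the Gaussian, the range of the uniform distribution, and the form of the noise are assumptions, not taken from the original figures.

```python
import numpy as np
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)

# '+' class: 5000 Gaussian instances centered at (10, 10) plus 200 noisy instances
# (unit variance and the noise range are assumptions)
X_plus = np.vstack([
    rng.normal(loc=10.0, scale=1.0, size=(5000, 2)),
    rng.uniform(low=0.0, high=20.0, size=(200, 2)),
])

# 'o' class: 5200 instances from a uniform distribution (range assumed)
X_o = rng.uniform(low=0.0, high=20.0, size=(5200, 2))

X = np.vstack([X_plus, X_o])
y = np.array([1] * len(X_plus) + [0] * len(X_o))

# 10% of the data used for training, 90% for testing
X_train, X_test, y_train, y_test = train_test_split(
    X, y, train_size=0.1, stratify=y, random_state=0
)
```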
Increasing number of nodes in Decision Trees
Decision Tree with 4 nodes vs. Decision Tree with 50 nodes
(Figures: each decision tree and its decision boundaries on the training data)
Which tree is better?
Model Overfitting
Underfitting: when model is too simple, both training and test errors are large
Overfitting: when model is too complex, training error is small but test error is large
Model Overfitting
Using twice the number of data instances:
• If the training data is under-representative, test error increases while training
  error decreases as the number of nodes increases
• Increasing the size of the training data reduces the difference between training
  and test errors at a given number of nodes
Model Overfitting
(Figures: decision boundaries of two 50-node decision trees, one trained on the original data and one using twice the number of data instances)
Using twice the number of data instances
• If training data is under-representative, testing errors increase and training
errors decrease on increasing number of nodes
• Increasing the size of training data reduces the difference between training and
testing errors at a given number of nodes
Notes on Overfitting
• Overfitting results in decision trees that are more complex than necessary
• Training error no longer provides a good estimate of how well the tree will
perform on previously unseen records
• Need new ways for estimating errors
How to Address Overfitting
• Pre-Pruning (Early Stopping Rule)
– Stop the algorithm before it becomes a fully-grown tree
– Typical stopping conditions for a node:
• Stop if all instances belong to the same class
• Stop if all the attribute values are the same
– More restrictive conditions:
  • Stop if the number of instances is less than some user-specified threshold
  • Stop if the class distribution of the instances is independent of the available
    features (e.g., using the χ2 test)
  • Stop if expanding the current node does not improve impurity measures
    (e.g., Gini index or information gain)
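As a hedged sketch, scikit-learn exposes similar stopping conditions as hyperparameters of DecisionTreeClassifier; the thresholds below are assumptions, and X_train/y_train come from the earlier data-generation sketch.

```python
from sklearn.tree import DecisionTreeClassifier

# Pre-pruning: stop growing the tree early via stopping conditions.
#   min_samples_split     ~ "stop if the number of instances is below a threshold"
#   min_impurity_decrease ~ "stop if expanding does not improve the impurity measure"
# (scikit-learn has no built-in chi-square stopping rule; thresholds below are assumptions)
pre_pruned = DecisionTreeClassifier(
    criterion="gini",
    min_samples_split=20,
    min_impurity_decrease=1e-3,
    random_state=0,
)
pre_pruned.fit(X_train, y_train)
print("number of nodes:", pre_pruned.tree_.node_count)
```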
Model Selection for Decision Trees
• Post-pruning
– Grow the decision tree in its entirety
– Subtree replacement
• Trim the nodes of the decision tree in a bottom-up fashion
• If generalization error improves after trimming, replace sub-tree
by a leaf node
• Class label of leaf node is determined from majority class of
instances in the sub-tree
– Subtree raising
• Replace subtree with most frequently used branch
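scikit-learn does not implement subtree replacement or subtree raising directly; as a rough stand-in, the sketch below grows a full tree and then applies minimal cost-complexity pruning, which also collapses subtrees bottom-up. X_train/X_test come from the earlier sketch; using the test split to pick the pruning level is a simplification, and a separate validation set would normally be used.

```python
from sklearn.tree import DecisionTreeClassifier

# Grow the tree to its entirety, then trim it; minimal cost-complexity pruning is used
# here as a stand-in for subtree replacement (larger ccp_alpha collapses more subtrees).
full_tree = DecisionTreeClassifier(random_state=0).fit(X_train, y_train)
path = full_tree.cost_complexity_pruning_path(X_train, y_train)

# Pick the pruning level with the best held-out accuracy
# (a separate validation set would normally be used instead of the test split).
best_tree, best_score = full_tree, full_tree.score(X_test, y_test)
for alpha in path.ccp_alphas:
    pruned = DecisionTreeClassifier(ccp_alpha=alpha, random_state=0).fit(X_train, y_train)
    score = pruned.score(X_test, y_test)
    if score > best_score:
        best_tree, best_score = pruned, score
```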
Examples of Post-pruning
Model Evaluation
• Metrics for Performance Evaluation
– How to evaluate the performance of a model?
• Methods for Performance Evaluation
– How to obtain reliable estimates?
• Methods for Model Comparison
– How to compare the relative performance among competing models?
Metrics for Performance Evaluation
• Focus on the predictive capability of a model
– Rather than how fast it can classify or build models, its scalability, etc.
• Confusion Matrix (rows = actual class, columns = predicted class):

                       Predicted Class=Yes   Predicted Class=No
    Actual Class=Yes          a (TP)                b (FN)
    Actual Class=No           c (FP)                d (TN)

  a: TP (true positive), b: FN (false negative), c: FP (false positive), d: TN (true negative)
Metrics for Performance Evaluation…
                       Predicted Class=Yes   Predicted Class=No
    Actual Class=Yes          a (TP)                b (FN)
    Actual Class=No           c (FP)                d (TN)

• Most widely-used metric:

  Accuracy = (a + d) / (a + b + c + d) = (TP + TN) / (TP + TN + FP + FN)
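A small sketch computing the four counts and accuracy from predictions with scikit-learn's confusion_matrix, reusing the pruned tree and test split from the earlier sketches; the label ordering is chosen so the matrix matches the layout above (class 1 taken as "Yes").

```python
from sklearn.metrics import confusion_matrix

y_pred = best_tree.predict(X_test)  # pruned tree from the previous sketch

# With labels=[1, 0] the rows/columns follow the slide layout:
# rows = actual (Yes, No), columns = predicted (Yes, No)
(a, b), (c, d) = confusion_matrix(y_test, y_pred, labels=[1, 0])
tp, fn, fp, tn = a, b, c, d

accuracy = (tp + tn) / (tp + tn + fp + fn)
print("accuracy:", accuracy)
```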
Limitation of Accuracy
• Consider a 2-class problem
– Number of Class 0 examples = 9990
– Number of Class 1 examples = 10
• If model predicts everything to be class 0, accuracy is 9990/10000 = 99.9%
– Accuracy is misleading because model does not detect any class 1
example
Performance metrics
                       Predicted Class=Yes   Predicted Class=No
    Actual Class=Yes          a (TP)                b (FN)
    Actual Class=No           c (FP)                d (TN)
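From the same four counts, rate-based metrics can be derived; the sketch below computes the true positive and false positive rates used in the ROC discussion later, plus precision, recall, and F1 as assumed, commonly used additions (reusing a, b, c, d from the previous sketch).

```python
# Rate-based metrics from the counts a (TP), b (FN), c (FP), d (TN) above.
tpr = a / (a + b)          # true positive rate = sensitivity = recall
fpr = c / (c + d)          # false positive rate = 1 - specificity
precision = a / (a + c)    # assumed addition: fraction of predicted positives that are correct
recall = tpr
f1 = 2 * precision * recall / (precision + recall)  # assumed addition: harmonic mean
```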
Cost Matrix
                       Predicted Class=Yes   Predicted Class=No
    Actual Class=Yes        C(Yes|Yes)            C(No|Yes)
    Actual Class=No         C(Yes|No)             C(No|No)

C(i|j): cost of misclassifying a class j example as class i
Computing Cost of Classification
Cost matrix C(i|j):
                 Predicted +   Predicted -
    Actual +         -1            100
    Actual -          1              0

Model M1:
                 Predicted +   Predicted -
    Actual +        150            40
    Actual -         60           250
    Accuracy = 80%, Cost = 3910

Model M2:
                 Predicted +   Predicted -
    Actual +        250            45
    Actual -          5           200
    Accuracy = 90%, Cost = 4255
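A short sketch reproducing the cost computation above: the element-wise product of the cost matrix with each model's confusion counts, summed, gives 3910 for M1 and 4255 for M2.

```python
import numpy as np

# Cost matrix C(i|j): rows = actual class (+, -), columns = predicted class (+, -)
C = np.array([[-1, 100],
              [ 1,   0]])

# Confusion counts for the two models (same row/column convention)
M1 = np.array([[150,  40],
               [ 60, 250]])
M2 = np.array([[250,  45],
               [  5, 200]])

cost_m1 = (C * M1).sum()           # -1*150 + 100*40 + 1*60 + 0*250 = 3910
cost_m2 = (C * M2).sum()           # -1*250 + 100*45 + 1*5  + 0*200 = 4255
acc_m1 = np.trace(M1) / M1.sum()   # (150 + 250) / 500 = 0.80
acc_m2 = np.trace(M2) / M2.sum()   # (250 + 200) / 500 = 0.90
```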
Cost vs Accuracy
Count matrix (N = a + b + c + d):
                       Predicted Class=Yes   Predicted Class=No
    Actual Class=Yes           a                     b
    Actual Class=No            c                     d

Cost matrix:
                       Predicted Class=Yes   Predicted Class=No
    Actual Class=Yes           p                     q
    Actual Class=No            q                     p

Accuracy is proportional to cost if
  1. C(Yes|No) = C(No|Yes) = q
  2. C(Yes|Yes) = C(No|No) = p

  Accuracy = (a + d) / N
  Cost     = p (a + d) + q (b + c)
           = p (a + d) + q (N - a - d)
           = q N - (q - p)(a + d)
           = N [q - (q - p) Accuracy]
Model Evaluation
• Metrics for Performance Evaluation
– How to evaluate the performance of a model?
• Methods for Performance Evaluation
– How to obtain reliable estimates?
• Methods for Model Comparison
– How to compare the relative performance among competing models?
Model Evaluation
• Purpose:
– To estimate performance of classifier on previously
unseen data (test set)
• Holdout
– Reserve k% for training and (100-k)% for testing
– Random subsampling: repeated holdout
• Cross validation
– Partition data into k disjoint subsets
– k-fold: train on k-1 partitions, test on the remaining
one
– Leave-one-out: k=n
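The holdout and random-subsampling procedures above could be sketched with scikit-learn as follows (reusing X and y from the earlier synthetic-data sketch; the 70/30 split and 10 repetitions are assumptions).

```python
from sklearn.model_selection import train_test_split, ShuffleSplit, cross_val_score
from sklearn.tree import DecisionTreeClassifier

# Holdout: reserve k% for training and (100-k)% for testing (k = 70 assumed here)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, train_size=0.7, random_state=0)
holdout_acc = DecisionTreeClassifier(random_state=0).fit(X_tr, y_tr).score(X_te, y_te)

# Random subsampling: repeat the holdout several times and average the estimates
splits = ShuffleSplit(n_splits=10, train_size=0.7, random_state=0)
repeated_acc = cross_val_score(DecisionTreeClassifier(random_state=0), X, y, cv=splits)
print(holdout_acc, repeated_acc.mean())
```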
Cross-validation Example
• 3-fold cross-validation
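A minimal 3-fold cross-validation sketch, again assuming scikit-learn and the X, y arrays from earlier; each of the three folds is held out once for testing while the model is trained on the other two.

```python
from sklearn.model_selection import KFold, cross_val_score
from sklearn.tree import DecisionTreeClassifier

# 3-fold cross-validation: partition the data into 3 disjoint subsets,
# train on 2 of them and test on the remaining one, rotating the held-out fold.
cv = KFold(n_splits=3, shuffle=True, random_state=0)
scores = cross_val_score(DecisionTreeClassifier(random_state=0), X, y, cv=cv)
print("fold accuracies:", scores, "mean:", scores.mean())
```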
Model Evaluation
• Metrics for Performance Evaluation
– How to evaluate the performance of a model?
• Methods for Performance Evaluation
– How to obtain reliable estimates?
• Methods for Model Comparison
– How to compare the relative performance among competing models?
ROC curves
• ROC = Receiver Operating Characteristic
• Started in electronic signal detection theory (1940s-1950s)
• Has become very popular in machine learning applications to assess classifiers
(Figures: distributions of the test result for the two classes; a decision threshold splits the results into "negatives" and "positives", and moving the threshold trades off true positives, false positives, true negatives, and false negatives)
ROC curve
(Figure: ROC curve plotting the true positive rate (sensitivity) against the false positive rate (1 - specificity), each running from 0% to 100%)
A good test vs. a poor test:
(Figures: ROC curves for a good test and a poor test, true positive rate against false positive rate)
Best test vs. worst test:
(Figures: for the best test the two class distributions of the test result don't overlap at all; for the worst test the distributions overlap completely)
Area under ROC curve (AUC)
• Overall measure of test performance
• Comparisons between two tests can be based on the difference between their (estimated) AUCs
• For continuous data, the AUC is equivalent to the Mann-Whitney U-statistic
  (a nonparametric test of the difference in location between two populations)
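A sketch computing the ROC curve and its AUC from classifier scores with scikit-learn (reusing the pruned tree and test split from earlier; treating class 1 as the positive class is an assumption).

```python
from sklearn.metrics import roc_curve, roc_auc_score

# Score = predicted probability of the positive class (class 1 assumed positive)
scores = best_tree.predict_proba(X_test)[:, 1]

fpr, tpr, thresholds = roc_curve(y_test, scores)  # one (FPR, TPR) point per threshold
auc = roc_auc_score(y_test, scores)               # area under the ROC curve
print("AUC:", auc)
```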
(Figures: example ROC curves with AUC = 100%, 90%, 65%, and 50%)
RECEIVER OPERATING CHARACTERISTIC
(Figures)
Significance Tests
• t-test to compare the performance estimates of two models
• ANOVA to compare the performance estimates of two or more models
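A hedged sketch of both tests on per-fold accuracies using SciPy (the models compared and the 10-fold setup are assumptions; the resampled paired t-test is known to be optimistic, so this is illustrative only).

```python
from scipy import stats
from sklearn.model_selection import KFold, cross_val_score
from sklearn.tree import DecisionTreeClassifier

cv = KFold(n_splits=10, shuffle=True, random_state=0)
acc_a = cross_val_score(DecisionTreeClassifier(max_depth=3, random_state=0), X, y, cv=cv)
acc_b = cross_val_score(DecisionTreeClassifier(random_state=0), X, y, cv=cv)

# Paired t-test on the per-fold accuracies of the two models
t_stat, p_value = stats.ttest_rel(acc_a, acc_b)

# One-way ANOVA generalizes the comparison to two or more sets of performances
f_stat, p_anova = stats.f_oneway(acc_a, acc_b)
print(p_value, p_anova)
```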