Lecture 15: Evaluating
Classifiers (Part II)
ENGR:3110
Introduction to AI and Machine Learning in Engineering
Today’s topics
• Brief review of confusion matrices and sensitivity/specificity
• Introduction to ROC curves and the AUC metric
• Lec15 assignment (due Monday by 11:59 p.m.)
• Before class starts:
• Start JupyterLab, download/open lec15_EvaluatingClassifiersPartII.ipynb, and (if not
downloaded already) download biomechanics_train.csv, DRIVE_fundus.png, and
DRIVE_target.png. Also NEWLY download DRIVE_prob_prediction_map.png.
• Submit your completed Jupyter notebook file
(lec15_EvaluatingClassifiersPartII.ipynb) to ICON (assignment lec15)
Brief review of confusion
matrices and
sensitivity/specificity
What does it mean to say that a rapid flu
test has a sensitivity of 50-70%?
A. The test is expected to report a ‘positive’ result (i.e., say that someone
has the flu) on 50-70% of patients who actually have the flu.
B. Of all patients who test ‘positive’ on the test (i.e., of the patients for
which the test reports that they have the flu), 50-70% of them are
expected to actually have the flu.
C. The test is expected to report a ‘negative’ result (i.e., say that
someone does NOT have the flu) on 50-70% of patients who actually
do NOT have the flu.
D. Of all patients who test ‘negative’ on the test (i.e., of the patients for
which the test reports that they do NOT have the flu), 50-70% of them
are expected to actually NOT have the flu.
Also see [Link]
What does it mean to say that a rapid flu
test has a sensitivity of 50-70%?
Answer: A. The test is expected to report a ‘positive’ result (i.e., say that
someone has the flu) on 50-70% of patients who actually have the flu.
Also see [Link]
Review of sensitivity and specificity (from
the confusion matrix entries)
A. The test is expected to report a ‘positive’ result (i.e.,
say that someone has the flu) on 50-70% of patients
who actually have the flu.
→ This statement describes sensitivity: TP / (TP + FN)
Also see lec13 notebook for a review of computing
the confusion matrix, and sensitivities/specificities
Review of sensitivity and specificity (from
the confusion matrix entries)
B. Of all patients who test ‘positive’ on the test (i.e., of the
patients for which the test reports that they have the flu),
50-70% of them are expected to actually have the flu.
→ This statement describes the positive predictive value
(precision), not sensitivity: TP / (TP + FP)
Also see lec13 notebook for a review of computing
the confusion matrix, and sensitivities/specificities
Review of sensitivity and specificity (from
the confusion matrix entries)
C. The test is expected to report a ‘negative’ result (i.e.,
say that someone does NOT have the flu) on 50-70% of
patients who actually do NOT have the flu.
→ This statement describes specificity: TN / (FP + TN)
Also see lec13 notebook for a review of computing
the confusion matrix, and sensitivities/specificities
Review of sensitivity and specificity (from
the confusion matrix entries)
D Of all patients who test ‘negative’ on the test (i.e., of the
patients for which the test reports that they do NOT have
the flu), 50-70% of them are expected to actually NOT
have the flu.
(TP + TN) / (TP + FN + FP + TN)
accuracy
Also see lec13 notebook for a review of computing
the confusion matrix, and sensitivities/specificities 9
What does it mean to say that a rapid flu
test has a specificity of 90-95%?
A. The test is expected to report a ‘positive’ result (i.e., say that someone
has the flu) on 90-95% of patients who actually have the flu.
B. Of all patients who test ‘positive’ on the test (i.e., of the patients for
which the test reports that they have the flu), 90-95% of them are
expected to actually have the flu.
C. The test is expected to report a ‘negative’ result (i.e., say that
someone does NOT have the flu) on 90-95% of patients who actually
do NOT have the flu.
D. Of all patients who test ‘negative’ on the test (i.e., of the patients for
which the test reports that they do NOT have the flu), 90-95% of them
are expected to actually NOT have the flu.
Also see [Link]
What does it mean to say that a rapid flu
test has a specificity of 90-95%?
Answer: C. The test is expected to report a ‘negative’ result (i.e., say that
someone does NOT have the flu) on 90-95% of patients who actually do
NOT have the flu.
Also see [Link]
Review of sensitivity and specificity (from
the confusion matrix entries)
sensitivity → true positive rate (with respect to ACTUAL positives): TP / (TP + FN)
specificity → true negative rate (with respect to ACTUAL negatives): TN / (FP + TN)
accuracy → fraction correct: (TP + TN) / (TP + FN + FP + TN)
Also see lec13 notebook for a review of computing
the confusion matrix, and sensitivities/specificities
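As a quick numeric check of these three formulas, here is a minimal sketch with made-up confusion-matrix counts (the numbers are illustrative, not from the biomechanics data):

# Hypothetical confusion-matrix counts (illustrative only)
TP, FN, FP, TN = 35, 15, 5, 45

sensitivity = TP / (TP + FN)                 # 35/50 = 0.70 (true positive rate)
specificity = TN / (FP + TN)                 # 45/50 = 0.90 (true negative rate)
accuracy = (TP + TN) / (TP + FN + FP + TN)   # 80/100 = 0.80 (fraction correct)

print(sensitivity, specificity, accuracy)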
Introduction to ROC curves
and the AUC metric
Classifiers often have an intermediate
numerical result before determining the final
classification
Example: In k-NN classification, in the process of
determining the ‘majority class’ of the closest k
neighbors, one can also keep track of the
proportions of each class.
• Recall the iris classification problem. In the
example on the right (using two features), with
k=3, 2 of the closest points to the red star are
versicolor examples and the remaining point is
virginica. Thus, we could assign the ‘probability’ of
versicolor to be 0.67 and the ‘probability’ of
virginica to be 0.33.
• In sklearn, after training, you can obtain these
‘probability-like’ values using predict_proba rather
than predict (see lec13 notebook for
biomechanics example).
[Figure: iris scatter plot (petal_length vs. petal_width) with the query point marked at petal_length = 5.00, petal_width = 1.65]
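As a sketch of what this looks like in code (assuming the standard sklearn iris data and the same two petal features as in the figure; the exact neighbor proportions depend on the training data used):

from sklearn.datasets import load_iris
from sklearn.neighbors import KNeighborsClassifier

iris = load_iris()
X = iris.data[:, 2:4]   # petal length and petal width, as in the figure
y = iris.target

knn = KNeighborsClassifier(n_neighbors=3)
knn.fit(X, y)

query = [[5.00, 1.65]]            # the red-star point from the figure
print(iris.target_names)          # column order of the probabilities below
print(knn.predict_proba(query))   # proportions among the k=3 neighbors, e.g. ~[0, 0.67, 0.33]
print(knn.predict(query))         # the majority class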
We can vary the ‘threshold’ of these numerical results to
obtain different classification results (also see lec13
notebook)
Biomechanics data…
Suppose the ‘threshold’ is 0.5. Then we will classify
anything with an abnormal probability value >= 0.5 as
abnormal (this is also what [Link] would do).
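Here is a minimal, self-contained sketch of thresholding predict_proba output at different cutoffs. It uses a synthetic dataset standing in for the biomechanics data, so the counts it prints are illustrative only:

from sklearn.datasets import make_classification
from sklearn.neighbors import KNeighborsClassifier

# Synthetic stand-in for the biomechanics data; label 1 plays the role of 'abnormal'
X, y = make_classification(n_samples=200, n_features=6, random_state=0)
clf = KNeighborsClassifier(n_neighbors=5).fit(X, y)

# Column order of predict_proba follows clf.classes_; column 1 is class 1 here
prob_abnormal = clf.predict_proba(X)[:, 1]

for threshold in [0.0, 0.3, 0.5, 0.7, 1.01]:
    pred_abnormal = prob_abnormal >= threshold
    print(f"threshold={threshold}: {int(pred_abnormal.sum())} of {len(y)} classified abnormal")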
We can vary the ‘threshold’ of these numerical results to
obtain different classification results (also see lec13
notebook)
Suppose that the ‘threshold’ is 0.3. Then we will classify
anything with an abnormal probability value >= 0.3 as
abnormal.
Note: I colored the "Abnormal" rows yellow but left the
text label "Normal", so you can see the progression.
We can vary the ‘threshold’ of these numerical results to
obtain different classification results (also see lec13
notebook)
Suppose the ‘threshold’ is 0.0. Then we will classify
anything with an abnormal probability value >= 0.0 as
abnormal.
We can vary the ‘threshold’ of these numerical results to
obtain different classification results (also see lec13
notebook)
Suppose the ‘threshold’ is 0.0. Then we will classify
anything with an abnormal probability value >= 0.0 as
abnormal.
Question: Even though we are not showing the
actual target values in this example, what would
the sensitivity be in the case that the threshold is
0.0? What would the specificity be?
Recall that sensitivity is TP/(TP+FN) or TP/(all
actual positives).
Recall that specificity is TN/(FP+TN) or TN/(all
actual negatives).
We can vary the ‘threshold’ of these numerical results to
obtain different classification results (also see lec13
notebook)
Suppose the ‘threshold’ is 0.0. Then we will classify
anything with an abnormal probability value >= 0.0 as
abnormal.
Question: Even though we are not showing the
actual target values in this example, what would
the sensitivity be in the case that the threshold is
0.0? What would the specificity be?
Recall that sensitivity is TP/(TP+FN) or TP/(all
actual positives).
Recall that specificity is TN/(FP+TN) or TN/(all
actual negatives).
Sensitivity = 100%
Specificity = 0%
(With a threshold of 0.0, every sample is classified abnormal: every
actual positive is a TP, so sensitivity = 100%, and there are no TNs,
so specificity = 0%.)
We can vary the ‘threshold’ of these numerical results to
obtain different classification results (also see lec13
notebook)
Suppose the ‘threshold’ is 0.5. Then we will classify
anything with an abnormal probability value >= 0.5 as
abnormal (this is also what [Link] would do).
We can vary the ‘threshold’ of these numerical results to
obtain different classification results (also see lec13
notebook)
Suppose the ‘threshold’ is 0.7. Then we will classify
anything with an abnormal probability value >= 0.7 as
abnormal.
We can vary the ‘threshold’ of these numerical results to
obtain different classification results (also see lec13
notebook)
Suppose the ‘threshold’ is 1.01. Then we will classify
anything with an abnormal probability value >= 1.01 as
abnormal.
We can vary the ‘threshold’ of these numerical results to
obtain different classification results (also see lec13
notebook)
Suppose the ‘threshold’ is 1.01. Then we will classify
anything with an abnormal probability value >= 1.01 as
abnormal.
Question: Even though we are not showing the
actual target values in this example, what would
the sensitivity be in the case that the threshold is
1.01? What would the specificity be?
Recall that sensitivity is TP/(TP+FN) or TP/(all
actual positives).
Recall that specificity is TN/(FP+TN) or TN/(all
actual negatives).
We can vary the ‘threshold’ of these numerical results to
obtain different classification results (also see lec13
notebook)
Suppose the ‘threshold’ is 1.01. Then we will classify
anything with an abnormal probability value >= 1.01 as
abnormal.
Question: Even though we are not showing the
actual target values in this example, what would
the sensitivity be in the case that the threshold is
1.01? What would the specificity be?
Recall that sensitivity is TP/(TP+FN) or TP/(all
actual positives). Recall that specificity is
TN/(FP+TN) or TN/(all actual negatives).
Sensitivity = 0%
Specificity = 100%
(Nothing is classified abnormal, so there are no TPs, giving
sensitivity = 0%, and every actual negative is a TN, giving
specificity = 100%.)
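Continuing the synthetic sketch from above, we can confirm both extreme cases (and the 0.5 default) by computing the confusion matrix at each threshold:

from sklearn.metrics import confusion_matrix

for threshold in [0.0, 0.5, 1.01]:
    pred = (prob_abnormal >= threshold).astype(int)
    tn, fp, fn, tp = confusion_matrix(y, pred, labels=[0, 1]).ravel()
    sensitivity = tp / (tp + fn)   # 100% at threshold 0.0; 0% at 1.01
    specificity = tn / (fp + tn)   # 0% at threshold 0.0; 100% at 1.01
    print(f"threshold={threshold}: sensitivity={sensitivity:.0%}, specificity={specificity:.0%}")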
A Receiver Operating Characteristic (ROC) curve
plots sensitivity versus 1-specificity (across the
various threshold values)
[ROC curves: biomechanics example (left) and image-segmentation example (right)]
A Receiver Operating Characteristic (ROC) curve
plots sensitivity versus 1-specificity (across the
various threshold values)
[The same two ROC curves (biomechanics and image-segmentation examples), each with one operating point marked: Threshold = ?]
A Receiver Operating Characteristic (ROC) curve
plots sensitivity versus 1-specificity (across the
various threshold values)
[The same two ROC curves (biomechanics and image-segmentation examples); the marked operating point on each is Threshold = 0]
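In sklearn, metrics.roc_curve does the threshold sweep for us, returning the (1-specificity, sensitivity) pairs directly. A minimal sketch, continuing the synthetic example from above:

import matplotlib.pyplot as plt
from sklearn import metrics

# fpr is 1 - specificity (false positive rate); tpr is sensitivity (true positive rate)
fpr, tpr, thresholds = metrics.roc_curve(y, prob_abnormal)

plt.plot(fpr, tpr)
plt.xlabel("1 - specificity (false positive rate)")
plt.ylabel("sensitivity (true positive rate)")
plt.title("ROC curve")
plt.show()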
AUC: Area-Under-the-(ROC)-Curve
[ROC curves: AUC = 0.96 for the biomechanics example; AUC = 0.99 for the image-segmentation example]
from sklearn import metrics

# AUC from the true labels and the predicted ‘abnormal’ probabilities
auc = metrics.roc_auc_score(actual_abnormal, predictions_abnormal_prob)
Threshold
In sklearn, a low threshold means moving the PredictionBar all the way to
the right of the confusion matrix: we make everything positive. This occurs
in the top-right corner of the ROC graph.
A high threshold means we make many things negative—our PredictionBar
is slammed all the way left in the confusion matrix. This is what happens in
the bottom-left corner of the ROC graph.
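Continuing the sketch above, we can check that the arrays returned by roc_curve end at exactly these two corners:

# roc_curve orders its output from the highest threshold to the lowest, so the
# first point is 'everything negative' and the last is 'everything positive'
print(fpr[0], tpr[0])    # 0.0 0.0 -> bottom-left corner (very high threshold)
print(fpr[-1], tpr[-1])  # 1.0 1.0 -> top-right corner (very low threshold)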
Threshold versus Confusion Matrix
Moving the threshold to the right
(on a traditional number line)…
Threshold versus Confusion Matrix

Confusion matrix:
            Predicted
             P    N
Actual  P   TP   FN
        N   FP   TN

Moving the threshold to the right (on a traditional number line)…
… is analogous to sliding the division to the left on the CM
(higher threshold, fewer TPs).
Threshold versus Confusion Matrix

Confusion matrix:
            Predicted
             P    N
Actual  P   TP   FN
        N   FP   TN

Moving the threshold to the right (increasing it, on a traditional
number line)…
… is analogous to sliding the division to the LEFT on the CM
(higher threshold, fewer TPs)
… and to moving to the LEFT on the ROC curve (higher threshold,
fewer TPs).
[ROC curve from the image-segmentation example, with the
Threshold > 1 point marked at the bottom-left corner]
Vice-versa

Confusion matrix:
            Predicted
             P    N
Actual  P   TP   FN
        N   FP   TN

Moving the threshold to the left (decreasing it, on a traditional
number line)…
… is analogous to sliding the division to the RIGHT on the CM
(lower threshold, more TPs)
… and to moving to the RIGHT on the ROC curve (lower threshold,
more TPs).
[ROC curve from the image-segmentation example, with the
Threshold = 0 point marked at the top-right corner]
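One more check on the synthetic sketch from earlier: as the threshold increases, the predicted-positive column of the confusion matrix shrinks (fewer TPs and FPs), which is exactly the LEFT-ward slide on the CM and on the ROC curve described above:

for threshold in [0.0, 0.25, 0.5, 0.75, 1.01]:
    pred = (prob_abnormal >= threshold).astype(int)
    tn, fp, fn, tp = confusion_matrix(y, pred, labels=[0, 1]).ravel()
    print(f"threshold={threshold}: TP={tp}, FP={fp}, FN={fn}, TN={tn}")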