Lecture 4: Model Selection and Evaluation - MCQ Study Guide
Key Concepts Explained Simply
Model Selection
What is Model Selection? Model selection is the process of choosing the
best model from a set of candidate models. It’s like trying on different shoes to
find the pair that fits best.
Approaches to Model Selection
1. Hold-out Validation: Split data into training and validation sets
2. Cross-Validation: Split data into k folds, train on k-1 folds, test on the
remaining fold
3. Nested Cross-Validation: Cross-validation within cross-validation for
both model selection and evaluation
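For concreteness, here is a minimal hold-out validation sketch in scikit-learn. The toy dataset, the logistic-regression model, and the 80/20 split are illustrative assumptions, not part of the lecture:

```python
# Hold-out validation sketch: split once, train on one part, evaluate on the other.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=1000, random_state=0)    # toy data for the demo
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0)          # 80/20 hold-out split

model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
print("validation accuracy:", model.score(X_val, y_val))
```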
Cross-Validation Techniques
• K-fold Cross-Validation: Split data into k equal parts
• Stratified K-fold: Maintains the same class distribution in each fold
• Leave-One-Out (LOOCV): Use n-1 samples for training and 1 for testing (where n is the total number of samples)
• Leave-P-Out: Use n-p samples for training and p for testing
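The sketch below contrasts plain k-fold with stratified k-fold in scikit-learn; the imbalanced toy dataset and the logistic-regression model are assumptions made for illustration:

```python
# K-fold vs. stratified k-fold: stratification keeps the class ratio in every fold.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import KFold, StratifiedKFold, cross_val_score

X, y = make_classification(n_samples=300, weights=[0.9, 0.1], random_state=0)  # 90/10 imbalance
model = LogisticRegression(max_iter=1000)

kf_scores = cross_val_score(model, X, y, cv=KFold(n_splits=5, shuffle=True, random_state=0))
skf_scores = cross_val_score(model, X, y, cv=StratifiedKFold(n_splits=5, shuffle=True, random_state=0))
print("k-fold mean accuracy:           ", kf_scores.mean())
print("stratified k-fold mean accuracy:", skf_scores.mean())
```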
Hyperparameter Tuning
• Grid Search: Try all combinations of a predefined set of hyperparameter
values
• Random Search: Randomly sample hyperparameter values from defined
distributions
• Bayesian Optimization: Use past evaluations to guide the search for
better hyperparameters
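A minimal sketch of grid search and random search with scikit-learn; the SVM model and the parameter ranges below are illustrative assumptions, not recommended settings:

```python
# Grid search tries every combination in the grid; random search samples from distributions.
from scipy.stats import loguniform
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV, RandomizedSearchCV
from sklearn.svm import SVC

X, y = make_classification(n_samples=300, random_state=0)

grid = GridSearchCV(SVC(), {"C": [0.1, 1, 10], "gamma": [0.01, 0.1, 1]}, cv=5)
grid.fit(X, y)

rand = RandomizedSearchCV(SVC(),
                          {"C": loguniform(1e-2, 1e2), "gamma": loguniform(1e-3, 1e0)},
                          n_iter=10, cv=5, random_state=0)
rand.fit(X, y)

print("grid search best params:  ", grid.best_params_)
print("random search best params:", rand.best_params_)
```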
Model Evaluation
Classification Metrics
• Accuracy: Proportion of correct predictions
• Precision: Proportion of positive identifications that were actually correct
• Recall (Sensitivity): Proportion of actual positives that were identified
correctly
• F1 Score: Harmonic mean of precision and recall
• ROC Curve: Plot of True Positive Rate vs. False Positive Rate
• AUC (Area Under the Curve): Area under the ROC curve
• Confusion Matrix: Table showing correct and incorrect predictions
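These metrics are all available in scikit-learn; the labels and scores below are made-up values used only to show the calls:

```python
# Computing the classification metrics above for a tiny made-up example.
from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             f1_score, roc_auc_score, confusion_matrix)

y_true  = [1, 0, 1, 1, 0, 0, 1, 0]                     # actual labels
y_pred  = [1, 0, 1, 0, 0, 1, 1, 0]                     # hard predictions
y_score = [0.9, 0.2, 0.8, 0.4, 0.3, 0.6, 0.7, 0.1]     # predicted probabilities (for AUC)

print("accuracy :", accuracy_score(y_true, y_pred))
print("precision:", precision_score(y_true, y_pred))
print("recall   :", recall_score(y_true, y_pred))
print("F1 score :", f1_score(y_true, y_pred))
print("AUC      :", roc_auc_score(y_true, y_score))
print("confusion matrix:\n", confusion_matrix(y_true, y_pred))
```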
Regression Metrics
• Mean Absolute Error (MAE): Average of absolute differences between
predicted and actual values
• Mean Squared Error (MSE): Average of squared differences between
predicted and actual values
• Root Mean Squared Error (RMSE): Square root of MSE
• R-squared (Coefficient of Determination): Proportion of variance
explained by the model
• Adjusted R-squared: R-squared adjusted for the number of predictors
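A matching sketch for the regression metrics (the values are made up; RMSE is taken as the square root of MSE, as defined above):

```python
# Computing MAE, MSE, RMSE, and R-squared for a small made-up example.
import numpy as np
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score

y_true = [3.0, -0.5, 2.0, 7.0]     # actual values
y_pred = [2.5, 0.0, 2.0, 8.0]      # predicted values

mae  = mean_absolute_error(y_true, y_pred)
mse  = mean_squared_error(y_true, y_pred)
rmse = np.sqrt(mse)                # RMSE = sqrt(MSE)
r2   = r2_score(y_true, y_pred)
print(f"MAE={mae}, MSE={mse}, RMSE={rmse:.3f}, R^2={r2:.3f}")
```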
Bias-Variance Tradeoff
Understanding Bias and Variance
• Bias: Error from overly simplistic assumptions (underfitting)
• Variance: Error from sensitivity to small fluctuations in training data
(overfitting)
• Tradeoff: Reducing bias typically increases variance and vice versa
Total Error Decomposition: Total Error = Bias² + Variance + Irreducible Error
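One way to make the decomposition concrete is a small simulation: refit the same model on many independent training sets drawn from a known function, then measure at a fixed test point how far the average prediction is from the truth (bias²) and how much the predictions spread (variance). The sketch below does this with a polynomial fit; the true function, noise level, and polynomial degree are all assumptions chosen for the demo:

```python
# Rough numerical estimate of bias^2 and variance at a single test point.
import numpy as np

rng = np.random.default_rng(0)
true_f = np.sin              # assumed "true" underlying function
noise_sd = 0.3               # irreducible noise level
x_test = 1.0                 # point at which we measure bias and variance
degree = 3                   # model complexity; increasing it tends to increase variance

preds = []
for _ in range(500):                                   # many independent training sets
    x_train = rng.uniform(0, np.pi, 20)
    y_train = true_f(x_train) + rng.normal(0, noise_sd, x_train.size)
    coefs = np.polyfit(x_train, y_train, degree)       # fit a polynomial of the chosen degree
    preds.append(np.polyval(coefs, x_test))            # predict at the test point

preds = np.array(preds)
bias_sq = (preds.mean() - true_f(x_test)) ** 2         # (average prediction - truth)^2
variance = preds.var()                                 # spread of predictions across datasets
print(f"bias^2 ~ {bias_sq:.4f}, variance ~ {variance:.4f}, irreducible ~ {noise_sd**2:.4f}")
```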
How to Balance Bias and Variance
• High Bias (Underfitting): Use more complex models, add features
• High Variance (Overfitting): Use simpler models, add regularization,
get more training data
Regularization
What is Regularization? Regularization is a technique to prevent overfitting by adding a penalty term to the loss function. It’s like adding weight to a seesaw to keep it balanced.
Types of Regularization
• L1 Regularization (Lasso): Adds the sum of absolute values of coefficients to the loss function
– Can lead to sparse models (feature selection)
– Formula: Loss + λ × Σ|w_i|
• L2 Regularization (Ridge): Adds the sum of squared values of coefficients to the loss function
– Shrinks coefficients towards zero but rarely to exactly zero
– Formula: Loss + λ × Σ(w_i)²
• Elastic Net: Combination of L1 and L2
– Formula: Loss + λ₁ × Σ|w_i| + λ₂ × Σ(w_i)²
Regularization Parameter (λ)
• Controls the strength of regularization
• Higher λ = stronger regularization = simpler model
• Lower λ = weaker regularization = more complex model
• Optimal λ is typically found through cross-validation
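The sketch below fits Lasso and Ridge on the same toy data so the difference in their coefficients is visible; note that scikit-learn calls the regularization strength alpha rather than λ, and the dataset and alpha value are illustrative assumptions:

```python
# L1 (Lasso) tends to zero out coefficients; L2 (Ridge) shrinks them smoothly.
from sklearn.datasets import make_regression
from sklearn.linear_model import Lasso, Ridge

X, y = make_regression(n_samples=100, n_features=10, n_informative=3,
                       noise=5.0, random_state=0)

lasso = Lasso(alpha=1.0).fit(X, y)   # many coefficients end up exactly zero
ridge = Ridge(alpha=1.0).fit(X, y)   # coefficients shrink but rarely reach zero
print("Lasso coefficients:", lasso.coef_.round(2))
print("Ridge coefficients:", ridge.coef_.round(2))
```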
Ensemble Methods
What are Ensemble Methods? Ensemble methods combine multiple models to improve performance. It’s like asking multiple experts and taking their collective wisdom.
Types of Ensemble Methods
• Bagging (Bootstrap Aggregating):
– Train multiple models on random subsets of the data
– Combine by averaging (regression) or voting (classification)
– Example: Random Forest
• Boosting:
– Train models sequentially, each focusing on errors of previous models
– Combine by weighted voting
– Examples: AdaBoost, Gradient Boosting
• Stacking:
– Train multiple models and use their predictions as inputs to a meta-
model
– Meta-model learns how to best combine the predictions
Popular Ensemble Algorithms
• Random Forest: Ensemble of decision trees using bagging
• AdaBoost: Boosts weak learners by focusing on misclassified instances
• Gradient Boosting: Builds trees sequentially to correct errors
• XGBoost: Optimized implementation of gradient boosting
• Voting Classifier/Regressor: Combines different types of models
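A short scikit-learn sketch comparing a bagging-style ensemble, a boosting-style ensemble, and a stacked ensemble on the same toy data (the models and dataset are assumptions for illustration):

```python
# Cross-validated accuracy of three ensemble styles on one toy problem.
from sklearn.datasets import make_classification
from sklearn.ensemble import (RandomForestClassifier, GradientBoostingClassifier,
                              StackingClassifier)
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=500, random_state=0)

models = {
    "random forest (bagging)": RandomForestClassifier(random_state=0),
    "gradient boosting":       GradientBoostingClassifier(random_state=0),
    "stacking (RF + GB -> logistic regression)": StackingClassifier(
        estimators=[("rf", RandomForestClassifier(random_state=0)),
                    ("gb", GradientBoostingClassifier(random_state=0))],
        final_estimator=LogisticRegression(max_iter=1000)),
}
for name, model in models.items():
    print(name, cross_val_score(model, X, y, cv=5).mean())
```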
Feature Selection
Why Feature Selection?
• Reduces overfitting
• Improves model performance
• Reduces training time
• Makes models more interpretable
Feature Selection Methods
• Filter Methods: Select features based on statistical measures
– Correlation
– Chi-square test
– Information gain
• Wrapper Methods: Use a model to evaluate feature subsets
– Recursive Feature Elimination (RFE)
– Forward/Backward selection
• Embedded Methods: Feature selection as part of model training
– Lasso regression
– Decision trees
– Random Forest feature importance
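The sketch below shows one filter method and one wrapper method from scikit-learn on a toy dataset; the choice of k=5 features and the logistic-regression base model are assumptions for the demo:

```python
# Filter method (SelectKBest) vs. wrapper method (RFE) for feature selection.
from sklearn.datasets import make_classification
from sklearn.feature_selection import RFE, SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=300, n_features=20, n_informative=5, random_state=0)

filter_sel = SelectKBest(score_func=f_classif, k=5).fit(X, y)                        # statistical filter
rfe_sel = RFE(LogisticRegression(max_iter=1000), n_features_to_select=5).fit(X, y)   # model-based wrapper

print("filter keeps features:", filter_sel.get_support(indices=True))
print("RFE keeps features:   ", rfe_sel.get_support(indices=True))
```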
MCQ Practice Questions
Question 1
Which cross-validation technique is most appropriate when dealing with imbalanced classes?
A) K-fold Cross-Validation
B) Leave-One-Out Cross-Validation
C) Stratified K-fold Cross-Validation
D) Random Subsampling
Answer: C) Stratified K-fold Cross-Validation
Explanation: Stratified K-fold ensures that each fold has the same proportion
of classes as the original dataset, which is crucial for imbalanced datasets to
avoid bias in the validation process.
Question 2
Which regularization technique can reduce coefficients to exactly zero, effectively performing feature selection?
A) L1 Regularization (Lasso)
B) L2 Regularization (Ridge)
C) Dropout
D) Batch Normalization
Answer: A) L1 Regularization (Lasso)
Explanation: L1 regularization adds the sum of absolute values of coefficients
to the loss function, which can shrink some coefficients to exactly zero, effectively
removing those features from the model.
Question 3
What is the main difference between bagging and boosting ensemble methods?
A) Bagging uses decision trees while boosting uses neural networks
B) Bagging trains models in parallel while boosting trains them sequentially
C) Bagging is for classification while boosting is for regression
D) Bagging requires more data than boosting
Answer: B) Bagging trains models in parallel while boosting trains them sequentially
Explanation: In bagging, multiple models are trained independently on random subsets of the data. In boosting, models are trained sequentially, with each model focusing on the errors made by previous models.
Question 4
Which metric is most appropriate for evaluating a regression model
when outliers are a concern? - A) Mean Squared Error (MSE) - B) Root
Mean Squared Error (RMSE) - C) Mean Absolute Error (MAE) - D) R-squared
Answer: C) Mean Absolute Error (MAE)
Explanation: MAE uses absolute differences rather than squared differences,
making it less sensitive to outliers compared to MSE or RMSE.
Question 5
In the bias-variance tradeoff, what happens as model complexity increases?
A) Bias increases, variance decreases
B) Bias decreases, variance increases
C) Both bias and variance increase
D) Both bias and variance decrease
Answer: B) Bias decreases, variance increases
Explanation: As model complexity increases, the model can fit the training
data better (reducing bias), but becomes more sensitive to fluctuations in the
training data (increasing variance).
Question 6
Which of the following is NOT a method for hyperparameter tuning?
A) Grid Search
B) Random Search
C) Bayesian Optimization
D) Principal Component Analysis
Answer: D) Principal Component Analysis
Explanation: Principal Component Analysis (PCA) is a dimensionality reduction technique, not a method for hyperparameter tuning.
Question 7
What does the Area Under the ROC Curve (AUC) measure?
A) The accuracy of the model
B) The probability that a randomly chosen positive instance is ranked higher than a randomly chosen negative instance
C) The precision of the model
D) The recall of the model
Answer: B) The probability that a randomly chosen positive instance is ranked
higher than a randomly chosen negative instance
Explanation: AUC represents the probability that the model will rank a randomly chosen positive instance higher than a randomly chosen negative instance, making it a measure of the model’s ability to discriminate between classes.
Question 8
Which ensemble method is MOST likely to reduce bias in a model?
A) Bagging
B) Boosting
C) Stacking
D) Voting with identical models
Answer: B) Boosting
Explanation: Boosting trains models sequentially, with each model concentrating on the errors made by the previous ones, which is why it is particularly effective at reducing bias.
Calculation Problems
Problem 1: Cross-Validation
You have a dataset with 1000 instances and want to perform 5-fold
cross-validation. How many instances will be used for training and
testing in each fold?
Solution:
- Total instances: 1000
- Number of folds: 5
- Testing instances per fold: 1000 ÷ 5 = 200
- Training instances per fold: 1000 - 200 = 800
Therefore, each fold will use 800 instances for training and 200 for testing.
Problem 2: Confusion Matrix Metrics
A classification model produces the following confusion matrix for a binary classification problem:
- True Positives (TP): 120
- False Positives (FP): 30
- False Negatives (FN): 20
- True Negatives (TN): 130
Calculate the accuracy, precision, recall, F1 score, and specificity.
Solution:
- Accuracy = (TP + TN) / (TP + TN + FP + FN) = (120 + 130) / (120 + 130 + 30 + 20) = 250 / 300 = 0.833 or 83.3%
- Precision = TP / (TP + FP) = 120 / (120 + 30) = 120 / 150 = 0.8 or 80%
- Recall (Sensitivity) = TP / (TP + FN) = 120 / (120 + 20) = 120 / 140 = 0.857 or 85.7%
- F1 Score = 2 × (Precision × Recall) / (Precision + Recall) = 2 × (0.8 × 0.857) / (0.8 + 0.857) = 1.372 / 1.657 = 0.828 or 82.8%
- Specificity = TN / (TN + FP) = 130 / (130 + 30) = 130 / 160 = 0.813 or 81.3%
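The numbers above can be double-checked with a few lines of plain Python arithmetic:

```python
# Re-deriving the Problem 2 metrics from the confusion-matrix counts.
TP, FP, FN, TN = 120, 30, 20, 130

accuracy    = (TP + TN) / (TP + TN + FP + FN)
precision   = TP / (TP + FP)
recall      = TP / (TP + FN)
f1          = 2 * precision * recall / (precision + recall)
specificity = TN / (TN + FP)
print(accuracy, precision, recall, f1, specificity)
# ~ 0.833, 0.800, 0.857, 0.828, 0.8125 (matching the worked solution, up to rounding)
```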
Problem 3: Regularization Effect
In a linear regression model with two features, the unregularized coefficients are w₁ = 5 and w₂ = -3. If L2 regularization with λ = 0.1 is applied, what will be the regularization penalty term added to the loss function?
Solution:
L2 regularization penalty = λ × (w₁² + w₂²)
= 0.1 × (5² + (-3)²)
= 0.1 × (25 + 9)
= 0.1 × 34
= 3.4
Problem 4: R-squared Calculation
A regression model produces the following predictions and actual values:
- Predicted: [12, 15, 18, 11, 20]
- Actual: [10, 14, 17, 13, 22]
Calculate the R-squared value.
Solution:
First, calculate the mean of the actual values:
Mean(y) = (10 + 14 + 17 + 13 + 22) / 5 = 76 / 5 = 15.2
Then, calculate the total sum of squares (TSS):
TSS = Σ(y_i - mean(y))² = (10 - 15.2)² + (14 - 15.2)² + (17 - 15.2)² + (13 - 15.2)² + (22 - 15.2)²
= 27.04 + 1.44 + 3.24 + 4.84 + 46.24 = 82.8
Next, calculate the residual sum of squares (RSS):
RSS = Σ(y_i - ŷ_i)² = (10 - 12)² + (14 - 15)² + (17 - 18)² + (13 - 11)² + (22 - 20)²
= 4 + 1 + 1 + 4 + 4 = 14
Finally, calculate R-squared:
R² = 1 - (RSS / TSS) = 1 - (14 / 82.8) = 1 - 0.169 = 0.831 or 83.1%
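The same result can be checked with scikit-learn’s r2_score:

```python
# Verifying the Problem 4 hand calculation.
from sklearn.metrics import r2_score

actual    = [10, 14, 17, 13, 22]
predicted = [12, 15, 18, 11, 20]
print(r2_score(actual, predicted))   # ~ 0.831, matching the calculation above
```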
Key Formulas to Remember
1. Accuracy: (TP + TN) / (TP + TN + FP + FN)
2. Precision: TP / (TP + FP)
3. Recall (Sensitivity): TP / (TP + FN)
4. Specificity: TN / (TN + FP)
5. F1 Score: 2 × (Precision × Recall) / (Precision + Recall)
6. Mean Absolute Error (MAE): (1/n) × Σ|y_i - ŷ_i|
7. Mean Squared Error (MSE): (1/n) × Σ(y_i - ŷ_i)²
8. Root Mean Squared Error (RMSE): √MSE
9. R-squared: 1 - (RSS / TSS)
• RSS: Residual Sum of Squares = Σ(y_i - ŷ_i)²
• TSS: Total Sum of Squares = Σ(y_i - mean(y))²
10. L1 Regularization: Loss + λ × Σ|w_i|
11. L2 Regularization: Loss + λ × Σ(w_i)²
Tips for MCQ Questions
1. Understand evaluation metrics: Know which metrics are appropriate
for different types of problems.
2. Know the tradeoffs: Understand the bias-variance tradeoff and how
different techniques affect it.
3. Remember regularization effects: Know how L1 and L2 regularization
affect model coefficients differently.
4. Understand ensemble methods: Know the differences between bagging, boosting, and stacking.
5. Practice calculations: Be comfortable calculating common metrics from
raw data or confusion matrices.