Machine Learning Assignment-6.Sol

The document is an assignment on machine learning consisting of multiple-choice questions (Q1-Q9) and subjective questions (Q10-Q15). It covers topics such as overfitting, decision trees, ensemble techniques, regularization, and model evaluation metrics. The subjective questions require explanations on adjusted R-squared, differences between Ridge and Lasso regression, the importance of data scaling, and various metrics for assessing the goodness of fit in linear regression.


ASSIGNMENT - 6

MACHINE LEARNING

In Q1 to Q5, only one option is correct. Choose the correct option:


1. In which of the following cases can you say that the model is overfitting?
A) High R-squared value for train-set and High R-squared value for test-set.
B) Low R-squared value for train-set and High R-squared value for test-set.
C) High R-squared value for train-set and Low R-squared value for test-set.
D) None of the above
Answer: C) High R-squared value for train-set and Low R-squared value for test-set.

2. Which among the following is a disadvantage of decision trees?


A) Decision trees are prone to outliers.
B) Decision trees are highly prone to overfitting.
C) Decision trees are not easy to interpret.
D) None of the above.
Answer: B) Decision trees are highly prone to overfitting.

3. Which of the following is an ensemble technique?


A) SVM B) Logistic Regression
C) Random Forest D) Decision tree
Answer: C) Random Forest
4. Suppose you are building a classification model for detection of a fatal disease, where detecting
the disease is most important. In this case, which of the following metrics would you focus on?
A) Accuracy B) Sensitivity
C) Precision D) None of the above.
Answer: B) Sensitivity

5. The value of AUC (Area Under Curve) for the ROC curve of model A is 0.70 and of model B is
0.85. Which of these two models is doing a better job in classification?
A) Model A B) Model B
C) both are performing equal D) Data Insufficient
Answer: B) Model B

In Q6 to Q9, more than one option is correct. Choose all the correct options:
6. Which of the following are regularization techniques in Linear Regression?
A) Ridge B) R-squared
C) MSE D) Lasso
Answer: A) Ridge and D) Lasso

7. Which of the following is not an example of boosting technique?


A) Adaboost B) Decision Tree
C) Random Forest D) Xgboost.
Answer: B) Decision Tree and C) Random Forest
8. Which of the techniques are used for regularization of Decision Trees?
A) Pruning B) L2 regularization
C) Restricting the max depth of the tree D) All of the above
Answer: A) Pruning and C) Restricting the max depth of the tree

9. Which of the following statements is true regarding the Adaboost technique?


A) We initialize the probabilities of the distribution as 1/n, where n is the number of data-points
B) A tree in the ensemble focuses more on the data points on which the previous tree was not
performing well
C) It is an example of a bagging technique
D) None of the above
Answer: A) We initialize the probabilities of the distribution as 1/n, where n is the number of data points,
and B) A tree in the ensemble focuses more on the data points on which the previous tree was not
performing well.

Q10 to Q15 are subjective answer type questions. Answer them briefly.
10. Explain how the adjusted R-squared penalizes the presence of unnecessary predictors in the
model.
Answer: The adjusted R-squared is a modified version of the R-squared that penalizes
the addition of unnecessary predictors to the model. As more predictors are added to the model,
the R-squared value will typically increase, even if the predictors are not useful. The adjusted R-
squared accounts for this by adjusting the R-squared value based on the number of predictors in
the model. It is a more reliable indicator of the goodness of fit of a model and is useful in
comparing models with different numbers of predictors.
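To make the penalty concrete, the usual formula is Adj R² = 1 − (1 − R²)(n − 1)/(n − p − 1), where n is the number of observations and p the number of predictors. A minimal illustrative sketch (not part of the original assignment; the example numbers are invented):

```python
def adjusted_r2(r2, n, p):
    """Adjusted R-squared: penalizes each of the p predictors,
    given n observations and an ordinary R-squared of r2."""
    return 1 - (1 - r2) * (n - 1) / (n - p - 1)

# Same R-squared, more predictors -> lower adjusted R-squared.
print(round(adjusted_r2(0.90, n=100, p=3), 4))   # 0.8969
print(round(adjusted_r2(0.90, n=100, p=20), 4))  # 0.8747
```

Note how a model that needs 20 predictors to reach the same R-squared is scored lower than one that needs only 3.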

11. Differentiate between Ridge and Lasso Regression.


Answer: Ridge and Lasso regression are both regularization techniques used to prevent
overfitting in linear regression. Ridge regression adds a penalty term to the cost function that is
proportional to the square of the magnitude of the coefficients. This helps to shrink the
coefficients of less important predictors towards zero, but does not set any coefficients exactly to
zero. Lasso regression, on the other hand, adds a penalty term to the cost function that is
proportional to the absolute value of the coefficients. This can result in some coefficients being
set exactly to zero, effectively performing feature selection.
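The two penalty terms can be written out in a few lines of pure Python (an illustrative sketch only; `lam` stands for the regularization strength λ, and the coefficient values are invented):

```python
def ridge_penalty(coefs, lam):
    # L2 penalty: lambda * sum of squared coefficients
    return lam * sum(w ** 2 for w in coefs)

def lasso_penalty(coefs, lam):
    # L1 penalty: lambda * sum of absolute coefficients
    return lam * sum(abs(w) for w in coefs)

coefs = [3.0, -0.5, 0.0]
print(round(ridge_penalty(coefs, lam=0.1), 4))  # 0.925
print(round(lasso_penalty(coefs, lam=0.1), 4))  # 0.35
```

Because the L2 penalty squares each coefficient, small coefficients contribute almost nothing and are merely shrunk; the L1 penalty charges every nonzero coefficient in proportion to its size, which is why Lasso can drive coefficients exactly to zero.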

12. What is VIF? What is a suitable value of VIF for a feature to be included in a regression
model?
Answer: VIF stands for Variance Inflation Factor; it is a measure of how much the variance of
an estimated regression coefficient is increased because of collinearity. A high VIF indicates
that the corresponding predictor is highly correlated with one or more of the other predictors. A
VIF value of 1 indicates that there is no correlation between this predictor and any other
predictors, while a value greater than 1 indicates that there is correlation. A suitable value of VIF
for a feature to be included in a regression model is typically less than 5 or 10.
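The VIF of predictor j is VIF_j = 1 / (1 − R_j²), where R_j² comes from regressing predictor j on all the other predictors. A small sketch (illustrative R_j² values are invented):

```python
def vif(r2_j):
    """Variance Inflation Factor for predictor j, given the R-squared
    of regressing predictor j on all other predictors."""
    return 1.0 / (1.0 - r2_j)

print(round(vif(0.0), 2))   # 1.0  -> no collinearity
print(round(vif(0.8), 2))   # 5.0  -> at the common cutoff
print(round(vif(0.95), 2))  # 20.0 -> severe collinearity
```

So the common "VIF < 5" rule of thumb amounts to requiring that no more than 80% of a predictor's variance is explained by the other predictors.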

13. Why do we need to scale the data before feeding it to the model for training?
Answer: Scaling the data is important before training a model because many machine learning algorithms use
distance-based calculations, so if the data is not scaled, the algorithm will be sensitive to the scale of the
data. For example, if one feature is measured in kilometers and another feature is measured in meters, then the
algorithm will be dominated by the feature with the larger numeric values (the one measured in meters). Scaling
the data ensures that all the features are on the same scale, which leads to a fair comparison of the importance
of each feature.
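A minimal sketch of the idea with z-score standardization (invented example data; the same three distances expressed in kilometers and in meters produce identical scaled values):

```python
import statistics

def standardize(values):
    # z-score scaling: subtract the mean, divide by the standard deviation
    mu = statistics.mean(values)
    sigma = statistics.pstdev(values)
    return [(v - mu) / sigma for v in values]

km = [1.0, 2.0, 3.0]            # distances in kilometers
m = [1000.0, 2000.0, 3000.0]    # the same distances in meters

print(standardize(km))
print(standardize(m))
# Both lists contain the same z-scores, so after scaling the unit
# of measurement no longer influences distance-based algorithms.
```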

14. What are the different metrics which are used to check the goodness of fit in linear regression?
Answer: There are several metrics used to check the goodness of fit in linear regression;
some examples are:
R-squared: measures the proportion of variation in the dependent variable that is
explained by the independent variables. It ranges from 0 to 1, where 1 indicates a perfect fit.
Mean Squared Error (MSE): the average of the squared residuals; it measures the
average squared difference between the predicted values and the true values.
Root Mean Squared Error (RMSE): the square root of MSE; it gives the error in the same
unit as the response variable.
Mean Absolute Error (MAE): the mean of the absolute values of the residuals.
Adjusted R-squared: a modified version of the R-squared that penalizes the addition of
unnecessary predictors to the model.
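The metrics above can be computed directly from their definitions. A minimal sketch in pure Python (the example y values are invented):

```python
import math

def regression_metrics(y_true, y_pred):
    """Goodness-of-fit metrics computed from their definitions."""
    n = len(y_true)
    residuals = [t - p for t, p in zip(y_true, y_pred)]
    mse = sum(r ** 2 for r in residuals) / n       # mean squared error
    rmse = math.sqrt(mse)                          # same units as y
    mae = sum(abs(r) for r in residuals) / n       # mean absolute error
    mean_y = sum(y_true) / n
    ss_res = sum(r ** 2 for r in residuals)        # residual sum of squares
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)  # total sum of squares
    r2 = 1 - ss_res / ss_tot
    return {"MSE": mse, "RMSE": rmse, "MAE": mae, "R2": r2}

print(regression_metrics([3.0, 5.0, 7.0], [2.5, 5.0, 7.5]))
```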
15. From the following confusion matrix calculate sensitivity, specificity, precision, recall and accuracy.

Predicted \ Actual    True    False

True                  1000    50
False                 250     1200

Answer:
Sensitivity (also known as recall) = True Positives / (True Positives + False Negatives) =
1000 / (1000 + 250) = 0.8
Specificity = True Negatives / (True Negatives + False Positives) = 1200 / (1200 + 50) = 0.96
Precision = True Positives / (True Positives + False Positives) = 1000 / (1000 + 50) = 0.952
Recall = True Positives / (True Positives + False Negatives) = 1000 / (1000 + 250) = 0.8
Accuracy = (True Positives + True Negatives) / Total = (1000 + 1200) / (1000 + 50 + 250 + 1200) = 0.88
Note: sensitivity and recall are the same metric.
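As a quick sanity check (a sketch, not part of the original solution), the four counts from the confusion matrix can be plugged into the definitions in a few lines:

```python
# Counts read from the confusion matrix: TP = 1000, FP = 50,
# FN = 250, TN = 1200.
tp, fp, fn, tn = 1000, 50, 250, 1200

sensitivity = tp / (tp + fn)              # identical to recall
specificity = tn / (tn + fp)
precision = tp / (tp + fp)
accuracy = (tp + tn) / (tp + fp + fn + tn)

print(round(sensitivity, 3))  # 0.8
print(round(specificity, 3))  # 0.96
print(round(precision, 3))    # 0.952
print(round(accuracy, 3))     # 0.88
```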