0% found this document useful (0 votes)

15 views16 pages

Advance Machine Learning

The document discusses various aspects of Advance Machine Learning, focusing on regression analysis, assumptions of linear regression, regularization techniques, data leakage, and hyperparameter tuning. It explains key concepts such as TSS, RSS, polynomial regression, bias-variance tradeoff, and methods for detecting and mitigating data leakage. Additionally, it compares Grid Search CV and Randomized Search CV for hyperparameter tuning, highlighting their significance in optimizing machine learning models.

Uploaded by

atharvakhobrekar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views16 pages

Advance Machine Learning

Uploaded by

atharvakhobrekar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 16

Advance Machine Learning [AML]

2.1 Regression Analysis

a) Discuss the significance of Regression Analysis. What

is the statistical connection with Regression Analysis?
Ans –
Regression analysis is a powerful statistical tool used to
understand the relationship between two or more
variables.
It helps in predicting the value of one variable based on
the values of other variables.
The core idea is to find a mathematical equation that
best fits the data, allowing us to make predictions or
understand the impact of one variable on another.
In simpler terms, regression analysis helps us uncover
patterns and relationships in data, making it invaluable
in fields like economics, finance, psychology, and many
others.

b) Explain the terms TSS, RSS, ESS, F-statistic, Prob(F-

statistic), R-squared, Adjusted R-squared, T-Statistic,
and Confidence Intervals for Coefficients in the context
of Regression Analysis.
Ans –
1. TSS (Total Sum of Squares): The total amount of
variation in the dependent variable (Y) explained by the
regression model.

2. RSS (Residual Sum of Squares): The total amount of

unexplained variation in the dependent variable (Y) by
the regression model.
3. ESS (Explained Sum of Squares): The portion of the
total variation in the dependent variable (Y) explained
by the independent variables in the regression model.

4. F-statistic: A measure used to test the overall

significance of the regression model. It compares the
variance explained by the model to the unexplained
variance.

5. Prob(F-statistic): The probability associated with the

F-statistic, indicating the likelihood that the observed F-
value occurred by chance.

6. R-squared: A measure of how well the independent

variables explain the variability of the dependent
variable. It ranges from 0 to 1, with higher values
indicating a better fit of the model to the data.

7. Adjusted R-squared: A modified version of R-squared

that adjusts for the number of predictors in the model,
providing a more accurate assessment of model fit.

8. T-Statistic: A measure used to test the significance of

individual coefficients in the regression model. It
assesses whether the coefficient is significantly
different from zero.

9. Confidence Intervals for Coefficients: Intervals that

estimate the range within which the true population
parameter (coefficient) is likely to fall. They provide a
measure of the uncertainty associated with the
estimated coefficient.

c) Define Polynomial Regression. Why might we need

Polynomial Regression, and how is it formulated?
Ans –
Polynomial regression is a type of regression analysis
where the relationship between the independent
variable (X) and the dependent variable (Y) is modeled
as an nth degree polynomial.

We might need polynomial regression when the

relationship between the variables is not linear but can
be better described by a curve. It allows us to capture
more complex patterns in the data that cannot be
adequately represented by a straight line.

The formulation involves fitting a polynomial function to

the data, typically using the least squares method, to
minimize the sum of the squares of the differences
between the observed and predicted values. The
polynomial equation takes the form:

Y = β₀ + β₁X + β₂X² + ... + βₙXⁿ

Where Y is the dependent variable, X is the

independent variable, and β₀, β₁, β₂, ..., βₙ are the
coefficients of the polynomial terms.

2.2 Assumptions of Linear Regression

a) List and explain the assumptions of Linear

Regression. What are the implications if these
assumptions are violated?
Ans –
The assumptions of linear regression are:
1. Linearity: The relationship between the independent
and dependent variables is linear.
2. Independence: The residuals (the differences
between observed and predicted values) are
independent of each other.
3. Homoscedasticity: The variance of the residuals is
constant across all levels of the independent variables.
4. Normality: The residuals are normally distributed.
5. No multicollinearity: The independent variables are
not highly correlated with each other.

If these assumptions are violated:

1. Linearity: The model may not accurately represent

the relationship between variables, leading to biased
and unreliable predictions.
2. Independence: The estimated coefficients may be
biased and inefficient, and the standard errors may be
incorrect.
3. Homoscedasticity: Confidence intervals and
hypothesis tests may be inaccurate, and the model
may have unequal influence on different parts of the
data.
4. Normality: Confidence intervals and hypothesis tests
may be inaccurate, and the model may not provide
reliable predictions.
5. No multicollinearity: Estimated coefficients may be
unreliable and difficult to interpret, and the model may
have difficulty distinguishing the effects of individual
predictors.

b) Discuss methods to detect and handle

multicollinearity in Linear Regression models.
Ans - To detect multicollinearity:

1. Correlation matrix: Check for high correlations

(typically above 0.7 or 0.8) between independent
variables.
2. Variance Inflation Factor (VIF): Calculate the VIF for
each independent variable, with values above 5
indicating multicollinearity.
To handle multicollinearity:

1. Remove redundant variables: Drop one of the highly

correlated variables.
2. Combine variables: Create new composite variables
that capture the essence of the correlated variables.
3. Ridge regression or Lasso regression: Regularization
techniques that penalize large coefficients, effectively
reducing multicollinearity effects.
4. Principal Component Analysis (PCA): Transform the
original variables into a smaller set of uncorrelated
variables.

3. Regularization And Model Evaluation

3.1 Bias-Variance Tradeoff

a) Explain the concept of Bias and Variance in the

context of machine learning models. Why is it
important to study the Bias-Variance Tradeoff?
Ans –
Bias refers to the error introduced by approximating a
real-world problem with a simplified model. Variance
refers to the model's sensitivity to fluctuations in the
training data.

It's crucial to study the Bias-Variance Tradeoff because

it helps balance these two types of errors in machine
learning models. A model with high bias may
oversimplify the problem, leading to underfitting, while
a model with high variance may capture noise in the
training data, leading to overfitting. Achieving the right
balance between bias and variance is essential for
building models that generalize well to unseen data.
b) Define Regularization. How does Regularization
contribute to managing the Bias-Variance Tradeoff?
Provide a mathematical explanation and describe the
geometric intuition behind Regularization.
Ans –
Regularization is a technique used in machine learning
to prevent overfitting by adding a penalty term to the
loss function, discouraging the model from learning
overly complex patterns from the training data.

Mathematically, regularization is achieved by adding a

regularization term to the cost function, such as the L1
norm (Lasso) or the L2 norm (Ridge). The regularized
cost function is then minimized during training.

Geometrically, regularization adds a constraint to the

optimization problem, effectively shrinking the
coefficients of the model towards zero. This constraint
reduces the model's complexity, preventing it from
fitting the noise in the training data too closely, which
helps manage the bias-variance tradeoff.

c) Discuss the types of Regularization techniques

commonly used in machine learning. Provide examples
of situations where each type of Regularization might
be beneficial.
Ans - Two common types of regularization techniques
are Lasso (L1 regularization) and Ridge (L2
regularization):

1. Lasso (L1 regularization): Lasso adds the absolute

value of the coefficients as a penalty term to the loss
function. It encourages sparsity by driving some
coefficients to exactly zero. Lasso is useful when
dealing with high-dimensional datasets with many
irrelevant features. For example, in feature selection
tasks where you want to identify the most important
features while discarding less relevant ones.

2. Ridge (L2 regularization): Ridge adds the squared

magnitude of the coefficients as a penalty term to the
loss function. It penalizes large coefficients, effectively
shrinking them towards zero. Ridge is beneficial when
dealing with multicollinearity, where predictors are
highly correlated. It helps stabilize the model and
reduce the impact of multicollinearity by spreading the
coefficient values across correlated variables.

d) Implement Ridge Regression using scikit-learn for

both 2D and n-dimensional datasets. Additionally,
provide code examples of Ridge Regression
implemented from scratch and using Gradient Descent.
Ans –
Here's how you can implement Ridge Regression using
scikit-learn for both 2D and n-dimensional datasets:

```python
from sklearn.linear_model import Ridge

# For 2D dataset
X_2d = [[1, 2], [2, 3], [3, 4]] # Example 2D dataset
y_2d = [2, 3, 4] # Example 2D target

ridge_2d = Ridge(alpha=1.0) # Initialize Ridge

Regression model
ridge_2d.fit(X_2d, y_2d) # Fit the model to the data

# For n-dimensional dataset

X_nd = ... # Example n-dimensional dataset
y_nd = ... # Example n-dimensional target

ridge_nd = Ridge(alpha=1.0) # Initialize Ridge

Regression model
ridge_nd.fit(X_nd, y_nd) # Fit the model to the data
```
Now, here are code examples for implementing Ridge
Regression from scratch and using Gradient Descent:
Ridge Regression from scratch:
```python
import numpy as np

def ridge_regression(X, y, alpha):

n_samples, n_features = X.shape
I = np.identity(n_features) # Identity matrix
w = np.linalg.inv(X.T.dot(X) + alpha *
I).dot(X.T).dot(y)
return w

# Usage example
X = ... # Features
y = ... # Target
alpha = 1.0 # Regularization parameter
weights = ridge_regression(X, y, alpha)
```

Ridge Regression using Gradient Descent:

```python
import numpy as np

def ridge_regression_gradient_descent(X, y, alpha,

learning_rate, n_iterations):
n_samples, n_features = X.shape
w = np.zeros(n_features) # Initialize weights
for _ in range(n_iterations):
# Compute gradients
gradients = -(2/n_samples) * X.T.dot(y - X.dot(w))
+ 2 * alpha * w
# Update weights
w -= learning_rate * gradients
return w
# Usage example
X = ... # Features
y = ... # Target
alpha = 1.0 # Regularization parameter
learning_rate = 0.01 # Learning rate
n_iterations = 1000 # Number of iterations
weights = ridge_regression_gradient_descent(X, y,
alpha, learning_rate, n_iterations)
```

3.2 Data Leakage

a) What is Data Leakage in the context of machine

learning? Discuss the potential problems associated
with Data Leakage and how it can impact model
performance.
Ans –
Data leakage in machine learning occurs when
information from outside the training dataset is
inadvertently used to train the model, leading to
inflated performance metrics and misleading results.

Potential problems associated with data leakage

include:
1. Overly optimistic performance: Data leakage can
lead to overly optimistic evaluation metrics, making the
model appear more accurate than it actually is.
2. Unrealistic generalization: Models trained on leaked
data may not generalize well to unseen data, leading to
poor performance in real-world scenarios.
3. Misleading feature importance: Data leakage can
distort the importance of features, leading to incorrect
conclusions about which features are truly informative
for the target variable.
4. Ethical and legal concerns: Using leaked information,
especially sensitive or private data, can raise ethical
and legal issues, such as violating privacy regulations
or fairness principles.

b) Identify and explain the various ways in which Data

Leakage can occur during the data preprocessing and
model training phases. How can Data Leakage be
detected and mitigated?
Ans –
Data leakage can occur during data preprocessing and
model training phases in several ways:

1. Feature engineering: Including information from

the target variable or future data points when creating
features.
2. **Imputation**: Using information from the entire
dataset, including the target variable, to fill missing
values.
3. **Scaling or normalization**: Calculating statistics
(e.g., mean, standard deviation) using the entire
dataset, including the target variable.
4. **Cross-validation**: Leakage can occur if cross-
validation is not properly performed, such as when
preprocessing steps are applied before splitting the
data.
To detect and mitigate data leakage:
1. **Careful feature engineering**: Ensure that features
are created using only information available at the time
of prediction, not using the target variable or future
data.
2. **Holdout set**: Reserve a portion of the data as a
holdout set for validation to check for leakage during
model training.
3. **Cross-validation**: Perform preprocessing steps
within each fold of cross-validation to prevent
information leakage.
4. **Feature importance analysis**: Analyze feature
importance to identify any unexpected high-ranking
features that may indicate leakage.
5. **Domain knowledge**: Utilize domain knowledge to
scrutinize the preprocessing steps and model training
process for potential leakage sources.

c) Describe the concept of a Validation Set and its role

in preventing Data Leakage. How does the Validation
Set contribute to ensuring the generalization ability of a
machine learning model?
Ans –
A validation set is a portion of the dataset that is set
aside and not used during model training. It is used to
evaluate the performance of the trained model and
tune hyperparameters without introducing data
leakage.
The validation set helps ensure the generalization
ability of a machine learning model by providing an
unbiased estimate of its performance on unseen data.
By evaluating the model on data that it hasn't seen
during training, the validation set helps detect
overfitting and assesses how well the model will
perform on new, unseen data. This process helps
ensure that the model can generalize well to real-world
scenarios beyond the training data.

3.3 Hyperparameter Tuning

a) Distinguish between Parameters and
Hyperparameters in machine learning models. What
distinguishes the term "hyper" in Hyperparameters?
Ans –
Parameters are the internal coefficients or weights that
the machine learning model learns from the training
data. They are optimized during the training process to
minimize the error between the predicted and actual
outputs.

Hyperparameters, on the other hand, are external

configuration settings that govern the behavior of the
learning algorithm. They are not learned from the data
but are set prior to the training process. Examples
include learning rate, regularization strength, and the
number of hidden layers in a neural network.

The term "hyper" in hyperparameters distinguishes

them as parameters that control the learning process
itself, rather than being learned from the data. They
are set at a higher level of abstraction than the
parameters and influence how the model learns and
generalizes from the data.

b) Discuss the requirements for Hyperparameter Tuning

and its significance in optimizing machine learning
models.
Ans –
Hyperparameter tuning involves selecting the optimal
values for the hyperparameters of a machine learning
model to improve its performance. It's significant
because:

1. Performance optimization: Proper tuning can

significantly enhance a model's performance, leading to
better accuracy and generalization.
2. Avoiding overfitting: Tuning helps prevent overfitting
by finding the best hyperparameters that balance bias
and variance.
3. Model robustness: Optimizing hyperparameters
ensures that the model performs well across different
datasets and real-world scenarios.
4. Efficient resource utilization: Tuning helps allocate
computational resources effectively by focusing on
hyperparameters that have the most impact on model
performance.

c) Compare and contrast Grid Search Cross-Validation

(Grid Search CV) and Randomized Search Cross-
Validation (Randomized Search CV) as methods for
Hyperparameter Tuning. How does each method
contribute to improving model performance?
Ans –
Grid Search Cross-Validation (Grid Search CV) and
Randomized Search Cross-Validation (Randomized
Search CV) are both methods for hyperparameter
tuning, but they differ in their approach:

1. Grid Search CV:

- Exhaustively searches through a specified grid of
hyperparameter values.
- Evaluates the model's performance for each
combination of hyperparameters using cross-validation.
- Guarantees finding the optimal hyperparameters
within the specified grid.
- Suitable for a relatively small hyperparameter
search space.

2. Randomized Search CV:

- Randomly samples hyperparameter values from
specified distributions.
- Evaluates a random subset of hyperparameter
combinations using cross-validation.
- May not guarantee finding the optimal
hyperparameters but is more computationally efficient.
- Ideal for large hyperparameter search spaces or
when computational resources are limited.

Both methods contribute to improving model

performance by systematically exploring the
hyperparameter space and selecting the combination
of hyperparameters that yield the best performance,
thus helping to optimize the model for better accuracy
and generalization.

d) Explain with Example Grid Search CV, Randomized

Search CV
Ans –
**Grid Search CV Example:**
```python
from sklearn.model_selection import GridSearchCV
from sklearn.ensemble import RandomForestClassifier
from sklearn.datasets import load_iris

# Load dataset
iris = load_iris()
X = iris.data
y = iris.target

# Define hyperparameters grid

param_grid = {'n_estimators': [50, 100, 200],
'max_depth': [None, 5, 10]}

# Initialize GridSearchCV
grid_search = GridSearchCV(RandomForestClassifier(),
param_grid, cv=5)

# Fit model
grid_search.fit(X, y)
# Best hyperparameters
print("Best hyperparameters:",
grid_search.best_params_)
```

Randomized Search CV Example:

```python
from sklearn.model_selection import
RandomizedSearchCV
from scipy.stats import randint
from sklearn.ensemble import RandomForestClassifier

# Define hyperparameters distributions

param_dist = {'n_estimators': randint(50, 200),
'max_depth': [None, 5, 10]}

# Initialize RandomizedSearchCV
random_search =
RandomizedSearchCV(RandomForestClassifier(),
param_dist, n_iter=5, cv=5)

# Fit model
random_search.fit(X, y)

# Best hyperparameters
print("Best hyperparameters:",
random_search.best_params_)
```

In both examples, we're tuning hyperparameters for a

RandomForestClassifier using Grid Search CV and
Randomized Search CV on the Iris dataset. The
hyperparameters we're tuning are the number of
estimators (`n_estimators`) and the maximum depth of
the trees (`max_depth`).
e) Explain Wrapper method and Types of wrapper
method
Ans –
Wrapper methods are feature selection techniques that evaluate
different subsets of features using a specific machine learning
algorithm to determine the best subset. These methods use the
performance of the model as a criterion for selecting features.
Types of wrapper methods include:
1. Forward selection: Starts with an empty set of features and
iteratively adds the most predictive feature until a stopping criterion
is met.
2. Backward elimination: Begins with all features and iteratively
removes the least predictive feature until a stopping criterion is met.
3. Recursive feature elimination (RFE): Selects features by recursively
fitting the model and removing the least important features at each
iteration based on feature weights or coefficients.

3 Da
No ratings yet
3 Da
16 pages
Regression Questionnaire
No ratings yet
Regression Questionnaire
10 pages
SumitBurnwal ML
No ratings yet
SumitBurnwal ML
13 pages
UNIT II Regration
No ratings yet
UNIT II Regration
62 pages
Linear Regression Algorithm
No ratings yet
Linear Regression Algorithm
16 pages
Unit 2
No ratings yet
Unit 2
92 pages
Linear Regression
No ratings yet
Linear Regression
5 pages
Isn't Linear Regression From Statistics?
No ratings yet
Isn't Linear Regression From Statistics?
4 pages
Linear Regression
No ratings yet
Linear Regression
35 pages
Understanding Linear Regression Techniques
No ratings yet
Understanding Linear Regression Techniques
31 pages
Lect 10 Regression
No ratings yet
Lect 10 Regression
7 pages
21csc305p ML Unit 2
No ratings yet
21csc305p ML Unit 2
115 pages
Regression
No ratings yet
Regression
45 pages
Lecture+Notes+-+Advanced+Regression
No ratings yet
Lecture+Notes+-+Advanced+Regression
12 pages
Practical 5
No ratings yet
Practical 5
8 pages
Regression Techniques Explained
No ratings yet
Regression Techniques Explained
16 pages
Ch-2 Supervised Machine Learning
No ratings yet
Ch-2 Supervised Machine Learning
48 pages
Complete Linear Regression Algorithm
No ratings yet
Complete Linear Regression Algorithm
4 pages
Machine Learning Questions and Answers For Interview
No ratings yet
Machine Learning Questions and Answers For Interview
20 pages
RRB - Unit 2 Regresion
No ratings yet
RRB - Unit 2 Regresion
53 pages
DS Unit 2: Regression & Classification
No ratings yet
DS Unit 2: Regression & Classification
17 pages
Lecture 3
No ratings yet
Lecture 3
33 pages
Linear Regression for Data Analysts
No ratings yet
Linear Regression for Data Analysts
3 pages
Linear Regression & Data Science Basics
No ratings yet
Linear Regression & Data Science Basics
10 pages
Lecture Notes - Linear Regression
No ratings yet
Lecture Notes - Linear Regression
26 pages
Data Science Interview Preparation
100% (1)
Data Science Interview Preparation
113 pages
OE-ML Unit - 3
No ratings yet
OE-ML Unit - 3
29 pages
3 Unit - Dspu
No ratings yet
3 Unit - Dspu
23 pages
Understanding Linear Regression Concepts
No ratings yet
Understanding Linear Regression Concepts
43 pages
ML - Module 2
No ratings yet
ML - Module 2
16 pages
Machine Learning QB
No ratings yet
Machine Learning QB
32 pages
(Unit-04) Part-01 - ML Algo
No ratings yet
(Unit-04) Part-01 - ML Algo
49 pages
Analytics Compendium
No ratings yet
Analytics Compendium
41 pages
Linear Regression
No ratings yet
Linear Regression
60 pages
Teit ML2
No ratings yet
Teit ML2
11 pages
ML Solved Endsem
No ratings yet
ML Solved Endsem
16 pages
Describe in Brief Different Types of Regression Algorithms
No ratings yet
Describe in Brief Different Types of Regression Algorithms
25 pages
DAV 2201079 Exp 2 2-1
No ratings yet
DAV 2201079 Exp 2 2-1
35 pages
Linear Regression
No ratings yet
Linear Regression
11 pages
MLDAP Module2
No ratings yet
MLDAP Module2
32 pages
LR GD
No ratings yet
LR GD
5 pages
Unit - Iv
No ratings yet
Unit - Iv
11 pages
Machine Learning (CSO851) - Lecture 02
No ratings yet
Machine Learning (CSO851) - Lecture 02
74 pages
Unit-2 Ak
No ratings yet
Unit-2 Ak
106 pages
MATH6183 Introduction+Regression
No ratings yet
MATH6183 Introduction+Regression
70 pages
Unit-2 ML
No ratings yet
Unit-2 ML
199 pages
Linear Regression Essentials
No ratings yet
Linear Regression Essentials
14 pages
Module 5.2
No ratings yet
Module 5.2
51 pages
2EL1730 ML Lecture02 Linear and Logistic Regression
No ratings yet
2EL1730 ML Lecture02 Linear and Logistic Regression
65 pages
ML Question Bank
No ratings yet
ML Question Bank
13 pages
Linear Regression
No ratings yet
Linear Regression
31 pages
Linear Regression Case Study
No ratings yet
Linear Regression Case Study
6 pages
ML Unit-2
No ratings yet
ML Unit-2
34 pages
ML EasySol
No ratings yet
ML EasySol
62 pages
Comprehensive Guide to Regression Analysis
No ratings yet
Comprehensive Guide to Regression Analysis
7 pages
Curve Fitting and Interpolation
No ratings yet
Curve Fitting and Interpolation
14 pages
Grade 8 02-18-20
No ratings yet
Grade 8 02-18-20
2 pages
Statistical Inference: Parameter Estimation
No ratings yet
Statistical Inference: Parameter Estimation
5 pages
MATH
No ratings yet
MATH
34 pages
Quality Control and Assurance (QAQC)
No ratings yet
Quality Control and Assurance (QAQC)
25 pages
Mean, Mode & Median Exercises for Class 7
No ratings yet
Mean, Mode & Median Exercises for Class 7
3 pages
De PTKD 1 de Kiem Tra
No ratings yet
De PTKD 1 de Kiem Tra
5 pages
Biostatistic Outline
No ratings yet
Biostatistic Outline
2 pages
Independent Component Analysis: A Statistical Perspective: Klaus Nordhausen - Hannu Oja
No ratings yet
Independent Component Analysis: A Statistical Perspective: Klaus Nordhausen - Hannu Oja
23 pages
Synthetic CV Chart Analysis
No ratings yet
Synthetic CV Chart Analysis
16 pages
Range and Quartiles
0% (1)
Range and Quartiles
4 pages
Pearson 1920
No ratings yet
Pearson 1920
21 pages
QBM101 Tutorial 7
No ratings yet
QBM101 Tutorial 7
2 pages
Machine Learning Exam Guide
No ratings yet
Machine Learning Exam Guide
6 pages
Grade 11 Statistics - 095129 - 072455
No ratings yet
Grade 11 Statistics - 095129 - 072455
9 pages
CSE455/CSE552 Machine Learning (Spring 2024) Homework #1: Hand-In Policy Collaboration Policy Grading
No ratings yet
CSE455/CSE552 Machine Learning (Spring 2024) Homework #1: Hand-In Policy Collaboration Policy Grading
2 pages
4.1 Multiple Regression Models
No ratings yet
4.1 Multiple Regression Models
6 pages
Independent Samples T Test
No ratings yet
Independent Samples T Test
4 pages
BigMart Retail Sales Analysis Case Study
No ratings yet
BigMart Retail Sales Analysis Case Study
5 pages
Probability of Defects Analysis
No ratings yet
Probability of Defects Analysis
23 pages
FDS Slot 1
No ratings yet
FDS Slot 1
19 pages
Samplecode (HDPS)
No ratings yet
Samplecode (HDPS)
29 pages
Statistics Word Search 4: Name: - Date
No ratings yet
Statistics Word Search 4: Name: - Date
4 pages
Probability Exam Guide 2019
No ratings yet
Probability Exam Guide 2019
5 pages
MBA SFM Probs On Measures of Central Tendency
No ratings yet
MBA SFM Probs On Measures of Central Tendency
6 pages
CH 04 Ex 1
No ratings yet
CH 04 Ex 1
2 pages
Water Quality Impact of Beaver Ponds
No ratings yet
Water Quality Impact of Beaver Ponds
9 pages
Double Exponential Smoothing Analysis
No ratings yet
Double Exponential Smoothing Analysis
11 pages
Central Tendency for Stats Students
No ratings yet
Central Tendency for Stats Students
11 pages
Chapter 1 Learning Target Keys
No ratings yet
Chapter 1 Learning Target Keys
1 page
Chi-Square Statistic: 11.42857
No ratings yet
Chi-Square Statistic: 11.42857
3 pages

Advance Machine Learning

Uploaded by

Advance Machine Learning

Uploaded by

Advance Machine Learning [AML]

2.1 Regression Analysis

a) Discuss the significance of Regression Analysis. What

b) Explain the terms TSS, RSS, ESS, F-statistic, Prob(F-

2. RSS (Residual Sum of Squares): The total amount of

4. F-statistic: A measure used to test the overall

5. Prob(F-statistic): The probability associated with the

6. R-squared: A measure of how well the independent

7. Adjusted R-squared: A modified version of R-squared

8. T-Statistic: A measure used to test the significance of

9. Confidence Intervals for Coefficients: Intervals that

c) Define Polynomial Regression. Why might we need

We might need polynomial regression when the

The formulation involves fitting a polynomial function to

Y = β₀ + β₁X + β₂X² + ... + βₙXⁿ

Where Y is the dependent variable, X is the

2.2 Assumptions of Linear Regression

a) List and explain the assumptions of Linear

If these assumptions are violated:

1. Linearity: The model may not accurately represent

b) Discuss methods to detect and handle

1. Correlation matrix: Check for high correlations

1. Remove redundant variables: Drop one of the highly

3. Regularization And Model Evaluation

a) Explain the concept of Bias and Variance in the

It's crucial to study the Bias-Variance Tradeoff because

Mathematically, regularization is achieved by adding a

Geometrically, regularization adds a constraint to the

c) Discuss the types of Regularization techniques

1. Lasso (L1 regularization): Lasso adds the absolute

2. Ridge (L2 regularization): Ridge adds the squared

d) Implement Ridge Regression using scikit-learn for

ridge_2d = Ridge(alpha=1.0) # Initialize Ridge

# For n-dimensional dataset

ridge_nd = Ridge(alpha=1.0) # Initialize Ridge

def ridge_regression(X, y, alpha):

Ridge Regression using Gradient Descent:

def ridge_regression_gradient_descent(X, y, alpha,

3.2 Data Leakage

a) What is Data Leakage in the context of machine

Potential problems associated with data leakage

b) Identify and explain the various ways in which Data

1. **Feature engineering**: Including information from

c) Describe the concept of a Validation Set and its role

3.3 Hyperparameter Tuning

Hyperparameters, on the other hand, are external

The term "hyper" in hyperparameters distinguishes

b) Discuss the requirements for Hyperparameter Tuning

1. Performance optimization: Proper tuning can

c) Compare and contrast Grid Search Cross-Validation

1. Grid Search CV:

2. Randomized Search CV:

Both methods contribute to improving model

d) Explain with Example Grid Search CV, Randomized

# Define hyperparameters grid

**Randomized Search CV Example:**

# Define hyperparameters distributions

In both examples, we're tuning hyperparameters for a

You might also like

1. Feature engineering: Including information from

Randomized Search CV Example: