MACHINE LEARNING:
REGULARIZATION
Presented by
Vikas Chandra
Scientist ‘C’
ETDC Goa
INTRODUCTION
Machine learning models need to generalize well to new examples that they have not seen during training. In this module, we introduce regularization, which helps prevent models from overfitting the training data.
THE PROBLEM OF OVERFITTING
Example: Linear Regression (housing prices)
Overfitting: if we have too many features, the learned hypothesis may fit the training set very well (so that the cost function J(θ) ≈ 0), but fail to generalize to new examples (e.g. fail to predict prices for new houses).
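To make this concrete, here is a minimal NumPy sketch (the synthetic data and polynomial degrees are my own assumptions, not from the slides) contrasting low- and high-degree fits:

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic "housing" data: price grows roughly with size, plus a little noise.
size_train = np.linspace(0.5, 2.5, 10)
price_train = 2.0 * np.sqrt(size_train) + rng.normal(0, 0.05, size_train.shape)
size_test = np.linspace(0.6, 2.4, 50)
price_test = 2.0 * np.sqrt(size_test) + rng.normal(0, 0.05, size_test.shape)

def mse(coeffs, x, y):
    """Mean squared error of a fitted polynomial on (x, y)."""
    return np.mean((np.polyval(coeffs, x) - y) ** 2)

for degree in (1, 2, 9):
    coeffs = np.polyfit(size_train, price_train, degree)
    print(f"degree {degree}: train MSE = {mse(coeffs, size_train, price_train):.5f}, "
          f"test MSE = {mse(coeffs, size_test, price_test):.5f}")
```

Typically the degree-9 fit drives the training error to (near) zero yet generalizes worse than the low-degree fits, which is exactly the overfitting problem.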
THE PROBLEM OF OVERFITTING
Example: Logistic Regression
POP QUIZ
Consider the medical diagnosis problem of classifying
tumours as malignant or benign. If a hypothesis ℎ𝜃(𝑥)
has overfit the training set, it means that:
a) It makes accurate predictions for examples in the
training set and generalizes well to make accurate
predictions on new, previously unseen examples.
b) It does not make accurate predictions for examples in the
training set, but it does generalize well to make accurate
predictions on new, previously unseen examples.
c) It makes accurate predictions for examples in the
training set, but it does not generalize well to make
accurate predictions on new, previously unseen examples.
d) It does not make accurate predictions for examples in the
training set and does not generalize well to make
accurate predictions on new, previously unseen examples.
ADDRESSING OVERFITTING
Housing prices example:
ADDRESSING OVERFITTING
1. Reduce the number of features:
Manually select which features to keep.
Use a model selection algorithm (outside the scope of this course).
Principal Component Analysis (PCA): Transforms the data
into a set of linearly uncorrelated components.
Recursive Feature Elimination (RFE): Iteratively builds
models and eliminates the least important features based
on model coefficients.
2. Regularization
Keep all the features, but reduce the magnitude of the parameters θ_j.
Regularization works well when we have a lot of slightly useful features (a short scikit-learn sketch contrasting these two approaches follows below).
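As a rough illustration of these two options (a sketch only; the dataset, feature counts and alpha value below are arbitrary assumptions), scikit-learn's RFE drops features, while Ridge regression keeps all of them but shrinks the coefficients:

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.feature_selection import RFE
from sklearn.linear_model import LinearRegression, Ridge

# Synthetic data: 20 features, only 5 of which are actually informative.
X, y = make_regression(n_samples=100, n_features=20, n_informative=5,
                       noise=10.0, random_state=0)

# Option 1: reduce the number of features (RFE keeps the 5 "best" ones).
rfe = RFE(estimator=LinearRegression(), n_features_to_select=5).fit(X, y)
print("features kept by RFE:", np.flatnonzero(rfe.support_))

# Option 2: regularization (Ridge keeps all 20 features but shrinks theta_j).
ridge = Ridge(alpha=10.0).fit(X, y)
print("largest |theta_j| under Ridge:", np.abs(ridge.coef_).max())
```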
COST FUNCTION: INTUITION
For the 4th-degree polynomial fit, what if we make θ_3 and θ_4 really small? In that case our hypothesis will be similar to the 2nd case (the 2nd-degree polynomial fit).
How can we make θ_3 and θ_4 small?
COST FUNCTION: INTUITION
Regularization: small values for the parameters θ_j
Simpler hypothesis
Less prone to overfitting
Cost function:
J(θ) = (1/2m) [ Σ (hθ(x^(i)) − y^(i))² + λ Σ θ_j² ]
(the first sum runs over the m training examples, the second over j = 1, …, n)
Here λ is the regularization parameter; θ_0 is not regularized.
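A minimal NumPy sketch of this regularized cost (the function and variable names are mine; θ_0 is excluded from the penalty, as stated above):

```python
import numpy as np

def regularized_cost(theta, X, y, lam):
    """J(theta) = (1/2m) * [sum of squared errors + lam * sum_{j>=1} theta_j^2]."""
    m = len(y)
    errors = X @ theta - y                   # h_theta(x^(i)) - y^(i) for every example
    penalty = lam * np.sum(theta[1:] ** 2)   # theta_0 is not regularized
    return (np.sum(errors ** 2) + penalty) / (2 * m)
```

Here X is assumed to already contain the column of ones for x_0.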
POP QUIZ
In regularized linear regression, we choose θ to minimize
J(θ) = (1/2m) [ Σ (hθ(x^(i)) − y^(i))² + λ Σ θ_j² ]
What if λ is set to an extremely large value (say λ = 10^10)?
a) Algorithm works fine; setting λ to be very large can't hurt it.
b) Algorithm fails to eliminate overfitting.
c) Algorithm results in underfitting (fails to fit even the training data well).
d) Gradient descent will fail to converge.
POP QUIZ: ANSWER
What if λ is set to an extremely large value (say λ = 10^10)?
a) Algorithm works fine; setting λ to be very large can't hurt it.
b) Algorithm fails to eliminate overfitting.
c) Algorithm results in underfitting (fails to fit even the training data well).
d) Gradient descent will fail to converge.
Answer: (c). With such a large λ, every parameter θ_1, …, θ_4 is penalized heavily and driven towards zero, so the hypothesis
hθ(x) = θ_0 + θ_1·x + θ_2·x² + θ_3·x³ + θ_4·x⁴ ≈ θ_0,
a constant (flat) line that underfits even the training data.
REGULARIZED LINEAR REGRESSION:
GRADIENT DESCENT
Cost function:
J(θ) = (1/2m) [ Σ (hθ(x^(i)) − y^(i))² + λ Σ θ_j² ]
Gradient descent previously (without regularization):
Repeat { θ_j := θ_j − α (1/m) Σ (hθ(x^(i)) − y^(i)) x_j^(i) }   (simultaneously for all j)
We will modify this gradient descent update to separate out θ_0 from the rest of the parameters, because we do not want to penalize θ_0.
REGULARIZED LINEAR REGRESSION:
GRADIENT DESCENT
The regularized updates become
θ_0 := θ_0 − α (1/m) Σ (hθ(x^(i)) − y^(i)) x_0^(i)
θ_j := θ_j − α [ (1/m) Σ (hθ(x^(i)) − y^(i)) x_j^(i) + (λ/m) θ_j ]   (j = 1, …, n)
The term (λ/m)·θ_j performs our regularization. With some manipulation, the update rule for j ≥ 1 can also be represented as
θ_j := θ_j (1 − α·λ/m) − α (1/m) Σ (hθ(x^(i)) − y^(i)) x_j^(i)
Since (1 − α·λ/m) is slightly less than 1, each update first shrinks θ_j a little and then takes the usual gradient step.
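A sketch of one such update step in NumPy (the learning rate and data are placeholders), keeping θ_0 out of the shrinkage term as described above:

```python
import numpy as np

def gradient_descent_step(theta, X, y, alpha, lam):
    """One regularized gradient-descent update; X[:, 0] is the all-ones column."""
    m = len(y)
    grad = (X.T @ (X @ theta - y)) / m               # (1/m) * sum (h(x) - y) * x_j
    theta_new = theta - alpha * grad                 # ordinary gradient step for all j
    theta_new[1:] -= alpha * (lam / m) * theta[1:]   # extra shrinkage for theta_1..theta_n
    return theta_new
```

Calling this in a loop for a fixed number of iterations (or until the cost stops decreasing) would give the full algorithm.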
POP QUIZ
Suppose you are doing gradient descent on a
training set of 𝑚>0 examples, using a fairly small
learning rate α>0 and some regularization
parameter 𝜆>0. Consider the update rule:
REGULARIZED LINEAR REGRESSION:
NORMAL EQUATION
m = 4 training examples; x_0^(i) = 1 for i = 1, 2, 3, 4

Size (feet²) (x_1) | #Bedrooms (x_2) | #Floors (x_3) | Age of home (years) (x_4) | Price ($) in 1000's (y)
2104 | 5 | 1 | 5 | 460
1416 | 3 | 2 | 7 | 232
1534 | 3 | 2 | 3 | 315
852 | 2 | 4 | 1 | 178
θ = (XᵀX + λ·L)⁻¹ Xᵀy
where L is the (n+1)×(n+1) diagonal matrix with a 0 in the top-left entry and 1's on the rest of the diagonal, so that θ_0 is not regularized.
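A short NumPy sketch of this closed-form solution, using the numbers from the table above and an arbitrary λ (the matrix L is built exactly as just described):

```python
import numpy as np

# Design matrix X with x_0 = 1 prepended, and y = prices, from the table above.
X = np.array([[1, 2104, 5, 1, 5],
              [1, 1416, 3, 2, 7],
              [1, 1534, 3, 2, 3],
              [1,  852, 2, 4, 1]], dtype=float)
y = np.array([460, 232, 315, 178], dtype=float)

lam = 1.0                  # regularization parameter (arbitrary value for illustration)
L = np.eye(X.shape[1])     # identity matrix ...
L[0, 0] = 0.0              # ... except a 0 in the top-left, so theta_0 is not penalized

# theta = (X^T X + lam * L)^-1 X^T y; solve() avoids forming the inverse explicitly.
theta = np.linalg.solve(X.T @ X + lam * L, X.T @ y)
print(theta)
```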
REGULARIZED LINEAR REGRESSION:
NORMAL EQUATION
REGULARIZED LOGISTIC REGRESSION:
COST FUNCTION
Previously (without regularization):
J(θ) = −(1/m) Σ [ y^(i) log hθ(x^(i)) + (1 − y^(i)) log(1 − hθ(x^(i))) ]
Regularized cost function:
J(θ) = −(1/m) Σ [ y^(i) log hθ(x^(i)) + (1 − y^(i)) log(1 − hθ(x^(i))) ] + (λ/2m) Σ θ_j²
where the last sum runs over j = 1, …, n, so θ_0 is again not regularized.
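A minimal NumPy sketch of this regularized logistic cost (names are mine; a small epsilon guards the logarithms):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def regularized_logistic_cost(theta, X, y, lam, eps=1e-12):
    """-(1/m) * sum[y*log(h) + (1-y)*log(1-h)] + (lam/2m) * sum_{j>=1} theta_j^2."""
    m = len(y)
    h = sigmoid(X @ theta)
    cross_entropy = -(y * np.log(h + eps) + (1 - y) * np.log(1 - h + eps)).mean()
    penalty = (lam / (2 * m)) * np.sum(theta[1:] ** 2)
    return cross_entropy + penalty
```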
REGULARIZED LOGISTIC REGRESSION:
GRADIENT DESCENT
Gradient Descent: the update rules look identical to regularized linear regression, but with hθ(x) = 1 / (1 + e^(−θᵀx)):
θ_0 := θ_0 − α (1/m) Σ (hθ(x^(i)) − y^(i)) x_0^(i)
θ_j := θ_j − α [ (1/m) Σ (hθ(x^(i)) − y^(i)) x_j^(i) + (λ/m) θ_j ]   (j = 1, …, n)
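A sketch of the corresponding update step (again with θ_0 left unpenalized; α, λ and the data are placeholders):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def logistic_gradient_step(theta, X, y, alpha, lam):
    """One regularized gradient-descent update for logistic regression."""
    m = len(y)
    grad = (X.T @ (sigmoid(X @ theta) - y)) / m   # (1/m) * sum (h(x) - y) * x_j
    grad[1:] += (lam / m) * theta[1:]             # add (lam/m)*theta_j for j >= 1 only
    return theta - alpha * grad
```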
REGULARIZED LOGISTIC REGRESSION:
ADVANCED OPTIMIZATION
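Instead of a hand-written gradient-descent loop, an off-the-shelf optimizer can minimize the same regularized cost, in the spirit of this slide. A sketch using scipy.optimize.minimize on a toy dataset (all names and numbers here are illustrative assumptions):

```python
import numpy as np
from scipy.optimize import minimize

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def cost_and_grad(theta, X, y, lam):
    """Regularized logistic cost J(theta) and its gradient; theta_0 is unpenalized."""
    m = len(y)
    h = np.clip(sigmoid(X @ theta), 1e-12, 1 - 1e-12)
    cost = -(y * np.log(h) + (1 - y) * np.log(1 - h)).mean() \
           + (lam / (2 * m)) * np.sum(theta[1:] ** 2)
    grad = (X.T @ (h - y)) / m
    grad[1:] += (lam / m) * theta[1:]
    return cost, grad

# Toy data: one feature plus an intercept column of ones.
X = np.array([[1.0, 0.5], [1.0, 1.5], [1.0, 2.5], [1.0, 3.5]])
y = np.array([0.0, 0.0, 1.0, 1.0])

# jac=True tells the optimizer that cost_and_grad returns (cost, gradient).
result = minimize(cost_and_grad, x0=np.zeros(X.shape[1]),
                  args=(X, y, 1.0), jac=True, method="BFGS")
print(result.x)   # fitted parameters theta
```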