Regularization in Machine Learning

What is Regularization?
Regularization is one of the most important concepts of machine learning.
It is a technique to prevent the model from overfitting by adding extra
information to it.

Sometimes a machine learning model performs well on the training
data but does not perform well on the test data. This means the model
cannot predict the output correctly when it deals with unseen data,
because it has also learned the noise in the training data; such a model is
called overfitted. This problem can be dealt with using a regularization
technique.

This technique can be applied so that all variables or features are kept in
the model while the magnitude of their coefficients is reduced. Hence, it
maintains accuracy as well as the generalization ability of the model.

It mainly regularizes or shrinks the coefficients of the features toward zero. In
simple words, "in the regularization technique, we reduce the magnitude of
the features while keeping the same number of features."

Regularization refers to techniques used to calibrate machine learning
models to minimize the adjusted loss function and avoid overfitting or
underfitting.
Working of Regularization
Regularization works by adding a penalty or complexity term to the
complex model. Let's consider the simple linear regression equation:

y = β0 + β1x1 + β2x2 + β3x3 + ⋯ + βnxn + b

In the above equation, Y represents the value to be predicted, and
X1, X2, …, Xn are the features for Y.

β1, β2, …, βn are the weights or magnitudes attached to the respective
features, β0 represents the bias of the model, and b represents the
intercept.

Linear regression models try to optimize the weights β0, …, βn and the
intercept b to minimize the cost function. The equation for the cost
function of the linear model is given below:

Cost function = RSS = ∑(yi − ŷi)^2

Here ŷi is the value the model predicts for the i-th sample. We optimize
the parameters so that the model can predict the value of Y accurately.
The loss function for linear regression is called RSS, or the residual
sum of squares.
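As a quick illustration, the sketch below (Python with NumPy; the data, weights, and intercept are made up for the example, not taken from the text) computes the RSS loss of a linear model:

```python
import numpy as np

# Illustrative data: 5 samples, 2 features (not from the text)
X = np.array([[1.0, 2.0],
              [2.0, 1.0],
              [3.0, 4.0],
              [4.0, 3.0],
              [5.0, 5.0]])
y = np.array([3.1, 2.9, 7.2, 6.8, 10.1])

beta = np.array([1.0, 1.0])  # weights attached to the features
b = 0.1                      # bias / intercept term

# Predictions of the linear model: y_hat = X @ beta + b
y_hat = X @ beta + b

# Residual sum of squares (RSS): the sum of squared prediction errors
rss = np.sum((y - y_hat) ** 2)
print("RSS:", rss)
```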

Techniques of Regularization
There are mainly two types of regularization techniques, which are given
below:

o Ridge Regression
o Lasso Regression

Ridge Regression
o Ridge regression is one of the types of linear regression in which a small
amount of bias is introduced so that we can get better long-term
predictions.
o Ridge regression is a regularization technique, which is used to reduce the
complexity of the model. It is also called L2 regularization.
o In this technique, the cost function is altered by adding a penalty term
to it. The amount of bias added to the model is called the ridge regression
penalty. We can calculate it by multiplying lambda by the
squared weight of each individual feature.
o The equation for the cost function in ridge regression will be:

Cost function = ∑(yi − ŷi)^2 + λ x ∑‖w‖^2
o In the above equation, the penalty term regularizes the coefficients of the
model, and hence ridge regression reduces the amplitudes of the
coefficients, which decreases the complexity of the model.
o As we can see from the above equation, if the value of λ tends to zero,
the equation becomes the cost function of the linear regression
model. Hence, for a very small value of λ, the model will resemble the
linear regression model (see the sketch after this list).
o A general linear or polynomial regression will fail if there is high
collinearity between the independent variables, so to solve such problems,
Ridge regression can be used.
o It also helps to solve problems where we have more parameters than samples.
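The behaviour of λ described in the list above can be checked with a small sketch; this assumes scikit-learn is available, alpha is scikit-learn's name for λ, and the data is synthetic, purely for illustration:

```python
import numpy as np
from sklearn.linear_model import LinearRegression, Ridge

# Synthetic data for illustration only
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 3))
y = X @ np.array([2.0, -1.0, 0.5]) + rng.normal(scale=0.1, size=50)

ols = LinearRegression().fit(X, y)
ridge_small = Ridge(alpha=1e-8).fit(X, y)   # lambda close to zero
ridge_large = Ridge(alpha=100.0).fit(X, y)  # strong penalty

print("OLS coefficients:    ", ols.coef_)
print("Ridge (alpha ~ 0):   ", ridge_small.coef_)  # nearly identical to OLS
print("Ridge (alpha = 100): ", ridge_large.coef_)  # shrunk toward zero
```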

To restate, ridge regression (L2 regularization) modifies over-fitted or
under-fitted models by adding a penalty equivalent to the sum of the
squares of the magnitudes of the coefficients.

This means that the mathematical function representing our machine
learning model is minimized and the coefficients are calculated. The
magnitudes of the coefficients are squared and added. Ridge regression
performs regularization by shrinking these coefficients. The function
depicted below is the cost function of ridge regression:
Figure 7: Cost function of ridge regression
o In the cost function, the penalty term is represented by lambda (λ).
By changing the value of λ, we control the strength of the penalty.
The higher the penalty, the more the magnitudes of the coefficients
are reduced: it shrinks the parameters. Therefore, ridge regression is
used to prevent multicollinearity, and it reduces model complexity by
coefficient shrinkage.
o Consider the graph illustrated below, which represents linear regression:

Figure 8: Linear regression model

Cost function = Loss + λ x ∑‖w‖^2

For the linear regression line, let's consider two points that are on the line:
Loss = 0 (considering the two points on the line)
λ = 1
w = 1.4
Then, Cost function = 0 + 1 x 1.4^2
= 1.96
For the ridge regression line, let's assume:
Loss = 0.3^2 + 0.2^2 = 0.13
λ = 1
w = 0.7
Then, Cost function = 0.13 + 1 x 0.7^2
= 0.62
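The arithmetic above can be reproduced in a couple of lines; the loss, λ, and w values are the ones assumed in the worked example:

```python
# Ridge cost = loss + lambda * (squared weight), as in the worked example
def ridge_cost(loss, lam, w):
    return loss + lam * w ** 2

print(round(ridge_cost(loss=0.0, lam=1.0, w=1.4), 2))   # 1.96 (linear regression line)
print(round(ridge_cost(loss=0.13, lam=1.0, w=0.7), 2))  # 0.62 (ridge regression line)
```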

Figure 9: Ridge regression model
o Comparing the two models, with all data points, we can see that the
ridge regression line fits the data more accurately than the linear
regression line.

Figure 10: Optimization of model fit using ridge regression
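In practice, ridge regression is available as sklearn.linear_model.Ridge. The sketch below is a minimal example on synthetic, noisy data with many features (all values are illustrative); with this kind of setup the penalized model typically generalizes better to held-out data than plain linear regression:

```python
import numpy as np
from sklearn.linear_model import LinearRegression, Ridge
from sklearn.model_selection import train_test_split

# Synthetic, noisy data with many features (illustrative only)
rng = np.random.default_rng(42)
X = rng.normal(size=(60, 25))
true_w = np.zeros(25)
true_w[:3] = [1.5, -2.0, 1.0]            # only a few features really matter
y = X @ true_w + rng.normal(scale=2.0, size=60)

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

ols = LinearRegression().fit(X_train, y_train)
ridge = Ridge(alpha=10.0).fit(X_train, y_train)

# The penalized model usually scores better on the held-out data here
print("OLS   train/test R^2:", ols.score(X_train, y_train), ols.score(X_test, y_test))
print("Ridge train/test R^2:", ridge.score(X_train, y_train), ridge.score(X_test, y_test))
```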
Lasso Regression
o Lasso regression is another regularization technique to reduce the
complexity of the model. It stands for Least Absolute Shrinkage and
Selection Operator.
o It is similar to the Ridge Regression except that the penalty term contains
only the absolute weights instead of a square of weights.
o Since it takes absolute values, hence, it can shrink the slope to 0, whereas
Ridge Regression can only shrink it near to 0.
o It is also called L1 regularization. The equation for the cost function
of lasso regression will be:

Cost function = ∑(yi − ŷi)^2 + λ x ∑‖w‖

o In this technique, some of the features are completely neglected for model
evaluation, because their coefficients shrink to exactly zero.
o Hence, lasso regression can help us reduce overfitting in the model as
well as perform feature selection.

It modifies over-fitted or under-fitted models by adding a penalty
equivalent to the sum of the absolute values of the coefficients.

Lasso regression also performs coefficient minimization, but instead of
squaring the magnitudes of the coefficients, it takes their absolute
values. This means that some coefficients can shrink all the way to
exactly 0, removing the corresponding features. Consider the cost
function for lasso regression:
Figure 11: Cost function for lasso regression

We can control the coefficient values by controlling the penalty term,
just like we did in ridge regression. Again, consider a linear regression
model:

Figure 12: Linear regression model


Cost function = Loss + λ x ∑‖w‖
For the linear regression line, let's assume:
Loss = 0 (considering the two points on the line)
λ = 1
w = 1.4
Then, Cost function = 0 + 1 x 1.4
= 1.4
For the lasso regression line, let's assume:
Loss = 0.3^2 + 0.1^2 = 0.1
λ = 1
w = 0.7
Then, Cost function = 0.1 + 1 x 0.7
= 0.8
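As with ridge, the numbers above can be checked directly; the loss, λ, and w values are the ones assumed in the worked example:

```python
# Lasso cost = loss + lambda * |w|, as in the worked example
def lasso_cost(loss, lam, w):
    return loss + lam * abs(w)

print(round(lasso_cost(loss=0.0, lam=1.0, w=1.4), 2))  # 1.4 (linear regression line)
print(round(lasso_cost(loss=0.1, lam=1.0, w=0.7), 2))  # 0.8 (lasso regression line)
```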

Comparing the two models, with all data points, we can see that the
lasso regression line fits the data more accurately than the linear
regression line.

Key Difference between Ridge Regression and Lasso Regression
o Ridge regression is mostly used to reduce the overfitting in the model,
and it includes all the features present in the model. It reduces the
complexity of the model by shrinking the coefficients.
o Lasso regression helps to reduce overfitting in the model and also
performs feature selection, as illustrated in the sketch below.
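The difference is easy to see with scikit-learn's Ridge and Lasso estimators. The sketch below uses synthetic data where only two features actually matter (the data and alpha values are illustrative): ridge keeps every coefficient non-zero, while lasso typically drives the irrelevant ones exactly to zero.

```python
import numpy as np
from sklearn.linear_model import Lasso, Ridge

# Synthetic data: 10 features, but only the first two are informative
rng = np.random.default_rng(1)
X = rng.normal(size=(100, 10))
y = 3.0 * X[:, 0] - 2.0 * X[:, 1] + rng.normal(scale=0.5, size=100)

ridge = Ridge(alpha=1.0).fit(X, y)
lasso = Lasso(alpha=0.1).fit(X, y)

print("Ridge non-zero coefficients:", int(np.sum(ridge.coef_ != 0)))  # all 10 kept, just shrunk
print("Lasso non-zero coefficients:", int(np.sum(lasso.coef_ != 0)))  # most irrelevant ones dropped to 0
```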
