Linear Regression
Objectives
• What is machine learning
• Types of data and terminology
• Types of machine learning
• Supervised learning
• Linear regression
• Least squares and gradient descent
• Hands-on: implementing linear regression from scratch
Machine Learning
• Machine learning is the science of making computers learn from data,
without being explicitly programmed, and improve over time in an
autonomous fashion.
• This learning comes from feeding them data in the form of
observations and real-world interactions.
• Machine learning can also be defined as a tool to predict future
events or values using past data.
Types of Data
• Based on values
  • Continuous data (e.g., age: 0-100)
  • Categorical data (e.g., gender: male/female)
• Based on pattern
  • Structured data (e.g., databases)
  • Unstructured data (e.g., audio, video, text)
Types of Data - continued
• Labelled data consists of input-output pairs: for every set of
input features, the output/response/label is present in the
dataset (e.g., images labelled as cat or dog photos).

  {(x1, y1), (x2, y2), (x3, y3), ..., (xn, yn)}

• Unlabelled data has no output/response/label for the input
features (e.g., news articles, tweets, audio).

  {x1, x2, x3, ..., xn}
Types of Data - continued
• Training data: sample data points used to train the machine
learning model.
• Test data: sample data points used to test the performance of the
machine learning model.
Note: for modelling, the original dataset is typically partitioned into
training and test data in a ratio such as 70:30 or 75:25.
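As a sketch of such a split in pure Python (the 70:30 ratio and the fixed seed are illustrative choices, not prescribed by these slides):

```python
import random

def train_test_split(data, train_frac=0.7, seed=0):
    """Shuffle the data and split it into train/test portions (e.g. 70:30)."""
    rng = random.Random(seed)   # fixed seed for reproducibility
    shuffled = data[:]          # copy so the original list is untouched
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * train_frac)
    return shuffled[:cut], shuffled[cut:]

train, test = train_test_split(list(range(10)))
print(len(train), len(test))  # → 7 3
```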
Types of Machine Learning
Machine Learning
• Supervised Learning
  • Regression
  • Classification
• Unsupervised Learning
  • Clustering
  • PCA
• Reinforcement Learning
Supervised Learning
• A class of machine learning that works on externally supplied instances in the
form of predictor attributes and associated target values.
• The model learns from the training data using these target variables as
reference variables.
• Example: a model to predict the resale value of a car based on its mileage,
age, colour, etc.
• The target values are the "correct answers" for the predictor model, which can
be either a regression model or a classification model.
Motivation for learning
• It is assumed that there exists a relationship/association between the input
features and the target variable.
• The relationship can be observed by plotting a scatter plot of the two
variables.
• The strength of the relationship can be quantified by calculating the
correlation between the two variables:

  corr(x, y) = cov(x, y) / sqrt(var(x) · var(y))
             = Σi (xi − x̄)(yi − ȳ) / sqrt( Σi (xi − x̄)² · Σi (yi − ȳ)² )
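A minimal pure-Python sketch of this correlation formula, evaluated on the age/blood-pressure sample used later in these slides:

```python
import math

def correlation(xs, ys):
    """Pearson correlation: cov(x, y) / sqrt(var(x) * var(y))."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / n
    var_x = sum((x - mean_x) ** 2 for x in xs) / n
    var_y = sum((y - mean_y) ** 2 for y in ys) / n
    return cov / math.sqrt(var_x * var_y)

ages = [56, 49, 72, 38, 63, 47]             # X
pressures = [147, 145, 160, 115, 130, 128]  # Y
print(round(correlation(ages, pressures), 3))  # → 0.759
```

A strong positive correlation like this is what motivates fitting a linear model to the pair.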
Linear Regression
• Linear regression is a way to identify a relationship between two or more
variables and use that relationship to predict values of one variable for
given value(s) of the other variable(s).
• Linear regression assumes the relationship between the variables can be
modelled by a linear equation, i.e. the equation of a line:

  y = w0 + w1·X

where y is the dependent (regressed) variable, X is the independent (regressor)
variable, w0 is the intercept, and w1 is the slope.
Multiple Regression
• The last slide showed the linear regression model with one independent and one
dependent variable.
• In the real world a data point has many important attributes, and they need to
be catered for while developing a regression model (many independent variables
and one dependent variable):

  y = w0 + w1·x1 + w2·x2 + w3·x3 + ... + wd·xd
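The multiple-regression prediction above is just an intercept plus a weighted sum of features. A sketch in Python (the weight and feature values are hypothetical, purely for illustration):

```python
def predict(weights, features):
    """Multiple regression prediction: y = w0 + w1*x1 + ... + wd*xd.
    weights = [w0, w1, ..., wd]; features = [x1, ..., xd]."""
    return weights[0] + sum(w * x for w, x in zip(weights[1:], features))

# Hypothetical example: w0 = 2.0, w1 = 0.5, w2 = -1.0; x1 = 4.0, x2 = 3.0.
print(predict([2.0, 0.5, -1.0], [4.0, 3.0]))  # → 1.0
```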
Regression - Problem Formulation
Suppose you are given the following data:

  Age in Years (X) | Blood Pressure (Y)
  -----------------|-------------------
        56         |        147
        49         |        145
        72         |        160
        38         |        115
        63         |        130
        47         |        128

[Scatter plot: Blood Pressure (Y) vs Age in Years (X)]
Linear Regression
• For the given example the linear regression is modelled as:

  BloodPressure (y) = w0 + w1 · AgeInYears (X)

or simply y = w0 + w1·X (the equation of a line), with w0 the intercept on the
Y-axis and w1 the slope of the line.

Blood Pressure: dependent variable
Age in Years: independent variable
Linear Regression - Best Fit Line
• Regression uses a line to show the trend of the distribution.
• There can be many lines that try to fit the data points in the scatter diagram.
• The aim is to find the best fit line.

[Scatter plot: Blood Pressure (Y) vs Age in Years (X)]
What is the Best Fit Line
• The best fit line tries to explain the variance in the given data
(it minimizes the total residual/error).
Linear Regression - Methods to Get the Best Fit Line
• Least squares
• Gradient descent
Linear Regression - Least Squares
Model: Y = w0 + w1·X
Task: estimate the values of w0 and w1.
According to the principle of least squares, the normal equations to solve for
w0 and w1 are:

  Σi Yi = n·w0 + w1 · Σi Xi                  ..........(1)

  Σi Xi·Yi = w0 · Σi Xi + w1 · Σi Xi²        ..........(2)

(all sums run over i = 1, ..., n)
Linear Regression - Least Squares
Dividing equation (1) by n (the number of sample points) we get:

  (1/n) Σi Yi = w0 + w1 · (1/n) Σi Xi

or

  ȳ = w0 + w1·x̄ ..........(3)

So the line of regression always passes through the point (x̄, ȳ).
Linear Regression - Least Squares
Now we know:

  cov(x, y) = (1/n) Σi xi·yi − x̄·ȳ, i.e. (1/n) Σi xi·yi = cov(x, y) + x̄·ȳ ..........(4)

and

  var(x) = (1/n) Σi xi² − x̄²  and  var(y) = (1/n) Σi yi² − ȳ² ..........(5)

Dividing equation (2) by n and using equations (4) and (5) we get:

  cov(x, y) + x̄·ȳ = w0·x̄ + w1·(var(x) + x̄²) ..........(6)
Linear Regression - Least Squares
Now, using the equations

  ȳ = w0 + w1·x̄

and

  cov(x, y) + x̄·ȳ = w0·x̄ + w1·(var(x) + x̄²)

and substituting w0 = ȳ − w1·x̄ into the second, we get:

  w1 = cov(x, y) / var(x)   and   w0 = ȳ − w1·x̄
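Using the age/blood-pressure sample from the earlier slide, a minimal from-scratch sketch of these closed-form formulas:

```python
def least_squares_fit(xs, ys):
    """Closed-form simple linear regression:
    w1 = cov(x, y) / var(x),  w0 = ybar - w1 * xbar."""
    n = len(xs)
    xbar = sum(xs) / n
    ybar = sum(ys) / n
    cov = sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys)) / n
    var_x = sum((x - xbar) ** 2 for x in xs) / n
    w1 = cov / var_x
    w0 = ybar - w1 * xbar
    return w0, w1

ages = [56, 49, 72, 38, 63, 47]
pressures = [147, 145, 160, 115, 130, 128]
w0, w1 = least_squares_fit(ages, pressures)
print(round(w0, 2), round(w1, 2))  # → 82.84 1.01
```

Note the fitted line passes through (x̄, ȳ), as equation (3) requires.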
Performance metric for least-squares regression

  R² = 1 − [ (1/n) Σi (yi − ŷi)² ] / [ (1/n) Σi (yi − ȳ)² ]

  R²_adj = 1 − (1 − R²)(n − 1) / (n − k − 1)

where ŷi is the predicted value, n is the number of samples, and k is the
number of predictors.
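A sketch of both metrics in pure Python; the `preds` values below are the (rounded) predictions of the least-squares line fitted to the age/blood-pressure sample from the earlier slides:

```python
def r_squared(ys, preds):
    """Coefficient of determination: R^2 = 1 - SS_res / SS_tot."""
    ybar = sum(ys) / len(ys)
    ss_res = sum((y - p) ** 2 for y, p in zip(ys, preds))
    ss_tot = sum((y - ybar) ** 2 for y in ys)
    return 1 - ss_res / ss_tot

def adjusted_r_squared(r2, n, k):
    """Adjusted R^2, penalizing the number of predictors k."""
    return 1 - (1 - r2) * (n - 1) / (n - k - 1)

ys    = [147, 145, 160, 115, 130, 128]
preds = [139.35, 132.29, 155.49, 121.19, 146.41, 130.27]
r2 = r_squared(ys, preds)
print(round(r2, 3), round(adjusted_r_squared(r2, n=6, k=1), 3))
```

For simple linear regression, R² equals the square of the correlation computed earlier (0.759² ≈ 0.576).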
Linear Regression - Gradient Descent
Model: Y = w0 + w1·X
Task: estimate the values of w0 and w1.
Define the cost function:

  cost(w0, w1) = (1/n) Σi (yi − ŷi)²

Objective of gradient descent:

  min over (w0, w1) of cost(w0, w1) = (1/n) Σi (yi − (w0 + w1·xi))²
Linear Regression - Gradient Descent
Model: Y = w0 + w1·X
Task: estimate the values of w0 and w1.
The objective

  min over (w0, w1) of cost(w0, w1) = (1/n) Σi (yi − (w0 + w1·xi))²

can be pictured as finding the minimum of a convex cost surface.

[Plot: cost(w0, w1) versus w0, a convex curve with a single minimum]
Linear Regression - Gradient Descent
• Gradient descent works in the following steps:
1. Initialize the parameters to some random values.
2. Calculate the gradient of the cost function w.r.t. the parameters.
3. Update the parameters by moving in the direction opposite to the gradient.
4. Repeat steps 2 and 3 for a fixed number of iterations or until the cost
reaches its minimum value.
Linear Regression - Gradient Descent

  cost(w0, w1) = (1/n) Σi (yi − (w0 + w1·xi))²

Calculating the gradients of the cost function:

  grad_w0 = ∂cost(w0, w1)/∂w0 = (2/n) Σi (yi − (w0 + w1·xi)) · (−1)

  grad_w1 = ∂cost(w0, w1)/∂w1 = (2/n) Σi (yi − (w0 + w1·xi)) · (−xi)

Parameter update:

  w0 = w0 − learning_rate · grad_w0
  w1 = w1 − learning_rate · grad_w1
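The four steps and the gradient formulas above can be sketched in pure Python. The learning rate and iteration count below are illustrative choices tuned for this small, unscaled dataset; zeros are used instead of random initialization for reproducibility:

```python
def gradient_descent(xs, ys, learning_rate=0.0003, iterations=200000):
    """Fit y = w0 + w1*x by gradient descent on the mean squared error."""
    n = len(xs)
    w0, w1 = 0.0, 0.0  # step 1: initialize (zeros here, for reproducibility)
    for _ in range(iterations):
        # step 2: gradients of cost(w0, w1) = (1/n) * sum (y - (w0 + w1*x))^2
        grad_w0 = (2 / n) * sum((y - (w0 + w1 * x)) * (-1) for x, y in zip(xs, ys))
        grad_w1 = (2 / n) * sum((y - (w0 + w1 * x)) * (-x) for x, y in zip(xs, ys))
        # step 3: move opposite to the gradient
        w0 -= learning_rate * grad_w0
        w1 -= learning_rate * grad_w1
    return w0, w1

ages = [56, 49, 72, 38, 63, 47]
pressures = [147, 145, 160, 115, 130, 128]
w0, w1 = gradient_descent(ages, pressures)
print(round(w0, 2), round(w1, 2))  # approaches the least-squares w0 ≈ 82.8, w1 ≈ 1.01
```

Because the intercept direction converges slowly on unscaled features, many iterations (or feature standardization) are needed to match the closed-form solution.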
Performance metric for gradient-based regression
Root Mean Square Error (RMSE) is the standard deviation of the prediction errors:

  RMSE = sqrt( (1/n) Σi (yi − ŷi)² )

Mean Absolute Error (MAE) is the average absolute difference between the
predicted and actual values:

  MAE = (1/n) Σi |yi − ŷi|
Thank you!
Let's see the hands-on...