SIMPLE LINEAR REGRESSION
ANALYSIS
Mr. Biswajit Sahoo
Guest Faculty, P.G. Department of Commerce
Utkal University, Vani Vihar
Introduction
The term “Regression” was introduced by Francis Galton, and Galton's Law of Universal Regression was confirmed by his friend, Karl Pearson. The modern interpretation of regression is quite different from their analysis.
Regression analysis is concerned with the study of the dependence of one variable, the dependent variable, on one or more other variables, the explanatory (independent) variables, with a view to estimating and/or predicting the mean or average value of the former in terms of the known or fixed values of the latter.
Modern Interpretation Of Regression
• The modern interpretation of regression is, however, quite different.
Broadly speaking, we may say Regression analysis is concerned with the
study of the dependence of one variable, the dependent variable, on one
or more other variables, the explanatory variables, with a view to
estimating and/or predicting the (population) mean or average value of the
former in terms of the known or fixed (in repeated sampling) values of the
latter.
Objective
1. The key objective behind regression analysis is the statistical dependence of one
variable, the dependent variable, on one or more other variables, the explanatory
variables.
2. The objective of such analysis is to estimate and/or predict the mean or average
value of the dependent variable on the basis of the known or fixed values of the
explanatory variables.
3. In practice the success of regression analysis depends on the availability of the
appropriate data.
4. In any research, the researcher should clearly state the sources of the data used
in the analysis, their definitions, their methods of collection, and any gaps or
omissions in the data as well as any revisions in the data.
5. The researcher should ensure that the data used are properly gathered and that the
computations and analysis are correct.
Simple Regression Model
• The term “simple” regression means that the dependent variable is related to a single explanatory variable.
• The term “model” is broadly used to represent any phenomenon in a
mathematical framework.
Two-variable or bivariate regression
• This means regression in which the dependent variable is related to a single explanatory variable (the regressor).
• When the mean value of Y depends upon the conditioning variable X, it is called a conditional expected value. Regression analysis is largely concerned with estimating and/or predicting the (population) mean value of the dependent variable on the basis of the known or fixed values of the explanatory variable(s).
In regression analysis, we estimate the mean value, or average value, or expected value of the dependent variable Y based on the known values of the independent variable(s) X. That is, we estimate E(Y|Xi), i.e., the conditional expectation of Y given Xi, as illustrated in the sketch below.
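A minimal Python sketch of this idea, using a small set of hypothetical income–consumption pairs (values assumed for illustration only), groups the observations by X and averages Y within each group:

```python
from collections import defaultdict

# Hypothetical data: weekly family income X and weekly consumption Y
data = [(80, 55), (80, 65), (100, 70), (100, 80), (120, 84), (120, 94)]

groups = defaultdict(list)
for x, y in data:
    groups[x].append(y)

# The conditional expectation E(Y | X = x) is estimated by the mean of the
# Y values observed at each fixed value of X.
for x, ys in sorted(groups.items()):
    print(f"E(Y | X = {x}) = {sum(ys) / len(ys):.1f}")
```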
Suppose the outcome of a process is denoted by a random variable Y, called the dependent (or study) variable, which depends on k independent (or explanatory) variables denoted by X1, X2, ..., Xk. Suppose the behaviour of Y can be explained by a relationship given by
Y = f(X1, X2, ..., Xk, β1, β2, ..., βk) + u
where f is some well-defined function and β1, β2, ..., βk are the parameters which characterize the role and contribution of X1, X2, ..., Xk respectively. The term u reflects the stochastic nature of the relationship between Y and X1, X2, ..., Xk and indicates that such a relationship is not exact. When u = 0, the relationship is called a mathematical model; otherwise it is a statistical model.
The linear model
Consider a simple linear regression model
y = b0 + b1 X + e
where y is termed as the dependent or study variable and X is termed as the independent
or explanatory variable.
The terms b0 and b1 are the parameters of the model.
The parameter b0 is termed the intercept, and the parameter b1 is termed the slope parameter. These parameters are usually called regression coefficients.
The unobservable error component e accounts for the failure of the data to lie on a straight line and represents the difference between the true and observed realizations of y.
There can be several reasons for such a difference, e.g., the effect of omitted variables, variables that may be qualitative, inherent randomness in the observations, etc. We assume that e is an independent and identically distributed random variable with mean zero and constant variance; additionally, we may assume that e is normally distributed.
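A minimal simulation sketch of this model, with assumed values b0 = 2, b1 = 0.5 and normally distributed errors, shows how the error component keeps the observations off the straight line:

```python
import numpy as np

# Illustrative (assumed) parameter values: b0 = 2, b1 = 0.5, error std. dev. = 1
rng = np.random.default_rng(0)
b0, b1 = 2.0, 0.5

X = rng.uniform(0, 10, size=100)              # explanatory variable
e = rng.normal(loc=0.0, scale=1.0, size=100)  # i.i.d. N(0, sigma^2) error component
y = b0 + b1 * X + e                           # observed y does not lie exactly on the line

print(y[:5])
```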
Simple Regression
• The simple regression model can be used to study the relationship between two variables.
• Suppose the scatter plot between x and y looks like Fig. 1.
[Fig. 1: scatter plot of x and y]
OLS: Graphically
• To obtain the best-fitted line, the method of OLS entails taking each vertical distance from the point to the line, squaring it, and then minimizing the total sum of the areas of the squares; see Fig. 2.
[Fig. 2: fitted line with squared vertical distances from each point]
OLS: Graphically
[Fig. 3]
OLS: Mathematically
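Stated here as a sketch in the notation used later in these notes, OLS chooses the estimates α̂ and β̂ of the two-variable model yt = α + βxt + ut so as to minimize the residual sum of squares:
min Σt ût² = Σt (yt − α̂ − β̂xt)²
The resulting estimators are
β̂ = Σt (xt − x̄)(yt − ȳ) / Σt (xt − x̄)²   and   α̂ = ȳ − β̂x̄.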
Estimated Parameters
Use of Parameter Estimates
Change of Scale
Example
• In the table below, estimates from two regressions of annual gross bank credit to industry (y) on the annual call money rate (x) for the period 1990 – 2018 are reported with two different scales of measurement of y; the simulation sketch after the table illustrates the effect.

                    y (in ₹ billion)    y (in ₹ million)
Intercept               1708.635            1708635
Call money rate         -943.589            -943589
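The rescaling effect can be checked with a minimal Python sketch; the series below are simulated stand-ins for the credit data, not the actual 1990 – 2018 figures:

```python
import numpy as np

# Simulated data standing in for the 1990-2018 series (values are illustrative only)
rng = np.random.default_rng(42)
x = rng.uniform(4, 12, size=29)                       # call money rate (%)
y_billion = 1700 - 950 * x + rng.normal(0, 50, 29)    # credit in ₹ billion (hypothetical)
y_million = y_billion * 1000                          # same series expressed in ₹ million

slope_b, intercept_b = np.polyfit(x, y_billion, 1)
slope_m, intercept_m = np.polyfit(x, y_million, 1)

print(intercept_b, slope_b)   # intercept and slope for y in billions
print(intercept_m, slope_m)   # both are exactly 1000 times larger for y in millions
```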
Linearity
• In order to use OLS, a model has to be linear.
• This means that the relationship between x and y must be capable of being expressed diagrammatically using a straight line. More specifically, the model must be linear in the parameters (α and β).
• By ‘linear in the parameters’, it is meant that the parameters are not multiplied together, divided, squared, or cubed, etc.; an example follows below.
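• For example, yt = α + β ln xt + ut is non-linear in the variable xt but linear in the parameters α and β, so it can be estimated by OLS; by contrast, a model such as yt = α + xt^β + ut is non-linear in the parameter β and cannot be estimated by OLS directly.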
Goodness-of-Fit
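• The standard goodness-of-fit measure for this model is the coefficient of determination R². With TSS = Σt (yt − ȳ)² the total sum of squares, ESS = Σt (ŷt − ȳ)² the explained sum of squares, and RSS = Σt ût² the residual sum of squares,
R² = ESS / TSS = 1 − RSS / TSS,   0 ≤ R² ≤ 1,
so R² measures the proportion of the variation in y explained by the regression.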
Regression through the Origin
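• When the intercept is suppressed, the model becomes yt = βxt + ut (a regression through the origin), and the OLS slope estimator in this case is β̂ = Σt xt yt / Σt xt².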
Estimators
• So far we have assumed that we have exact information about
the random variable under discussion, in particular that we
know the probability distribution, in the case of a
discrete random variable, or the probability density
function, in the case of a continuous variable.
• With this information it is possible to work out the population
mean and variance and any other population characteristics in
which we might be interested.
Estimators
• In practice, except for artificially simple random variables such as
the numbers on thrown dice, we do not know the exact probability
distribution or density function.
• It follows that we do not know the population mean or variance.
• However, we would like to obtain an estimate of them or some
other population characteristic.
• This is done by taking a sample of n observations and deriving an estimate of the population characteristic using some appropriate formula. This formula is technically known as an estimator, and the number obtained is known as the estimate.
Estimators
• The estimator is a general rule or formula, whereas the estimate
is a specific number that will vary from sample to sample.
• The estimator of the population mean μ is the sample mean, x̄ = (1/n) Σ xi.
• The estimator of the population variance σ² is the sample variance, s² = Σ (xi − x̄)² / (n − 1).
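A minimal Python sketch, using hypothetical sample values, shows the distinction between the estimator (the formula) and the estimate (the number it yields for a particular sample):

```python
# Illustrative sample of n = 5 observations (hypothetical values)
sample = [4.1, 5.3, 6.0, 4.8, 5.6]
n = len(sample)

# Applying the estimator (the formula) to this sample yields the estimate (the number)
mean_estimate = sum(sample) / n
var_estimate = sum((x - mean_estimate) ** 2 for x in sample) / (n - 1)

print(mean_estimate, var_estimate)
```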
Assumptions of Classical Linear Regression
Classical Linear Regression Model (CLRM)
• In the specification yt = α + βxt + ut, the data for xt are observable, but since yt also depends on ut, it is necessary to be specific about how the ut are generated.
• Certain assumptions are usually made concerning the ut, the unobservable error or disturbance terms.
• Note that no assumptions are made concerning their observable counterparts, the estimated model's residuals.
Assumptions of OLS
1. In the population, the regression model is linear in the
parameters, i.e. y t = α + βxt + ut
2. We have a random sample of size T, {(xt, yt): t = 1, 2, …, T}.
3. The sample outcomes on x, namely, {xt, t = 1, …, T}, are not all
the same values.
4. The error u has an expected value of zero given any value of the
explanatory variable. In other words, E(u|x) = 0.
5. The error u has the same variance given any value of the explanatory variable, i.e. Var(u|x) = σ².
Classical Linear Regression Model (CLRM)
• Alternatively, the model yt = α + βxt + ut is known as the CLRM given that it fulfils the following assumptions.
1. E(ut) = 0            the errors have zero mean
2. Var(ut) = σ² < ∞     the error variance is constant and finite
3. Cov(ui, uj) = 0      the errors are linearly independent of one another
4. Cov(ut, xt) = 0      the regressor is unrelated to the error term
5. ut ~ N(0, σ²)        ut is normally distributed
ASSUMPTIONS OF CLASSICAL REGRESSION MODEL
• ASSUMPTION-1
The regression model is linear in parameters. It may or may not be linear in variables. For example, the equation yt = α + βxt + ut is linear in the parameters as well as in the variables.
• ASSUMPTION-2
The explanatory variable is not correlated with the disturbance term u.
This assumption requires that Σ ut xt = 0.
In other words, the covariance between the error term and the explanatory variable is zero.
This assumption is automatically fulfilled if X is non-stochastic.
It requires that the X values are kept fixed in repeated samples.
• ASSUMPTION-3
The expected value or mean value of the error term u is zero.
In symbols, E(ut|Xt) = 0. It does not mean that all error terms are zero.
It implies that the error terms cancel each other out.
• Given E(u|x) = 0, the population regression function (PRF) is given by E(y|x) = α + βx.
Population Regression Function (PRF)
• This tells us how the average or expected value of y changes with x: a one-unit increase in x changes the expected value of y by the amount β.
• Also, this implies that for any given value of x, the distribution of y is centered about E(y|x).
Sample Regression Function (SRF)
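• The sample regression function is the sample counterpart of the PRF, ŷt = α̂ + β̂xt, where α̂ and β̂ are the estimates computed from a particular sample and ût = yt − ŷt are the residuals.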
• ASSUMPTION-4
The variance of each ut is constant. In symbols, Var(ut) = σ². The conditional distribution of the error term is displayed in Fig. 1.1.
• The corresponding error variance for a specific value of the error term is depicted in Fig. 1.2.
• From the figure you can see that the error variance is constant at all levels of the X variable. This describes the case of ‘homoscedasticity’.
Homoskedasticity
• Homoskedasticity, or the constant variance assumption, states that the error u has the same variance given any value of the explanatory variable. In other words, Var(u|x) = σ².
• The importance of this assumption will be reflected in deriving the properties of the OLS estimators and their variances.
• Since Var(u|x) = E(u²|x) − [E(u|x)]² and E(u|x) = 0, we have E(u²|x) = σ²; σ² is also the unconditional variance of u, i.e. Var(u) = σ².
• σ² is often called the error variance, and σ is the standard deviation of the error term.
Homoskedasticity
• Given the assumption of homoskedasticity,
Var(y|x) = Var(u|x) = σ²
because, for a given value of x, the term α + βx is constant.
• When Var(u|x) depends on x, the error term is said to exhibit heteroskedasticity (or non-constant variance).
• Because Var(u|x) = Var(y|x), heteroskedasticity is present whenever Var(y|x) is a function of x.
• ASSUMPTION-5
There is no correlation between any two error terms: cov(ui, uj) = 0 for i ≠ j. This is the assumption of no autocorrelation.
This assumption implies that the error terms ui are random.
It implies that there is no systematic relationship between any two error terms.
Since any two error terms are assumed to be uncorrelated, any two Y values will also be uncorrelated, i.e., cov(Yi, Yj) = 0.
[Figure: (i) No Autocorrelation (ii) Positive Autocorrelation (iii) Negative Autocorrelation]
• ASSUMPTION-6
The regression model is correctly specified, that is, there is no
specification error in the model.
If a relevant variable is not included, or an irrelevant variable is included, in the regression model, then we commit a model specification error.
For instance, suppose we study the demand for automobiles. If we include only the price of automobiles and do not include the income of the consumer, then there is a specification error.
The Simple Regression Model under Homoskedasticity
[Figure: E(y|x) = α + βx]
Example of Heteroskedastic Variance
[Figure: E(y|x) = α + βx]
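The contrast between the two figures can be reproduced with a minimal simulation sketch, using assumed values α = 1 and β = 2:

```python
import numpy as np

# Illustrative parameter values: alpha = 1, beta = 2
rng = np.random.default_rng(1)
x = np.linspace(1, 10, 200)

u_homo = rng.normal(0, 1.0, size=x.size)         # Var(u|x) = sigma^2 at every x
u_hetero = rng.normal(0, 0.4 * x, size=x.size)   # Var(u|x) increases with x

y_homo = 1 + 2 * x + u_homo        # spread around E(y|x) = 1 + 2x is constant
y_hetero = 1 + 2 * x + u_hetero    # spread around E(y|x) fans out as x grows

print(np.std(u_homo[x < 5]), np.std(u_homo[x >= 5]))      # roughly equal
print(np.std(u_hetero[x < 5]), np.std(u_hetero[x >= 5]))  # second is clearly larger
```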