Simple Linear Regression
Abhishek Dureja
Teaching Assistant: Anshika Arora
Plaksha University
Linear Regression
Conditional Expectations Function
▶ As econometricians, we try to summarize and explain various
economic relationships and outcomes
▶ In particular, given 2 variables Y and X from population, we are
interested in:
i Explaining Y in terms of X
ii Studying how Y varies with changes in X
▶ We are eventually trying to use variation in X to explain variation
in Y
2 / 31
Linear Regression
Conditional Expectations Function
▶ Example I: Y is hourly wage and X is years of education
=⇒ We are trying to explain how wage rate changes with years
of education
▶ Example II: Y is consumption and X is income
=⇒ We are trying to explain how consumption varies with income
3 / 31
Linear Regression
Conditional Expectations Function
▶ We want to explain variation in Y using variation in X
▶ The important question is: How can we do this?
▶ The conditional expectation function (CEF) provides a very
good means to summarize relationships between 2 variables
▶ Let us understand the idea and use of a conditional expectations
function in estimating the relationship between Y and X using
example II
4 / 31
Regression Analysis
Conditional Expectation Function
▶ Assume a population (not sample) of 60 families
▶ Information on weekly consumption and income recorded
▶ We want to understand how consumption (Y) varies with income
(X)
▶ To understand how consumption (Y) varies with income (X) we
will make use of the conditional expectations function
5 / 31
Regression Analysis
Conditional Expectation Function
6 / 31
Regression Analysis
Conditional Expectation Function
▶ The following gives you the scatter plot of the population data
[Figure: Scatterplot of Weekly Income ($, x-axis, 75–250) vs. Weekly
Consumption Expenditure ($, y-axis, 60–180)]
7 / 31
Linear Regression
Conditional Expectations Function
▶ The important question is how do we explain Y in terms of X
▶ We know one thing for sure:
→ We need to explain Yi in terms of Xi such that most of
the variation in Yi is explained by variation in Xi
(i is indexing individual i)
=⇒ This is like a fitting problem
▶ Which function of Xi best fits the scatterplot so as to explain
variation in Yi
▶ Let m(Xi ) be any (arbitrary) function of Xi
(More generally, we can also write m(X) as well)
8 / 31
Linear Regression
Conditional Expectations Function
▶ We need to find m(Xi ) such that we are able to explain most
variation in Yi
▶ Mathematically, this is the problem of minimising the mean squared
error
arg min_{m(Xi)} E[(Yi − m(Xi))²]
▶ For any given Xi, m(Xi) will give us the predicted value of Yi, i.e. Ŷi
▶ But the actual value is Yi
=⇒ Yi − m(Xi) is the prediction error
=⇒ E[(Yi − m(Xi))²] is the mean squared error (MSE)
▶ Minimising MSE is our decision criterion
9 / 31
Linear Regression
Conditional Expectations Function
arg min_{m(Xi)} E[(Yi − m(Xi))²]
▶ Hence, we need to find a functional form of m(Xi ) that minimises
the mean squared error
▶ The functional form, i.e. m(Xi ), that minimises the mean
squared error is nothing but the conditional expectations
function (CEF)
▶ Mathematically,
E[Yi | Xi] = arg min_{m(Xi)} E[(Yi − m(Xi))²]
10 / 31
Linear Regression
Conditional Expectations Function
CEF-Prediction Property
▶ Let m(Xi ) be any function of Xi
▶ The Conditional Expectation Function (CEF) solves
E[Yi | Xi] = arg min_{m(Xi)} E[(Yi − m(Xi))²]
=⇒ CEF is the minimum mean square error (MMSE)
predictor of Yi given Xi
▶ E[(Yi − m(Xi))²] is minimized when m(Xi) = E[Yi | Xi]
▶ We will not prove this result here
11 / 31
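The CEF-prediction property above can be checked numerically. The sketch below uses a made-up toy population (not from the slides): predicting Y with the conditional mean E[Y|X] attains a smaller mean squared error than any other per-group prediction rule.

```python
# Toy population: X takes two values; Y varies within each group.
data = [(1, 60), (1, 70), (1, 80), (2, 100), (2, 120), (2, 140)]

def mse(predict):
    """Mean squared error of a prediction rule over the population."""
    return sum((y - predict(x)) ** 2 for x, y in data) / len(data)

# Conditional means: E[Y|X=1] = 70, E[Y|X=2] = 120
cef = {1: 70.0, 2: 120.0}
mse_cef = mse(lambda x: cef[x])

# Perturbing either group's prediction never lowers the MSE
for d1 in (-5.0, 0.0, 5.0):
    for d2 in (-5.0, 0.0, 5.0):
        shifted = lambda x, d1=d1, d2=d2: cef[x] + (d1 if x == 1 else d2)
        assert mse(shifted) >= mse_cef
print(round(mse_cef, 2))
```

Any deviation from the conditional means strictly increases the MSE, which is exactly the minimum-MSE property stated above.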
Linear Regression
Conditional Expectations Function
▶ Now we theoretically know that CEF has a very strong predictive
power
▶ CEF can help us to summarise relationships between 2
variables: Y and X
▶ Let us now return to example II and see how well CEF does in
summarising the consumption-income relationship
12 / 31
Regression Analysis
Conditional Expectation Function
▶ Since the conditional expectation is the mean value of Y (the
dependent variable) given X (the independent variable)
=⇒ Let us divide the population into 10 income groups
=⇒ We are dividing the population based on the value of
independent variables
▶ Let us now compute the mean consumption conditional on each
value of income
13 / 31
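The grouping step described above can be sketched in a few lines. The numbers here are illustrative, not the slides' actual 60-family table: we group the population by income X and compute the conditional mean of consumption Y within each group, which gives the values of the CEF.

```python
# Hypothetical (income, consumption) pairs for a small population
population = [
    (80, 55), (80, 60), (80, 65),     # families with weekly income $80
    (100, 65), (100, 70), (100, 75),  # families with weekly income $100
    (120, 79), (120, 84), (120, 89),  # families with weekly income $120
]

# Collect consumption values for each income level
groups = {}
for income, consumption in population:
    groups.setdefault(income, []).append(consumption)

# The CEF: one mean consumption value per income level
cef = {x: sum(ys) / len(ys) for x, ys in groups.items()}
print(cef)
```

Each entry of `cef` is E[Y | X = x] for one income level, i.e. one point of the conditional expectation function.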
Regression Analysis
Conditional Expectation Function
▶ The last row provides the conditional mean of Y
14 / 31
Regression Analysis
Conditional Expectation Function
15 / 31
Linear Regression
Conditional Expectations Function
CEF for years of schooling and wage rate
▶ The following figure plots the CEF for wage and years of education
16 / 31
Linear Regression and Causality
Economic Relationships and the Conditional Expectation Function
▶ The figure plots the CEF of log weekly wages given schooling for
a sample of middle-aged men
(i.e. men who have completed their education)
▶ The distribution of earnings is also plotted for a few key values: 4,
8, 12, and 16 years of schooling
▶ The CEF in the figure reflects an important fact
▶ Despite enormous variation in individual circumstances, people
with more schooling generally earn more, on average
=⇒ CEF is able to summarize the relationship between earnings and
years of schooling
17 / 31
Linear Regression and Causality
Economic Relationships and the Conditional Expectation Function
▶ The properties of CEF are central to the linear regression
The CEF-Decomposition Property
▶ The CEF-Decomposition Property states that Yi can be decomposed
as
Yi = E [Yi |Xi ] + ϵi
where ϵi is mean independent of Xi, i.e. E[ϵi | Xi] = 0
=⇒ ϵi is uncorrelated with any function of Xi
18 / 31
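The decomposition property can be verified directly on a toy population (made-up numbers, not from the slides): the residual e_i = Y_i − E[Y|X_i] averages to zero within every X group, which is the mean-independence condition E[e|X] = 0.

```python
# Toy population of (X, Y) pairs
data = [(1, 60), (1, 70), (1, 80), (2, 100), (2, 120), (2, 140)]

# Build the CEF: the mean of Y within each X group
groups = {}
for x, y in data:
    groups.setdefault(x, []).append(y)
cef = {x: sum(ys) / len(ys) for x, ys in groups.items()}

# The residual Y - E[Y|X] has mean zero conditional on every value of X
for x, ys in groups.items():
    resid_mean = sum(y - cef[x] for y in ys) / len(ys)
    assert abs(resid_mean) < 1e-12  # E[e | X = x] = 0 for each x
print("E[e|X] = 0 holds in this toy population")
```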
Regression Analysis
Economic Relationships and the Conditional Expectation Function
▶ Given Xi , E [Y |Xi ] is fixed; but Yi varies
▶ Yi can be written as:
Yi = E [Y |Xi ] + ϵi
▶ E [Y |Xi ] is the mean value of Y given Xi
▶ ϵi is stochastic disturbance or stochastic error term
▶ ϵi represents the effect of all excluded/omitted explanatory variables
that affect Y
▶ For many reasons, it is not possible to include these variables:
i Unavailability of data
ii Peripheral variables: Joint influence of these variables may be very
small
iii Intrinsic randomness in human behavior
19 / 31
Linear Regression
Population Regression Function
▶ To estimate the relationship between Y and X
=⇒ We can use the CEF
▶ The next important question is: How can we estimate the CEF?
▶ We cannot simply join all the conditional means by hand
▶ We need a way to estimate the CEF
20 / 31
Linear Regression
Population Regression Function
▶ The important question to ask is:
→ How to estimate the CEF?
▶ If the joint distribution of (Y,X) is bivariate normal
=⇒ CEF is linear in X (independent variable)
=⇒ Conditional expectation of Y can be written as a linear
function of X
=⇒ E [Y |X ] = β0 + β1 X
21 / 31
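A small simulation illustrates the linearity of the CEF under joint normality. The model here is an assumption for illustration: Y = 2 + 0.5·X + u with X and u independent standard normals, which makes (X, Y) jointly normal, so the conditional means of Y should fall on the line E[Y|X] = 2 + 0.5·X.

```python
import random

# Simulate a jointly normal (X, Y) population:
# X ~ N(0,1), u ~ N(0,1), Y = 2 + 0.5*X + u
random.seed(42)
pairs = []
for _ in range(200_000):
    x = random.gauss(0, 1)
    pairs.append((x, 2.0 + 0.5 * x + random.gauss(0, 1)))

def cond_mean_near(x0, width=0.1):
    """Average of Y over observations with X within `width` of x0."""
    ys = [y for x, y in pairs if abs(x - x0) < width]
    return sum(ys) / len(ys)

# The binned conditional means track the straight line 2 + 0.5*X
for x0 in (-1.0, 0.0, 1.0):
    assert abs(cond_mean_near(x0) - (2.0 + 0.5 * x0)) < 0.05
print("conditional means lie close to the line 2 + 0.5*X")
```

With a nonlinear true relationship (say Y depending on X²) the binned means would curve away from any straight line; joint normality is what guarantees the linear shape.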
Linear Regression
Population Regression Function
=⇒ E [Y |X ] = β0 + β1 X
▶ The assumption that the joint distribution of (Y,X) is normal may
be strong
▶ The above relationship will still hold if the conditional distribution of
Y |X is normal
▶ What does the normality of Y | X look like?
22 / 31
Linear Regression
Normality of Y |Xi
23 / 31
Linear Regression
Population Regression Function
▶ If the conditional distribution of Y |Xi follows a Normal distribution,
then CEF can be written as
E [Y |Xi ] = β0 + β1 Xi
▶ Using the CEF decomposition property:
Yi = E [Y |Xi ] + ui
=⇒ Yi = β0 + β1 Xi + ui
▶ More generally:
=⇒ Y = β0 + β1 X + u
24 / 31
Linear Regression
Conditional Expectations Function
=⇒ Y = β0 + β1 X + u
▶ The above relationship is known as the population regression
function (PRF)
▶ The true relationship between Y and X in the population is
known as the population regression function (PRF)
▶ PRF represents the true functional form relationship between Y and
X
25 / 31
Linear Regression
Conditional Expectations Function
▶ The PRF is given as
=⇒ Y = β0 + β1 X + u
▶ PRF represents the true relationship between Y and X in the
population
▶ We have assumed that the PRF (true relationship) between Y and
X is linear in parameters (β0 and β1 ) and variable X
26 / 31
Linear Regression
Conditional Expectations Function
=⇒ Y = β0 + β1 X + u
▶ Even if Y | X is not normally distributed, we can still estimate the
CEF using the above linear relationship
▶ The reason is that the linear PRF can then be seen as the best
linear approximation to the CEF
27 / 31
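This best-linear-approximation idea can be checked with a hypothetical nonlinear CEF (not taken from the slides): here E[Y|X] = √X, and the OLS/population-regression coefficients give the line closest to the CEF values in the mean-squared sense.

```python
# Points of a nonlinear CEF: E[Y|X] = sqrt(X)
xs = [1.0, 4.0, 9.0, 16.0, 25.0]
ys = [x ** 0.5 for x in xs]  # CEF values: 1, 2, 3, 4, 5

# OLS formulas for slope and intercept
n = len(xs)
xbar, ybar = sum(xs) / n, sum(ys) / n
b1 = sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys)) \
     / sum((x - xbar) ** 2 for x in xs)
b0 = ybar - b1 * xbar

def line_mse(a, b):
    """Mean squared gap between the line a + b*x and the CEF values."""
    return sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys)) / n

# No perturbed line approximates the CEF better than (b0, b1)
best = line_mse(b0, b1)
for da in (-0.1, 0.1):
    for db in (-0.02, 0.02):
        assert line_mse(b0 + da, b1 + db) > best
print(round(b0, 3), round(b1, 3))
```

Even though no straight line can match √X exactly, the regression line is the one with the smallest mean squared gap, which is the sense in which the linear PRF approximates a nonlinear CEF.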
Linear Regression
Conditional Expectations Function
PRF estimation of CEF
28 / 31
Linear Regression
Conditional Expectations Function
▶ The dark line represents the CEF, which captures the relationship
between weekly earnings and years of education
▶ The dotted line represents the population regression line
▶ The regression line fits the somewhat bumpy and nonlinear
CEF
▶ Even though the regression line is a model for Yi
=⇒ the regression line fits the nonlinear CEF as if we were
estimating a model for the CEF, i.e. E[Yi | Xi]
=⇒ We will use linear regression to model relationship
between Y and X
=⇒ Linear regression is used to model the conditional
expectations function
29 / 31
Linear Regression
Population Regression Function
▶ Now, given our discussion, we know two things:
i CEF, i.e. E[Y | X], is the best predictor of Y as it minimises the
mean squared error (it is the MMSE predictor)
ii True relationship between Y and X is given by the PRF:
Y = β0 + β1 X + u
30 / 31
Linear Regression
Conditional Expectations Function
▶ We have assumed that the true relationship between Y and X in the
population is given as follows
Y = β0 + β1 X + u
where
i Y is the dependent variable
ii X is the independent variable
iii u is called the error term or disturbance in the relationship
▶ u represents factors other than X that affect Y
31 / 31