Multiple Linear Regression (MLR): Simplified & Structured Notes
1. Introduction to Multiple Linear Regression (MLR)
Definition and Purpose
• MLR is used to study the impact of multiple explanatory variables on a single dependent
variable.
• It extends Simple Linear Regression (SLR), which involves only one explanatory variable.
• MLR helps analyze the individual and combined effects of multiple variables on the outcome,
including how the effects of correlated variables overlap.
Comparison: MLR vs. SLR
• SLR Equation: Y = β0 + β1X1 + ε
• One explanatory variable.
• Other variables' effects are absorbed into the error term (ε).
• MLR Equation: Y = β0 + β1X1 + β2X2 + ... + βkXk + ε
• Multiple explanatory variables.
• Each variable's effect is estimated explicitly.
MLR Model Components
• Y: Response variable
• X1, X2, ..., Xk: Explanatory variables
• β0, β1, ..., βk: Coefficients
• ε: Error term
Error Term Assumptions
• Errors are independent
• Errors have equal variance (homoscedasticity)
• Errors are normally distributed
• E[ε] = 0
Expected Value of Y
• E[Y | X1, ..., Xk] = β0 + β1X1 + β2X2 + ... + βkXk
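A minimal sketch of estimating the model in Python (the data is invented for illustration; np.linalg.lstsq computes the least-squares estimates of the β's):

```python
import numpy as np

# Invented data: n = 6 observations, k = 2 explanatory variables
X1 = np.array([2.0, 4.0, 5.0, 7.0, 8.0, 9.0])
X2 = np.array([1.0, 3.0, 2.0, 5.0, 6.0, 8.0])
Y  = np.array([3.1, 6.0, 6.2, 9.8, 11.1, 13.0])

# Design matrix: a column of 1s for the intercept β0, then X1 and X2
X = np.column_stack([np.ones_like(X1), X1, X2])

# Least-squares estimates (b0, b1, b2) of (β0, β1, β2)
beta, *_ = np.linalg.lstsq(X, Y, rcond=None)
Y_hat = X @ beta          # fitted values, i.e. estimated E[Y | X1, X2]
resid = Y - Y_hat         # residuals, the estimates of the error term ε
print("b0, b1, b2 =", beta)
```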
2. Key Concepts in MLR
Adjusted R-squared (R̄²)
• Definition: Adjusted R² corrects R² for the number of predictors (k) relative to the sample size (n):
R̄² = 1 − (1 − R²)(n − 1)/(n − k − 1)
• Purpose: Prevents misleading increases in R² by penalizing unnecessary variables (R² never falls when a variable is added; R̄² can).
• Key Points:
• Adjusted R² is always less than or equal to R² (and can even be negative).
• Higher Adjusted R² = Better model.
Standard Error (Se)
• Definition: Estimate of the population standard deviation of the error terms (σε):
Se = √(SSE / (n − k − 1))
• Smaller Se = Better model.
• Se falls when an added variable genuinely improves the fit; it can rise when a useless variable is added, because the degrees of freedom (n − k − 1) shrink faster than SSE.
• R̄² and Se² move in exactly opposite directions: R̄² = 1 − Se²/s_Y², where s_Y² is the sample variance of Y.
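A minimal sketch of these formulas in Python, on synthetic data, showing that adding a useless variable raises R² while R̄² and Se typically worsen:

```python
import numpy as np

def fit_stats(X, Y):
    """Return (R2, adjusted R2, Se) for an OLS fit of Y on the columns of X."""
    n, p = X.shape                      # p columns include the intercept, so n - p = n - k - 1
    beta, *_ = np.linalg.lstsq(X, Y, rcond=None)
    sse = np.sum((Y - X @ beta) ** 2)   # sum of squared errors
    sst = np.sum((Y - Y.mean()) ** 2)   # total sum of squares
    r2 = 1 - sse / sst
    adj_r2 = 1 - (1 - r2) * (n - 1) / (n - p)
    se = np.sqrt(sse / (n - p))         # standard error of the regression
    return r2, adj_r2, se

rng = np.random.default_rng(0)
n = 30
X1 = rng.normal(size=n)
noise = rng.normal(size=n)                            # a variable unrelated to Y
Y = 2 + 1.5 * X1 + rng.normal(scale=2.0, size=n)

base = np.column_stack([np.ones(n), X1])
print(fit_stats(base, Y))                             # (R2, adj R2, Se)
print(fit_stats(np.column_stack([base, noise]), Y))   # R2 never falls; adj R2 and Se typically worsen
```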
Coefficient of Correlation (R)
• In SLR: R = correlation between X and Y.
• In MLR: R = correlation between observed Y and predicted Y (Ŷ).
Marginal vs. Partial Slopes
• Marginal Slope (SLR): Total effect of a variable on Y, ignoring other variables.
• Partial Slope (MLR): Effect of a variable on Y holding other variables constant.
• Marginal and partial slopes coincide only when the explanatory variables are uncorrelated (rare in practice).
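A simulation sketch of the difference (synthetic data; the last line verifies the exact identity marginal = partial1 + a12 × partial2, where a12 is the slope of X2 regressed on X1):

```python
import numpy as np

rng = np.random.default_rng(1)
n = 500
X1 = rng.normal(size=n)
X2 = 0.8 * X1 + rng.normal(scale=0.6, size=n)   # X2 correlated with X1
Y = 1.0 + 0.5 * X1 + 0.7 * X2 + rng.normal(size=n)

# Marginal slope: SLR of Y on X1 alone (absorbs X2's effect)
marginal = np.polyfit(X1, Y, 1)[0]

# Partial slopes: MLR of Y on X1 and X2 together
X = np.column_stack([np.ones(n), X1, X2])
_, partial_b1, partial_b2 = np.linalg.lstsq(X, Y, rcond=None)[0]

a12 = np.polyfit(X1, X2, 1)[0]        # slope of X2 on X1
print(marginal)                       # ≈ 0.5 + 0.8 * 0.7 ≈ 1.06, not 0.5
print(partial_b1)                     # ≈ 0.5, the effect holding X2 fixed
print(partial_b1 + a12 * partial_b2)  # equals the marginal slope exactly
```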
3. Collinearity / Multicollinearity
• Definition: High correlation among explanatory variables.
• Effect: Makes MLR results hard to interpret.
• Tools: Path diagram, Variance Inflation Factor (VIF).
4. Path Diagram: Direct and Indirect Effects
• Visualizes relationships among explanatory variables and with Y.
• Direct Effect: From X1 to Y (partial slope).
• Indirect Effect: From X1 → X2 → Y.
• Total Effect = Direct Effect + Indirect Effect
• Total Effect = Marginal Slope from SLR (an exact identity for least-squares fits; small differences in worked examples are rounding).
5. Example: CGPA Prediction (Business School Admissions)
Dataset
• 15 Students
• Y: CGPA
• X1: Entrance Exam Score (0-10)
• X2: Interview Score (0-10)
Correlations
• CGPA and Entrance: 0.74
• CGPA and Interview: 0.76
• Entrance and Interview: 0.54 → Sign of multicollinearity
SLR: X1 on Y
• R: 0.74, R²: 0.55, Se: 0.785
• Coefficient: 0.72 (Marginal slope)
• p-value: 0.001 → Significant
SLR: X2 on Y
• R: 0.763, R²: 0.58, Se: 0.741
• Coefficient: 0.934 (Marginal slope)
• p-value: 0.0001 → Significant
MLR: X1 and X2 on Y
• Multiple R: 0.86, R²: 0.74, Adjusted R²: 0.69, Se: 0.628
• p-value: 0.0003 → Model is significant
Coefficients (Partial Slopes):
• Intercept: -0.7
• X1: 0.455 (p = 0.019, CI: 0.10 to 0.81)
• X2: 0.622 (p = 0.010, CI: 0.15 to 1.08)
Regression Equation: CGPA = -0.7 + 0.455(Entrance) + 0.622(Interview)
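For example, a hypothetical applicant with Entrance = 8 and Interview = 7 gets a predicted CGPA of −0.7 + 0.455 × 8 + 0.622 × 7 ≈ 7.29.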
Path Diagram Quantification
• X1 → X2: Coefficient = 0.42
• Indirect Effect (X1): 0.42 * 0.622 = 0.26
• Total Effect (X1): 0.455 + 0.26 = 0.715 ≈ 0.72 (Marginal)
• X2 → X1: Coefficient = 0.68
• Indirect Effect (X2): 0.68 * 0.455 = 0.31
• Total Effect (X2): 0.622 + 0.31 = 0.932 ≈ 0.934 (Marginal)
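The same arithmetic, as a quick script using the coefficients reported above:

```python
# Path-diagram decomposition for X1 (Entrance) in the CGPA example
direct_x1  = 0.455          # partial slope of Entrance in the MLR
partial_x2 = 0.622          # partial slope of Interview in the MLR
a_x1_to_x2 = 0.42           # slope of Interview regressed on Entrance

indirect_x1 = a_x1_to_x2 * partial_x2    # 0.26
total_x1 = direct_x1 + indirect_x1       # 0.72 ≈ marginal slope from the SLR
print(indirect_x1, total_x1)
# The X2 direction works the same way: 0.622 + 0.68 * 0.455 ≈ 0.93
```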
6. Variance Inflation Factor (VIF)
Definition
• Measures how much of the variance in Xi is explained by the other explanatory variables.
• Formula: VIF(Xi) = 1 / (1 − Ri²)
• Ri²: R-squared from the regression of Xi on all the other explanatory variables.
Interpretation
• VIF = 1: No collinearity
• VIF > 1: Some collinearity is present; common rules of thumb treat VIF above 5 (or 10) as a serious problem.
Impact on Standard Error
• SE(bi) under collinearity = SE(bi) without collinearity × √VIF(Xi)
• Higher VIF → Larger SE → Smaller t-statistic → Larger p-value
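A sketch of the auxiliary-regression computation on synthetic data (statsmodels also ships this as variance_inflation_factor in statsmodels.stats.outliers_influence):

```python
import numpy as np

def vif(X, i):
    """VIF of column i of X, via the auxiliary regression of X[:, i] on the rest."""
    others = np.delete(X, i, axis=1)
    A = np.column_stack([np.ones(len(X)), others])   # add an intercept
    beta, *_ = np.linalg.lstsq(A, X[:, i], rcond=None)
    resid = X[:, i] - A @ beta
    r2_i = 1 - resid @ resid / np.sum((X[:, i] - X[:, i].mean()) ** 2)
    return 1 / (1 - r2_i)

rng = np.random.default_rng(2)
x1 = rng.normal(size=100)
x2 = 0.54 * x1 + rng.normal(scale=0.85, size=100)    # roughly 0.54 correlation with x1
X = np.column_stack([x1, x2])
print(vif(X, 0))   # ≈ 1 / (1 − 0.54²) ≈ 1.4 when the sample correlation is near 0.54
```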
CGPA Example
• Correlation X1 & X2: 0.54
• R² from auxiliary regression: 0.29 (= 0.54², since there is only one other predictor) → VIF = 1/(1 − 0.29) = 1.41
• √1.41 ≈ 1.19 → the slope SEs are inflated by about 19%
7. Case Study: Apartment Price Prediction
Variables
• Y: Price
• X1: Area (sq. ft.)
• X2: Bedrooms
• X3: Parking Lots
• Data: 20 apartments
SLR Results
• All variables: Significant individually (p < 0.05)
• Area marginal slope = 0.32
MLR Results
• Multiple R = 0.7, R² = 0.49, Model p-value = 0.01
But:
• Area (X1):
• Partial slope ≈ 0.05
• p = 0.7 → Not significant
• CI includes 0
• Parking Lots (X3):
• p = 0.11 → Not significant
• CI includes 0
Conclusion: Although each variable is significant on its own in SLR, Area and Parking Lots become
insignificant in MLR because multicollinearity inflates their standard errors.
VIF Values
• Area: 1.53
• Bedrooms: 1.34
• Parking Lots: 1.23
8. Signs of Multicollinearity
• R² increases only slightly with more variables
• Marginal vs. Partial slopes differ drastically
• Strong overall F-statistic, but weak individual t-tests
• Partial slope SE > Marginal slope SE
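A simulation sketch of the "strong F, weak t" symptom (synthetic data; assumes the statsmodels package):

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(3)
n = 40
x1 = rng.normal(size=n)
x2 = x1 + rng.normal(scale=0.1, size=n)   # nearly collinear with x1
y = 1 + x1 + x2 + rng.normal(size=n)

X = sm.add_constant(np.column_stack([x1, x2]))
fit = sm.OLS(y, X).fit()
print(fit.f_pvalue)   # overall F-test: the model as a whole is highly significant
print(fit.pvalues)    # individual t-tests: x1 and x2 may each look insignificant
```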
9. Remedies for Multicollinearity
1. Remove Redundant Variables
• Drop variables that add little unique value.
2. Re-express Variables
• Combine correlated variables into one (e.g., economic status); a minimal sketch follows this list.
3. Do Nothing (if variables are still significant)
• If p-values are low and estimates are stable, collinearity might be acceptable.
• Example: In the CGPA model, both variables had significant partial slopes despite the correlation.
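A hypothetical sketch of remedy 2, re-expressing two correlated scores as a single composite (all names and numbers are invented for illustration):

```python
import numpy as np

rng = np.random.default_rng(4)
n = 50
entrance  = rng.normal(6, 1.5, size=n)
interview = 0.5 * entrance + rng.normal(3, 1.0, size=n)   # correlated with entrance
cgpa = 0.4 * entrance + 0.5 * interview + rng.normal(scale=0.7, size=n)

# Replace the two correlated scores with one combined "admission score"
composite = (entrance + interview) / 2
X = np.column_stack([np.ones(n), composite])
beta, *_ = np.linalg.lstsq(X, cgpa, rcond=None)
print(beta)   # one stable slope on the composite; collinearity is gone by construction
```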