Basic Business Statistics
Multiple Regression Model Building
Linear vs. Nonlinear Fit
(Figure: two scatter plots with fitted curves and their residual plots. A linear fit to nonlinear data does not give random residuals; a nonlinear fit gives random residuals.)
Nonlinear Relationships
The relationship between the dependent variable and an independent variable may not be linear
Review the scatter plot to check for nonlinear relationships
Example: Quadratic model
Yi = β0 + β1X1i + β2X1i² + εi
The second independent variable is the square of the first variable
Quadratic Regression Model
Model form:
Yi = β0 + β1X1i + β2X1i² + εi
where:
β0 = Y intercept
β1 = regression coefficient for linear effect of X on Y
β2 = regression coefficient for quadratic effect on Y
εi = random error in Y for observation i
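As a minimal sketch, this model can be fit by ordinary least squares with the squared term added as an extra column of the design matrix. A Python/numpy example (the data below are hypothetical, for illustration only):

```python
import numpy as np

# Hypothetical data, for illustration only
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0])
y = np.array([2.1, 3.9, 7.2, 11.8, 18.1, 26.3, 35.9, 47.6])

# Design matrix: intercept, linear term X, quadratic term X^2
X = np.column_stack([np.ones_like(x), x, x**2])

# Least-squares estimates of b0, b1, b2
b, *_ = np.linalg.lstsq(X, y, rcond=None)
print(f"b0 = {b[0]:.3f}, b1 = {b[1]:.3f}, b2 = {b[2]:.3f}")
```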
Quadratic Regression Model
Yi = β0 + β1X1i + β2X1i² + εi
Quadratic models may be considered when the scatter plot takes on one of the following shapes:

(Figure: four scatter-plot shapes, one for each sign combination: β1 > 0, β2 > 0; β1 > 0, β2 < 0; β1 < 0, β2 > 0; β1 < 0, β2 < 0)
β1 = the coefficient of the linear term
β2 = the coefficient of the squared term
Testing for Significance: Quadratic Effect
Testing the Quadratic Effect
Compare the quadratic regression equation
Ŷi = b0 + b1X1i + b2X1i²
with the linear regression equation
Ŷi = b0 + b1X1i
Testing for Significance: Quadratic Effect
(continued)
Testing the Quadratic Effect
Consider the quadratic regression equation
Ŷi = b0 + b1X1i + b2X1i²
Hypotheses
H0: β2 = 0 (The quadratic term does not improve the model)
H1: β2 ≠ 0 (The quadratic term improves the model)
Testing for Significance: Quadratic Effect
(continued)
Testing the Quadratic Effect
Hypotheses
H0: β2 = 0 (The quadratic term does not improve the model)
H1: β2 ≠ 0 (The quadratic term improves the model)
The test statistic is

tSTAT = (b2 − β2) / Sb2

with d.f. = n − 3

where:
b2 = squared-term slope coefficient
β2 = hypothesized slope (zero)
Sb2 = standard error of the slope
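A sketch of this test in Python (scipy assumed available); the coefficient, standard error, and sample size below are placeholders standing in for the output of a fitted quadratic model:

```python
from scipy import stats

# Placeholder values standing in for fitted-model output
b2 = 0.245     # estimated squared-term coefficient
s_b2 = 0.033   # standard error of b2
n = 14         # number of observations

# t statistic for H0: beta2 = 0, with n - 3 degrees of freedom
t_stat = (b2 - 0) / s_b2
p_value = 2 * stats.t.sf(abs(t_stat), df=n - 3)
print(f"tSTAT = {t_stat:.2f}, p-value = {p_value:.4g}")
```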
Testing for Significance: Quadratic Effect
(continued)
Testing the Quadratic Effect
Compare the adjusted r2 from the simple regression model to the adjusted r2 from the quadratic model
If the adjusted r2 from the quadratic model is larger than the adjusted r2 from the simple model, the quadratic model is likely the better model
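A sketch of this comparison, using the standard formula adjusted r² = 1 − (1 − r²)(n − 1)/(n − k − 1), where k is the number of independent variables; the r² inputs below are placeholders:

```python
def adjusted_r2(r2: float, n: int, k: int) -> float:
    """Adjusted r^2 for n observations and k independent variables."""
    return 1 - (1 - r2) * (n - 1) / (n - k - 1)

n = 14                # placeholder sample size
r2_simple = 0.969     # placeholder: simple model, k = 1
r2_quadratic = 0.995  # placeholder: quadratic model, k = 2

print(adjusted_r2(r2_simple, n, 1))     # simple linear model
print(adjusted_r2(r2_quadratic, n, 2))  # quadratic model
```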
Example: Quadratic Model
Filter purity increases as filter time increases:

Purity   Time
3        1
7        2
8        3
15       5
22       7
33       8
40       10
54       12
67       13
70       14
78       15
85       15
87       16
99       17

(Figure: scatter plot of Purity vs. Time)
Example: Quadratic Model
(continued)
Simple regression results:
Ŷ = -11.283 + 5.985 Time

            Coefficients   Standard Error   t Stat     P-value
Intercept   -11.28267      3.46805          -3.25332   0.00691
Time        5.98520        0.30966          19.32819   2.078E-10

Regression Statistics
R Square            0.96888
Adjusted R Square   0.96628
Standard Error      6.15997

The t statistic and r2 are both high, but the residuals are not random:

(Figure: Time residual plot showing a curved, non-random pattern)
Example: Quadratic Model in Excel & Minitab
(continued)
Quadratic regression results:
Ŷ = 1.539 + 1.565 Time + 0.245 (Time)²

Excel:
               Coefficients   Standard Error   t Stat    P-value
Intercept      1.53870        2.24465          0.68550   0.50722
Time           1.56496        0.60179          2.60052   0.02467
Time-squared   0.24516        0.03258          7.52406   1.165E-05

Minitab:
The regression equation is
Purity = 1.54 + 1.56 Time + 0.245 Time Squared

Predictor      Coef      SE Coef   T      P
Constant       1.5390    2.24500   0.69   0.507
Time           1.5650    0.60180   2.60   0.025
Time Squared   0.24516   0.03258   7.52   0.000

S = 2.59513   R-Sq = 99.5%   R-Sq(adj) = 99.4%

The quadratic term is statistically significant (p-value very small)
Example: Quadratic Model in Excel & Minitab
(continued)
Quadratic regression results:
Ŷ = 1.539 + 1.565 Time + 0.245 (Time)²

Excel:
Regression Statistics
R Square            0.99494
Adjusted R Square   0.99402
Standard Error      2.59513

Minitab:
The regression equation is
Purity = 1.54 + 1.56 Time + 0.245 Time Squared

Predictor      Coef      SE Coef   T      P
Constant       1.5390    2.24500   0.69   0.507
Time           1.5650    0.60180   2.60   0.025
Time Squared   0.24516   0.03258   7.52   0.000

S = 2.59513   R-Sq = 99.5%   R-Sq(adj) = 99.4%

The adjusted r2 of the quadratic model is higher than the adjusted r2 of the simple regression model. The quadratic model explains 99.4% of the variation in Y.
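As a check, this fit can be reproduced in Python/numpy from the Purity/Time data in the table above; the printed coefficients should agree with the Excel and Minitab output, up to rounding:

```python
import numpy as np

time = np.array([1, 2, 3, 5, 7, 8, 10, 12, 13, 14, 15, 15, 16, 17], dtype=float)
purity = np.array([3, 7, 8, 15, 22, 33, 40, 54, 67, 70, 78, 85, 87, 99], dtype=float)

# Quadratic design matrix: intercept, Time, Time^2
X = np.column_stack([np.ones_like(time), time, time**2])
b, *_ = np.linalg.lstsq(X, purity, rcond=None)
print(f"Purity-hat = {b[0]:.3f} + {b[1]:.3f} Time + {b[2]:.3f} Time^2")
```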
Example: Quadratic Model Residual Plots
(continued)
Quadratic regression results:
Ŷ = 1.539 + 1.565 Time + 0.245 (Time)²

(Figure: residual plots vs. Time and vs. Time-squared, both showing random scatter around zero)
The residuals plotted versus both Time and Time-squared show a random
pattern.
Collinearity
(continued)
Including two highly correlated independent
variables can adversely affect the regression
results
No new information provided
Some Indications of Strong Collinearity
Incorrect signs on the coefficients
Large change in the value of a previous
coefficient when a new variable is added to the
model
A previously significant variable becomes
non-significant when a new independent
variable is added
Detecting Collinearity
(Variance Inflationary Factor)
VIFj is used to measure collinearity:

VIFj = 1 / (1 − R²j)

where R²j is the coefficient of determination from regressing variable Xj on all the other X variables
If VIFj > 5, Xj is highly correlated with
the other independent variables
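A minimal sketch of computing VIFj directly from this definition in Python/numpy (the data below are hypothetical): regress Xj on the other X variables, take the resulting R², and apply the formula.

```python
import numpy as np

def vif(X: np.ndarray, j: int) -> float:
    """VIF for column j of X, where each column of X is one independent variable."""
    y = X[:, j]                        # treat Xj as the response
    others = np.delete(X, j, axis=1)   # all other X variables
    Z = np.column_stack([np.ones(len(y)), others])
    b, *_ = np.linalg.lstsq(Z, y, rcond=None)
    resid = y - Z @ b
    r2 = 1 - (resid @ resid) / ((y - y.mean()) @ (y - y.mean()))
    return 1 / (1 - r2)

# Hypothetical data: two mildly correlated X variables
rng = np.random.default_rng(0)
x1 = rng.normal(size=50)
x2 = 0.3 * x1 + rng.normal(size=50)
print(vif(np.column_stack([x1, x2]), 0))
```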
Example: Pie Sales
Week   Pie Sales   Price ($)   Advertising ($100s)
1      350         5.50        3.3
2      460         7.50        3.3
3      350         8.00        3.0
4      430         8.00        4.5
5      350         6.80        3.0
6      380         7.50        4.0
7      430         4.50        3.0
8      470         6.40        3.7
9      450         7.00        3.5
10     490         5.00        4.0
11     340         7.20        3.5
12     300         7.90        3.2
13     440         5.90        4.0
14     450         5.00        3.5
15     300         7.00        2.7

Recall the multiple regression equation of chapter 2:

Sales = b0 + b1 (Price) + b2 (Advertising)
Detecting Collinearity in Excel using PHStat
PHStat / regression / multiple regression …
Check the “variance inflationary factor (VIF)” box
Output for the pie sales example:

Regression Analysis: Price and all other X

Regression Statistics
Multiple R          0.030438
R Square            0.000926
Adjusted R Square   -0.075925
Standard Error      1.21527
Observations        15
VIF                 1.000927

The VIF is < 5: there is no evidence of collinearity between Price and Advertising
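As a check on this output, the VIF for Price can be computed directly from the pie sales data above. With only two X variables, the R² from regressing Price on Advertising is simply their squared correlation; the result should match the PHStat value of about 1.0009:

```python
import numpy as np

price = np.array([5.50, 7.50, 8.00, 8.00, 6.80, 7.50, 4.50,
                  6.40, 7.00, 5.00, 7.20, 7.90, 5.90, 5.00, 7.00])
advertising = np.array([3.3, 3.3, 3.0, 4.5, 3.0, 4.0, 3.0,
                        3.7, 3.5, 4.0, 3.5, 3.2, 4.0, 3.5, 2.7])

# With one other X variable, R^2 equals the squared correlation
r = np.corrcoef(price, advertising)[0, 1]
print(f"VIF for Price = {1 / (1 - r**2):.6f}")
```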