Lecture 10: F-Tests, ANOVA and R2
1 ANOVA
We saw that we could test the null hypothesis that β1 = 0 using the statistic (β̂1 − 0)/ŝe. (Although
I also mentioned that confidence intervals are generally more important than testing). There is
another approach called Analysis of Variance (ANOVA). It’s out-dated but it is in the book and
you should know how it works.
The idea is to compare two models:
$$Y = \beta_0 + \epsilon \quad \text{versus} \quad Y = \beta_0 + \beta_1 X + \epsilon.$$
If we fit the first model, the least squares estimator is $\hat{\beta}_0 = \bar{Y}$. The idea is now to create a statistic
that measures how much better the second model is than the first. The residual sums of
squares (RSS) from the first model is thus $\sum_i (Y_i - \bar{Y})^2$. This is called the total sums of squares and is denoted by
$SS_{total} = \sum_i (Y_i - \bar{Y})^2$. If we fit the second model (the usual linear model) we get a smaller residual
sums of squares, $RSS = \sum_i e_i^2$.
The difference SStotal − RSS is called the sums of squares due to regression and is denoted by
SSreg. If β1 = 0 we expect this to be small. In the olden days, people summarized this in an ANOVA
table like this:
Source        df     SS         MS                         F                   p-value
Regression    1      SSreg      MSreg = SSreg/1            F = MSreg/MSres
Residual      n-2    RSS        MSres = σ̂2 = RSS/(n-2)
Total         n-1    SStotal
The degrees of freedom (df) are just numbers that are defined, frankly, to make things work
right. The mean squares (MS) are the sums of squares divided by the df. The F statistic is
$$F = \frac{MS_{reg}}{MS_{res}}.$$
Under H0 , the statistic has a known distribution called the F distribution. This distribution depends
on two parameters (just as the χ2 distribution depends on one parameter). These are called the
degrees of freedom for the F distribution. We denote the distribution by F1,n−2 . The p-value is
P(F > Fobserved)
where F ∼ F1,n−2 and Fobserved is the actual observed value you compute from the data.
This is equivalent to our previous test: the F statistic is exactly the square of the t statistic (β̂1 − 0)/ŝe, so the two tests give the same p-value.
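To see all the pieces at once, here is a little R sketch on simulated data (the sample size, coefficients and noise level are arbitrary choices for illustration, not anything from the book): it builds the sums of squares, mean squares, F statistic and p-value by hand, and checks that they agree with anova() and with the square of the t statistic.
# Simulated data from the Gaussian-noise simple linear regression model
# (sample size, coefficients and noise level are arbitrary choices)
set.seed(1)
n <- 100
x <- runif(n, 0, 10)
y <- 2 + 0.5 * x + rnorm(n, sd = 3)
fit <- lm(y ~ x)
# The three sums of squares
ss.total <- sum((y - mean(y))^2)    # RSS of the intercept-only model
rss <- sum(residuals(fit)^2)        # RSS of the linear model
ss.reg <- ss.total - rss            # improvement due to the regression
# Mean squares, F statistic, and p-value
ms.reg <- ss.reg / 1
ms.res <- rss / (n - 2)
f.stat <- ms.reg / ms.res
p.value <- pf(f.stat, df1 = 1, df2 = n - 2, lower.tail = FALSE)
# These should match the first row of anova(fit), and the F statistic
# should equal the square of the t statistic for the slope
anova(fit)
c(f.stat, p.value)
summary(fit)$coefficients["x", "t value"]^2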
A little more formally, an F random variable is defined as the ratio
$$\frac{\chi^2_a / a}{\chi^2_b / b}$$
where χ2a and χ2b are independent χ2 random variables with a and b degrees of freedom.
Since χ2 distributions arise from sums of squared Gaussians, F-distributed random variables tend to
arise when we are dealing with ratios of sums of squared Gaussians. Under the usual assumptions, and
under H0, SSreg/σ2 and RSS/σ2 are independent χ2 random variables (with 1 and n − 2 degrees of
freedom respectively), which is exactly what gives F = MSreg/MSres the F1,n−2 distribution.
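If you want to convince yourself of this definition numerically, here is a quick simulation (the degrees of freedom below are arbitrary): the quantiles of the simulated ratio should be close to those of R's built-in F distribution.
# Simulate (chi^2_a / a) / (chi^2_b / b) with independent numerator and
# denominator, and compare quantiles to the F distribution (a, b arbitrary)
set.seed(2)
a <- 1
b <- 50
ratio <- (rchisq(1e5, df = a) / a) / (rchisq(1e5, df = b) / b)
quantile(ratio, probs = c(0.5, 0.9, 0.99))
qf(c(0.5, 0.9, 0.99), df1 = a, df2 = b)  # should be close to the line above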
2 ANOVA in R
The easiest way to do this in R is to use the anova function. This will give you an analysis-of-
variance table for the model. The actual object the function returns is an anova object, which is a
special type of data frame. The columns record, respectively, degrees of freedom, sums of squares,
mean squares, the actual F statistic, and the p-value of the F statistic. What we’ll care about will
be the first row of this table, which will give us the test information for the slope on X.
Let’s do an example:
library(gamair)
data(chicago)   # daily deaths and weather in Chicago
out = lm(death ~ tmpd, data = chicago)
anova(out)
The output looks like this:
## Analysis of Variance Table
##
## Response: death
## Df Sum Sq Mean Sq F value Pr(>F)
## tmpd 1 162473 162473 803.07 < 2.2e-16 ***
## Residuals 5112 1034236 202
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
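Because the value returned by anova() is (a special kind of) data frame, we can pull out individual cells by row and column name, using only the fitted model from above; this also lets us check the claim that the F statistic is the square of the t statistic reported by summary().
# anova() returns a data frame (of class "anova"); pick out cells by name
a.tab <- anova(out)
a.tab["tmpd", "F value"]   # F statistic for the slope on tmpd
a.tab["tmpd", "Pr(>F)"]    # its p-value
# As promised, the F statistic equals the squared t statistic for tmpd
summary(out)$coefficients["tmpd", "t value"]^2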
Assumptions. In deriving the F distribution, it is absolutely vital that all of the assumptions of
the Gaussian-noise simple linear regression model hold: the true model must be linear, the noise
must be Gaussian, the noise variance must be constant, the noise must be independent of X and
independent across measurements. The only hypothesis being tested is whether, maintaining all
these assumptions, we must reject the flat model β1 = 0 in favor of a line at an angle. In particular,
the test never doubts that the right model is a straight line.
ANOVA is an historical relic. In serious applied work from the modern (say, post-1985) era,
I have never seen any study where filling out an ANOVA table for a regression, etc., was at all
important.
3 What the F Test Really Tests
The textbook (§2.7–2.8) goes into great detail about an F test for whether the simple linear
regression model “explains” (really, predicts) a “significant” amount of the variance in the response.
What this really does is compare two versions of the simple linear regression model. The null
hypothesis is that all of the assumptions of that model hold, and the slope, β1 , is exactly 0. (This
is sometimes called the “intercept-only” model, for obvious reasons.) The alternative is that all
of the simple linear regression assumptions hold with β1 ∈ R. The alternative, non-zero-slope
model will always fit the data better than the null, intercept-only model; the F test asks if the
improvement in fit is larger than we’d expect under the null.
There are situations where it is useful to know about this precise quantity, and so run an F
test on the regression. It is hardly ever, however, a good way to check whether the simple linear
regression model is correctly specified, because neither retaining nor rejecting the null gives us
information about what we really want to know.
Suppose first that we retain the null hypothesis, i.e., we do not find any significant share of
variance associated with the regression. This could be because (i) the intercept-only model is right;
(ii) β1 ≠ 0 but the test doesn’t have enough power to detect departures from the null. We don’t
know which it is. There is also the possibility that the real relationship is nonlinear, but the best linear
approximation to it has slope (nearly) zero, in which case the F test will have no power to detect
the nonlinearity.
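Here is a minimal simulation of that last scenario (the particular function, range and noise level are invented for illustration): the truth is strongly nonlinear, but because it is symmetric over the range of X, the best linear approximation is essentially flat and the F test sees nothing.
# Nonlinear truth whose best linear approximation is flat
# (function, range and noise level made up for illustration)
set.seed(3)
x <- runif(200, -2, 2)
y <- x^2 + rnorm(200, sd = 0.5)   # E[Y|X] = X^2, symmetric about 0
anova(lm(y ~ x))                  # typically a large p-value: no power against this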
Suppose instead that we reject the null, intercept-only hypothesis. This does not mean that the
simple linear model is right. It means that the latter model predicts better than the intercept-only
model — too much better to be due to chance. The simple linear regression model can be absolute
garbage, with every single one of its assumptions flagrantly violated, and yet better than the model
which makes all those assumptions and thinks the optimal slope is zero.
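And here is the complementary sketch (again, the particular curve and noise structure are made up): essentially every assumption of the simple linear model is violated, yet the F test rejects the intercept-only model emphatically.
# Badly misspecified truth (nonlinear, non-constant variance) that still
# beats the intercept-only model; numbers are made up for illustration
set.seed(4)
x <- runif(200, 0, 5)
y <- exp(x) * (1 + rnorm(200, sd = 0.3))
anova(lm(y ~ x))   # tiny p-value, even though the linear model itself is wrong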
Neither the F test of β1 = 0 vs. β1 ≠ 0 nor the Wald/t test of the same hypothesis tells us
anything about the correctness of the simple linear regression model. All these tests presume the
simple linear regression model with Gaussian noise is true, and check a special case (flat line) against
the general one (tilted line). They do not test linearity, constant variance, lack of correlation, or
Gaussianity.
4 R2
Another quantity that gets mentioned a lot in regression (which is also a historical relic) is R2 . It
is defined by
$$R^2 = \frac{SS_{reg}}{SS_{total}}.$$
It is often described as “the fraction of variability explained by the regression.” It can be shown
that it can be written as R2 = r2 where
$$r = \frac{\widehat{\mathrm{Cov}}(X, Y)}{s_X s_Y},$$
in other words, the correlation coefficient squared.
R2 will be 0 when β̂1 = 0. On the other hand, if all the residuals are zero, then R2 = 1. It is
not too hard to show that R2 can’t possibly be bigger than 1, so we have marked out the limits: a
sample slope of 0 gives an R2 of 0, the lowest possible, and all the data points falling exactly on a
straight line gives an R2 of 1, the largest possible.
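With the Chicago regression from Section 2 still in memory, we can check the identity R2 = r2 directly, using only the fitted model and data from above:
summary(out)$r.squared              # R^2 from the regression of death on tmpd
cor(chicago$death, chicago$tmpd)^2  # the squared sample correlation: same number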
What does R2 converge to as n → ∞? The population version is
$$
\begin{aligned}
R^2 &= \frac{\mathrm{Var}[m(X)]}{\mathrm{Var}[Y]} && (1)\\
    &= \frac{\mathrm{Var}[\beta_0 + \beta_1 X]}{\mathrm{Var}[\beta_0 + \beta_1 X + \epsilon]} && (2)\\
    &= \frac{\mathrm{Var}[\beta_1 X]}{\mathrm{Var}[\beta_1 X + \epsilon]} && (3)\\
    &= \frac{\beta_1^2 \mathrm{Var}[X]}{\beta_1^2 \mathrm{Var}[X] + \sigma^2} && (4)
\end{aligned}
$$
Since all our parameter estimates are consistent, and this formula is continuous in all the parameters,
the R2 we get from our estimate will converge on this limit.
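A little simulation (with arbitrary made-up parameter values) shows the convergence: as n grows, the sample R2 settles down to β12 Var[X]/(β12 Var[X] + σ2).
# Sample R^2 at increasing n, versus the population limit
# beta1^2 Var[X] / (beta1^2 Var[X] + sigma^2); parameter values are arbitrary
set.seed(5)
beta1 <- 2
sigma <- 3
r2.at.n <- function(n) {
  x <- rnorm(n)                                # Var[X] = 1
  y <- 1 + beta1 * x + rnorm(n, sd = sigma)
  summary(lm(y ~ x))$r.squared
}
sapply(c(1e2, 1e4, 1e6), r2.at.n)
beta1^2 / (beta1^2 + sigma^2)                  # population limit = 4/13, about 0.31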
Unfortunately, a lot of myths about R2 have become endemic in the scientific community, and
it is vital at this point to immunize you against them.
1. The most fundamental is that R2 does not measure goodness of fit.
(a) R2 can be arbitrarily low when the model is completely correct. Look at Eq. 4. By making
Var [X] small, or σ2 large, we drive R2 towards 0, even when every assumption of the
simple linear regression model is correct in every particular.
(b) R2 can be arbitrarily close to 1 when the model is totally wrong. There is, indeed, no
limit to how high R2 can get when the true model is nonlinear. All that’s needed is for
the slope of the best linear approximation to be non-zero, and for Var [X] to get big.
(Both points are illustrated in the little R sketch after this list.)
2. R2 is also pretty useless as a measure of predictability.
(a) R2 says nothing about prediction error. R2 can be anywhere between 0 and 1 just by
changing the range of X. Mean squared error is a much better measure of how good
predictions are; better yet are estimates of out-of-sample error which we’ll cover later in
the course.
(b) R2 says nothing about interval forecasts. In particular, it gives us no idea how big
prediction intervals, or confidence intervals for m(x), might be.
3. R2 cannot be compared across data sets.
4. R2 cannot be compared between a model with untransformed Y and one with transformed
Y , or between different transformations of Y .
5. The one situation where R2 can be compared is when different models are fit to the same
data set with the same, untransformed response variable. Then increasing R2 is the same as
decreasing in-sample MSE (since R2 = 1 − RSS/SStotal, and SStotal is fixed for a given data set). In that case, however, you might as well just compare
the MSEs.
6. It is very common to say that R2 is “the fraction of variance explained” by the regression.
But it is also extremely easy to devise situations where R2 is high even though neither one
could possibly explain the other.
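Here is the short R sketch of points 1(a) and 1(b) promised above, with all the numbers invented purely for illustration: a completely correct linear model with an R2 near 0, and a clearly nonlinear truth with an R2 well above 0.9.
set.seed(6)
n <- 1000
# 1(a): every assumption of the simple linear model holds, but Var[X] is
# tiny and sigma^2 is big, so R^2 is close to 0
x1 <- rnorm(n, sd = 0.1)
y1 <- 2 + 3 * x1 + rnorm(n, sd = 10)
summary(lm(y1 ~ x1))$r.squared
# 1(b): the truth is curved, not linear, but Var[X] is huge, so R^2 is
# well above 0.9
x2 <- runif(n, 0, 100)
y2 <- x2^2 + rnorm(n, sd = 50)
summary(lm(y2 ~ x2))$r.squared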
At this point, you might be wondering just what R2 is good for — what job it does that isn’t
better done by other tools. The only honest answer I can give you is that I have never found a
situation where it helped at all. If I could design the regression curriculum from scratch, I would
never mention it. Unfortunately, it lives on as a historical relic, so you need to know what it is,
and what misunderstandings about it people suffer from.
5 The Correlation Coefficient
As you know, the correlation coefficient between X and Y is
$$\rho_{XY} = \frac{\mathrm{Cov}[X, Y]}{\sqrt{\mathrm{Var}[X]\,\mathrm{Var}[Y]}}$$
which lies between −1 and 1. It takes its extreme values when Y is a linear function of X.
Recall, from lecture 1, that the slope of the ideal linear predictor β1 is
$$\frac{\mathrm{Cov}[X, Y]}{\mathrm{Var}[X]},$$
so
$$\rho_{XY} = \beta_1 \sqrt{\frac{\mathrm{Var}[X]}{\mathrm{Var}[Y]}}.$$
As we saw, R2 is just the square of the sample correlation ρ̂XY.
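A quick numerical check of the identity ρXY = β1 √(Var[X]/Var[Y]), in its sample version, on simulated data with made-up parameters; the two quantities agree exactly, since it is an algebraic identity among the sample quantities.
# In-sample check of rho_XY = beta1 * sqrt(Var[X]/Var[Y]); the data are
# simulated with arbitrary parameter values
set.seed(7)
x <- rnorm(500, sd = 2)
y <- 1 + 0.7 * x + rnorm(500, sd = 1.5)
fit <- lm(y ~ x)
cor(x, y)
unname(coef(fit)["x"]) * sd(x) / sd(y)   # identical, by algebra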
6 Concluding Comment
The tone I have taken when discussing F tests, R2 and correlation has been dismissive. This is
deliberate, because they are grossly abused and over-used in current practice, especially by non-
statisticians, and I want you to be too proud (or too ashamed) to engage in those abuses. In a
better world, we’d just skip over them, but you will have to deal with colleagues, and bosses, who
learned their statistics in the bad old days, and so you will have to understand what they’re doing wrong.
In all fairness, the people who came up with these tools were great scientists, struggling with
very hard problems when nothing was clear; they were inventing all the tools and concepts we take
for granted in a class like this. Anyone in this class, me included, would be doing very well to come
up with one idea over the whole of our careers which is as good as R2 . But we best respect our
ancestors, and the tradition they left us, when we improve that tradition where we can. Sometimes
that means throwing out the broken bits.