CH 16 Aslr

This document discusses simple linear regression. It defines the components of the simple regression equation (β1, β0) and explains how to interpret them. It demonstrates the least squares method for calculating β1 and β0, which finds the line that best fits the data by minimizing the vertical distances between the data points and the regression line. It also defines the correlation coefficient and coefficient of determination, and discusses the relationship between correlation and causation.

Uploaded by

benny

Simple Linear Regression

________________________________________

1) Discuss conceptual differences between ANOVA and regression.

2) Identify the components of the simple regression equation (β1, β0) and explain their interpretation.

3) Demonstrate the Least-Squares method for calculating β1 and β0.

4) Develop a measure for error in the regression model and demonstrate a method for comparing the variance due to error with the variance due to our model.

5) Define and explain the correlation coefficient and the coefficient of determination.

6) Discuss the relationship between correlation and causation.
CI vs. ANOVA vs. Regression
__________________________________________

Key word for CI:

Key word for ANOVA:

t-test

Key word(s) for regression:

Trivia Wars
______________________________________

Let’s say Amherst declares war on Northampton because

Northampton tries to lure Judie's into moving out of
Amherst. No one actually wants to kill anyone, so we
decide to settle our differences with a rousing game of
Jeopardy! You are elected the Captain of Amherst’s team
(as if you would be selected instead of me). How are you
going to choose the team?

Multiple criteria:
1) Knowledge
2) Performance under pressure
EX: Cindy Brady
3) Speed

Historical roots in WW II
Who would be a good ball turret gunner?
Regression
______________________________________

What is the relationship between…

Grades or Money or Relationship Status or Health
…and Life Satisfaction?
______________________________________

How well can I predict a person’s Life Satisfaction if I

know their …

Grades or Money or Relationship Status or Health
______________________________________

How are we going to do this?


General form of Probabilistic (Regression) Models
________________________________________

y= +

y = regression line + error

y= +

_______________________________________

E(y) -
• Regression line connects
Simple Regression
First-Order
Single-Predictor
___________________________________________

y = β0 + β1x + ε

y =

x =

E(y) =

ε =
______________________________________

Compare with y = mx + b: β0 plays the role of b (the y-intercept) and β1 plays the role of m (the slope).
Interpretation of y-intercept and slope
________________________________________

Intercept
• Intercept only makes sense if x

• Regression equation only applies

________________________________________

Slope
• Change in y for a unit change in x.
o + implies a positive relationship
o – implies a negative relationship

________________________________________

Most important point:

Give me a value for x and the regression equation

and I can
Steps to completing a regression analysis
(both simple and multiple)
________________________________________

Step 1: Hypothesize the deterministic component of the model.

Step 2: Use sample data to

Step 3: Specify the probability distribution of the

Step 4: Evaluate the usefulness of

Step 5: Use the model for
Fitting a model to our data (Step 2)
________________________________________

Least-Squares method

1) Sum of the vertical distance between each point

2) Square of the vertical distance is

When in doubt, think Bribery!!
____________________________________________

You want to determine the relationship between monetary

gifts and "BONUS POINTS FOR SPECIAL
CONTRIBUTIONS TO CLASS" added to your final
average so that you can decide how large a check to write
at the end of the semester (though I do prefer cash for tax
purposes). Let's say x represents the amount of money
contributed by past students, and y represents the number
of "Bonus Points" awarded to them.

[Figure: "Bribery" scatter plot, Bonus Points (y-axis) vs. Donation (x-axis, 0 to 10)]
Fishing for a regression line
________________________________________
[Figure: scatter plot of Bonus Points (y-axis) vs. Donation (x-axis, 0 to 10) with two candidate lines, y = x + 1 and y = 5]


 X (Gift)   Y (BP)      Distance            Squared Distance
                      y = 5   y = x + 1     y = 5   y = x + 1
    4         1        -4       -4           16       16
    8         9         4        0           16        0
    2         5         0        2            0        4
    6         5         0       -2            0        4
  Sum                   0       -4           32       24

Which regression line is better?

Is that the ‘best’ regression line?
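The comparison in the table can be reproduced with a few lines of Python. This is only a sketch: the helper name sum_squared_distance is made up for illustration, and the data are the four (Gift, BP) pairs from the table.

```python
# Bribery data from the table: x = gift, y = bonus points.
gifts = [4, 8, 2, 6]
bonus_points = [1, 9, 5, 5]

def sum_squared_distance(predict, xs, ys):
    """Sum of squared vertical distances between each point and a line.

    `predict` is any function that returns the line's y value for a given x.
    (Hypothetical helper, not part of the original handout.)
    """
    return sum((y - predict(x)) ** 2 for x, y in zip(xs, ys))

sse_flat = sum_squared_distance(lambda x: 5, gifts, bonus_points)       # line y = 5
sse_slope = sum_squared_distance(lambda x: x + 1, gifts, bonus_points)  # line y = x + 1

print(sse_flat, sse_slope)  # 32 24
```

The line with the smaller total (y = x + 1, with 24) fits these four points better, but nothing here says it is the best possible line.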
Formulae for Least Squares Method
________________________________________

β1 = SP / SSx

β0 = My – (β1 * Mx)

______________________________________________

SSx = Σ(x²) – [(Σx)² / n]

SP = Σ(xy) – [(Σx)(Σy) / n]
Finding the best-fit regression line
________________________________________

    x          y          x²            xy
    4          1          16             4
    8          9          64            72
    2          5           4            10
    6          5          36            30
 Σx = 20    Σy = 20    Σ(x²) = 120   Σ(xy) = 116

SSx = Σ(x²) – [(Σx)² / n]
= 120 – [(20)² / 4]
= 120 – (400 / 4)
= 120 – 100 = 20

SP = Σ(xy) – [(Σx)(Σy) / n]
= 116 – [(20)(20) / 4]
= 116 – (400 / 4)
= 116 – 100 = 16
________________________________________

β1 = SP / SSx
= 16 / 20 = 0.8

β0 = My – (β1 * Mx)
= 5 – (.8)(5) = 1.0
________________________________________
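As a check on the hand calculation, the same formulas can be written out in Python. This is a minimal sketch using only the four bribery data points; the variable names (ss_x, sp, b1, b0) are chosen here for illustration.

```python
import math

xs = [4, 8, 2, 6]   # gift
ys = [1, 9, 5, 5]   # bonus points
n = len(xs)

# SSx = sum(x^2) - (sum x)^2 / n
ss_x = sum(x * x for x in xs) - sum(xs) ** 2 / n
# SP = sum(xy) - (sum x)(sum y) / n
sp = sum(x * y for x, y in zip(xs, ys)) - sum(xs) * sum(ys) / n

b1 = sp / ss_x                        # slope
b0 = sum(ys) / n - b1 * sum(xs) / n   # intercept = My - b1 * Mx

# Sum of squared vertical distances for the fitted line.
sse = sum((y - (b0 + b1 * x)) ** 2 for x, y in zip(xs, ys))

print(ss_x, sp, b1, b0, round(sse, 2))
```

The fitted line's sum of squared distances comes out below the 24 and 32 obtained for the two hand-picked lines, which is what "least squares" promises.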
The Least-Squares Regression Line
________________________________________

[Figure: scatter plot of Bonus Points (y-axis) vs. Donation (x-axis, 0 to 10) showing y = x + 1, y = 5, and the Least Squares Reg. Line]

    x      y     E(y)    Distance    Squared Distance
    4      1     4.2      -3.2          10.24
    8      9     7.4       1.6           2.56
    2      5     2.6       2.4           5.76
    6      5     5.8      -0.8           0.64
  Sum                      0            19.20
Testing Example
__________________________________________

Unbeknownst to you, Biff is the heir to his family’s

Widget fortune. For his summer job, Biff was asked to
evaluate a group of employees’ widget making ability
using a standardized widget-making test. Biff’s boss
(Uncle Buck) asks Biff to determine the regression
equation that one would use to predict performance on the
test from years of service with the company. The data
appear below.

 x (years)   y (score)     x²        y²         xy
     3          55          9       3025       165
     4          78         16       6084       312
     4          72         16       5184       288
     2          58          4       3364       116
     5          89         25       7921       445
     3          63          9       3969       189
     4          73         16       5329       292
     5          84         25       7056       420
     3          75          9       5625       225
     2          48          4       2304        96
  Σx = 35    Σy = 695   Σ(x²) = 133   Σ(y²) = 49,861   Σ(xy) = 2,548
Calculations
______________________________________________

SSx = Σ(x²) – [(Σx)² / n]

SP = Σ(xy) – [(Σx)(Σy) / n]

______________________________________________

β1 = SP / SSx

β0 = My – (β1 * Mx)
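One way to check your work on the widget data is to run the same four formulas in Python. A sketch, assuming the ten (years, score) pairs from the table; the variable names are illustrative only.

```python
import math

years = [3, 4, 4, 2, 5, 3, 4, 5, 3, 2]
scores = [55, 78, 72, 58, 89, 63, 73, 84, 75, 48]
n = len(years)

ss_x = sum(x * x for x in years) - sum(years) ** 2 / n                        # 133 - 122.5
sp = sum(x * y for x, y in zip(years, scores)) - sum(years) * sum(scores) / n  # 2548 - 2432.5

b1 = sp / ss_x                              # slope: points gained per year of service
b0 = sum(scores) / n - b1 * sum(years) / n  # intercept: My - b1 * Mx

print(ss_x, sp, b1, b0)  # 10.5 115.5 11.0 31.0
```

With these numbers the prediction equation would be E(y) = 31 + 11x, so each additional year of service goes with about 11 more points on the test.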
Widget Test Scatter Plot
________________________________________

[Figure: scatter plot of Test Score (y-axis, 40 to 100) vs. Experience (x-axis, 2 to 5)]
Assumptions regarding Error (ε)
________________________________________

ε: essentially the vertical distance from the regression line

_________________________________________

1) The mean of the probability distribution of ε = 0

2) The variance of the probability distribution of ε is constant for all values of x

3) Distribution of ε is normal

4) Values of ε are independent of one another.

Factors that contribute to Error
________________________________________

Two types of Error

1) Measurement Error -

EX: incorrect reading of beaker

2) Chance factors
EX: unusually non/reactive chemical
Estimation of Variability due to Error (Step 3)
________________________________________

s² is analogous to MSE
s² = SSE / dferror = SSE / (n – 2)

SSE = SSy – β1(SP)

SSy = Σ(y²) – [(Σy)² / n]

________________________________________

• s² = SSE / (n – 2) = MSE

s = Estimated Standard Error of the Regression Model = Root MSE
Calculate the error
______________________________________

SSy = Σ(y²) – [(Σy)² / n]
= 49,861 – [(695)² / 10]
= 49,861 – (483,025 / 10)
= 49,861 – 48,302.5 = 1558.5

SSE = SSy – β1(SP)
= 1558.5 – 11.0(115.5)
= 1558.5 – 1270.5 = 288

s² = SSE / (n – 2)
= 288 / (10 – 2) = 36
(a/k/a MSE)

s = √36 = 6
(a/k/a Root MSE)
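The same chain of calculations (SSy, then SSE, then MSE, then Root MSE) can be sketched in Python from the column totals alone. The inputs below are the sums and estimates already worked out for the widget example; the variable names are illustrative.

```python
import math

n = 10
sum_y = 695
sum_y2 = 49_861
b1 = 11.0    # slope from the widget example
sp = 115.5   # SP from the widget example

ss_y = sum_y2 - sum_y ** 2 / n   # total variability in y
sse = ss_y - b1 * sp             # variability left over after fitting the line
mse = sse / (n - 2)              # s^2, with n - 2 degrees of freedom
s = math.sqrt(mse)               # Root MSE

print(ss_y, sse, mse, s)  # 1558.5 288.0 36.0 6.0
```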
Important points about error or σ
________________________________________

1. The smaller σ, the better we can

2. The smaller σ, the more the individual data points will be around the regression line.

3. A smaller σ implies that x is a predictor of y.

Why?

Also, we can use this information to develop a sense of how far points should fall off the line.
• We can calculate a CI around the regression line.
95% of our points should fall within about 2 RMSEs
of the regression line. If not, HMMMM…
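The "within about 2 RMSEs" rule of thumb can be tried on the widget data. In this sketch the slope (11) and Root MSE (6) are taken from the worked example; the intercept of 31 is an assumption in the sense that it is computed here from β0 = My – (β1 * Mx) = 69.5 – 11(3.5) rather than printed on the slides.

```python
years = [3, 4, 4, 2, 5, 3, 4, 5, 3, 2]
scores = [55, 78, 72, 58, 89, 63, 73, 84, 75, 48]

b0, b1 = 31.0, 11.0   # b0 computed as My - b1 * Mx (not stated in the handout)
s = 6.0               # Root MSE from the worked example

# Vertical distance of each point from the regression line.
residuals = [y - (b0 + b1 * x) for x, y in zip(years, scores)]

# Count the points that land within 2 RMSEs (here, 12 points) of the line.
within = sum(1 for e in residuals if abs(e) <= 2 * s)
largest = max(abs(e) for e in residuals)

print(within, len(residuals), largest)  # 10 10 11.0
```

All ten employees fall inside the band, so nothing in this small sample looks suspicious.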
Evaluate the usefulness of the model (Step 4)
________________________________________

Step 1: Specify the null and alternative hypotheses.

• Ho: β1 = 0
• Ha: β1 ≠ 0

Step 2: Designate the rejection region by selecting α.

Step 3: Obtain the critical value for your test statistic

• t
• df = n – 2

Step 4: Collect your data

Step 5: Use your sample data to calculate:

• β1 = SP / SSx
• sβ1 = SE = s / √SSx

Step 6: Use your parameter estimates to calculate the observed value of your test statistic
• t = (β1 – 0) / sβ1

Step 7: Compare tobs with tcrit:

• If the test statistic falls in the RR, reject the null.
• Otherwise, we fail to reject the null.
Calculating whether β1 (slope) ≠ 0
________________________________________

Ho: β1 = 0
Ha: β1 ≠ 0

tcrit = 2.306
(df = 8; α = .05)

RR: |tobs| > 2.306

Observed t = (β1 – 0) / (s / √SSx)
= (11 – 0) / (6 / √10.5)
= 11 / 1.85
= 5.94

We would reject the null hypothesis because tobs exceeds the tcrit. In other words, tobs falls in the rejection region.
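The observed t can be recomputed in a few lines. A sketch, with the slope, Root MSE, and SSx taken from the widget example and the critical value 2.306 (df = 8, α = .05) copied from the slide rather than looked up in code.

```python
import math

b1 = 11.0       # estimated slope
s = 6.0         # Root MSE
ss_x = 10.5     # SSx for years of service
t_crit = 2.306  # two-tailed critical value for df = 8, alpha = .05 (from the slide)

se_b1 = s / math.sqrt(ss_x)   # standard error of the slope
t_obs = (b1 - 0) / se_b1      # test statistic under Ho: slope = 0

reject_null = abs(t_obs) > t_crit
print(round(se_b1, 2), round(t_obs, 2), reject_null)  # 1.85 5.94 True
```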

Implication:
Correlation Coefficient
________________________________________

Pearson’s product moment coefficient of correlation – a measure of the strength of the linear relationship between two variables.
Terminology / notation:
• r
• Pearson’s r
• correlation coefficient

________________________________________

r = SP / √[(SSx)(SSy)]

Interpretation:

+1   perfect positive relationship
     (strong positive relationship)
 0   no relationship
     (strong negative relationship)
-1   perfect negative relationship
r for the Widget Example
____________________________________________
r = SP / √[(SSx)(SSy)]

Experience in Years

= 115.5 / √[(10.5)(1558.5)]
= 115.5 / √16,364.25
= 115.5 / 127.92 = .90

Experience in Months

= 1386 / √[(1512)(1558.5)]
= 1386 / √2,356,452
= 1386 / 1535.074 = .90
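The point of the years-versus-months comparison is that r does not care about the units of x. A sketch that recomputes r from the raw widget data both ways; pearson_r is a helper written here for illustration, not a library function.

```python
import math

def pearson_r(xs, ys):
    """r = SP / sqrt(SSx * SSy), using the computational formulas from the handout."""
    n = len(xs)
    ss_x = sum(x * x for x in xs) - sum(xs) ** 2 / n
    ss_y = sum(y * y for y in ys) - sum(ys) ** 2 / n
    sp = sum(x * y for x, y in zip(xs, ys)) - sum(xs) * sum(ys) / n
    return sp / math.sqrt(ss_x * ss_y)

years = [3, 4, 4, 2, 5, 3, 4, 5, 3, 2]
scores = [55, 78, 72, 58, 89, 63, 73, 84, 75, 48]
months = [12 * x for x in years]   # same experience, expressed in months

r_years = pearson_r(years, scores)
r_months = pearson_r(months, scores)

print(round(r_years, 2), round(r_months, 2))  # 0.9 0.9
```

Multiplying x by 12 multiplies SP by 12 and SSx by 144, so the factor cancels inside the square root and r is unchanged.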
Stress and Health
____________________________________________

There is a strong negative correlation between stress and

health. Generally, the more stressed a person is, the
worse their health is.

But, does that mean that stress causes poor health?

No... Yes...
Coefficient of Determination
________________________________________

r² represents the proportion of the total sample variability

For simple linear regression, the coefficient of determination r² = (Pearson’s r)².

________________________________________

More general formula is as follows:

r² = (SSy – SSE) / SSy

= 1 – (SSE / SSy)
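For the widget example, the two routes to r² can be compared directly. A sketch using the quantities already computed for that example (SP = 115.5, SSx = 10.5, SSy = 1558.5, SSE = 288).

```python
import math

sp, ss_x, ss_y, sse = 115.5, 10.5, 1558.5, 288.0

# Route 1: square the correlation coefficient.
r = sp / math.sqrt(ss_x * ss_y)
r2_from_r = r ** 2

# Route 2: the more general formula, 1 - SSE / SSy.
r2_general = 1 - sse / ss_y

print(round(r2_from_r, 3), round(r2_general, 3))  # 0.815 0.815
```

Both routes agree: years of service accounts for roughly 82% of the variability in test scores in this sample.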

________________________________________

SPSS will give us everything we need!

Questions about Regression output
__________________________________________

1) What is r?

2) Is this correlation significant?

3) How much of the variance in # of colds per winter can

be explained by weekend bedtime?

4) What is the y-intercept?

5) Is it significantly different from zero?

6) What is E(y) if x = 10:00 PM (10)?

7) What is E(y) if x = 2:00 AM (14)?

8) Are your answers to questions 6 and 7 meaningful?

SPSS output
______________________________________________
Model Summary
Model     R       R²      Adj R²    SE
1         .204    .041    .034      1.20

ANOVA
Model            Sum of Squares    df     Mean Square    F       Sig.
1  Regression         7.68           1       7.68        5.32    .023
   Residual         177.58         123       1.44
   Total            185.27         124

Coefficients
                 Unstandardized Coeff    Standardized Coeff
Model              B        SE                 Beta             t       Sig.
1  (Constant)     5.711    1.69                                3.38    .001
   bed_we         -.266     .12                -.20           -2.31    .023
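Reading a prediction equation off the Coefficients table can be sketched as follows. The coefficient values come from the output above; the coding of bedtime as hours (10 = 10:00 PM, 14 = 2:00 AM) follows the questions on the previous slide, and the function name predicted_colds is made up for illustration.

```python
b0 = 5.711    # (Constant) from the Coefficients table
b1 = -0.266   # bed_we coefficient from the Coefficients table

def predicted_colds(bedtime):
    """E(y) = b0 + b1 * x, with x coded as hours (10 = 10:00 PM, 14 = 2:00 AM)."""
    return b0 + b1 * bedtime

# Proportion of variance explained, from the ANOVA table: SS regression / SS total.
r_squared = 7.68 / 185.27

print(round(predicted_colds(10), 3), round(predicted_colds(14), 3), round(r_squared, 3))
```

Whether those predictions are meaningful depends on whether bedtimes of 10 and 14 fall inside the range of x values actually observed, which the output alone does not show.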
I just don’t get it
____________________________________________

I know I’m old, but I just don’t get the tattoo thing. I
gotta figure that people regret their decision as time
passes. The data below represent 100 subjects who had
tattoos etched into their skin between 1 and 5 years ago.
They rated their satisfaction with their lifetime scar on a
scale of 1-10 (10 = extremely satisfied). Is there a
relationship between tattoo age and tattoo satisfaction?
   Σx       Σy      Σ(x²)     Σ(y²)     Σ(xy)
  300      600      1100      3954      1660

Regression Equation
SP = Σ(xy) – [(Σx)(Σy) / n]
SSx = Σ(x²) – [(Σx)² / n]
β1 = SP / SSx
β0 = My – (β1 * Mx)

Hypothesis Test
SSy = Σ(y²) – [(Σy)² / n]
SSE = SSy – β1(SP)
s² (MSE) = SSE / (n – 2)
t = (β1 – 0) / (s / √SSx)

Correlation Coefficient
r = SP / √[(SSx)(SSy)]
Calculating the regression parameters
______________________________________________

SP = Σ(xy) – [(Σx)(Σy) / n]

SSx = Σ(x²) – [(Σx)² / n]

β1 = SP / SSx

β0 = My – (β1 * Mx)
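If you want to check your arithmetic, the tattoo parameters can be computed from the column totals alone. A sketch, assuming n = 100 subjects as stated in the problem; the variable names are illustrative.

```python
import math

n = 100
sum_x, sum_y = 300, 600   # tattoo age (years), satisfaction (1-10)
sum_x2, sum_xy = 1100, 1660

sp = sum_xy - sum_x * sum_y / n   # 1660 - 1800
ss_x = sum_x2 - sum_x ** 2 / n    # 1100 - 900

b1 = sp / ss_x                    # slope
b0 = sum_y / n - b1 * sum_x / n   # intercept = My - b1 * Mx

print(sp, ss_x, round(b1, 2), round(b0, 2))  # -140.0 200.0 -0.7 8.1
```

The negative slope means predicted satisfaction drops by about 0.7 points for each additional year since the tattoo.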
Let's do a t-test
______________________________________________

SSy = Σ(y²) – [(Σy)² / n]

SSE = SSy – β1 * (SP)

s² = MSE

s =

t = (β1 – 0) / (s / √SSx)

We reject the null and conclude that there is a significant

NEGATIVE relationship between tattoo age and tattoo
satisfaction.
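The t-test for the tattoo data can be sketched the same way, again from the column totals with n = 100. The critical value is not computed here; the code only produces the observed t to compare against your table value for df = 98.

```python
import math

n = 100
sum_x, sum_y = 300, 600
sum_x2, sum_y2, sum_xy = 1100, 3954, 1660

sp = sum_xy - sum_x * sum_y / n   # -140
ss_x = sum_x2 - sum_x ** 2 / n    # 200
ss_y = sum_y2 - sum_y ** 2 / n    # 354
b1 = sp / ss_x                    # -0.7

sse = ss_y - b1 * sp              # error sum of squares
mse = sse / (n - 2)               # s^2
s = math.sqrt(mse)                # Root MSE
t_obs = (b1 - 0) / (s / math.sqrt(ss_x))

print(round(sse, 1), round(mse, 3), round(s, 3), round(t_obs, 3))
```

The observed t is large and negative (about -6.1), which is consistent with rejecting the null in favor of a negative relationship.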
Let's calculate the correlation coefficient
____________________________________________

SSy = Σ(y²) – [(Σy)² / n]

r = SP / √[(SSx)(SSy)]

r² =

Although there is a significant NEGATIVE relationship between tattoo age and tattoo satisfaction, age only
explains about 28% of the variance in satisfaction.
Clearly, other factors are involved.
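To see how much variance tattoo age accounts for, r and r² can be computed from the same column totals. A sketch, assuming n = 100 as stated in the problem.

```python
import math

n = 100
sum_x, sum_y = 300, 600
sum_x2, sum_y2, sum_xy = 1100, 3954, 1660

sp = sum_xy - sum_x * sum_y / n   # -140
ss_x = sum_x2 - sum_x ** 2 / n    # 200
ss_y = sum_y2 - sum_y ** 2 / n    # 354

r = sp / math.sqrt(ss_x * ss_y)   # correlation coefficient
r_squared = r ** 2                # coefficient of determination

print(round(r, 3), round(r_squared, 3))
```

With these totals r is about -0.53 and r² comes out near 0.28, so well under a third of the variability in satisfaction is tied to tattoo age.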
Skipping Class
__________________________________________
In a perfect world, the correlation between the number of classes skipped and
the percentage of classes skipped should be 1.00. Let's see how well the
percentage of classes skipped (x) predicts the number of hours of classes
skipped (y). Please calculate the regression line, the correlation
coefficient, and the coefficient of determination.

   Σx       Σy      Σ(x²)     Σ(y²)     Σ(xy)

Regression Equation
SP = Σ(xy) – [(Σx)(Σy) / n]
SSx = Σ(x²) – [(Σx)² / n]
β1 = SP / SSx
β0 = My – (β1 * Mx)

Hypothesis Test
SSy = Σ(y²) – [(Σy)² / n]
SSE = SSy – β1(SP)
s² (MSE) = SSE / (n – 2)
t = (β1 – 0) / (s / √SSx)

Correlation Coefficient
r = SP / √[(SSx)(SSy)]
