Correlation and Regression Activity - Answer Key

Aljur and Kylie studied the relationship between stress levels and life satisfaction in participants. They found a strong negative linear relationship, such that higher stress was correlated with lower life satisfaction. The regression equation calculated from the data was y=10.78-0.36x, indicating that for every 1 unit increase in stress, life satisfaction decreased by 0.36 units.

Aljur and Kylie did a study on feelings of stress and life satisfaction. Participants completed a measure on how stressed they were feeling (on a 1 to 30 scale) and a measure of how satisfied they felt with their lives (measured on a 1 to 10 scale). The table below indicates the participants' scores. Using this data, answer the following questions: (1) Create a Scatter Plot (2) Calculate the Correlation (3) Interpret the Correlation (4) Create a Regression Equation Line

Participant   Stress Score (X)   Life Satisfaction (Y)
1             11                 7
2             25                 1
3             19                 4
4             7                  9
5             23                 2
6             6                  8
7             11                 8
8             22                 3
9             25                 3
10            10                 6

[Scatter plot: Life Satisfaction (Y) against Stress Score (X)]

                        Stress Score (X)     Life Satisfaction (Y)
Stress Score (X)        1
Life Satisfaction (Y)   -0.957273896112683   1

A strong negative linear relationship
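
The correlation above can be reproduced by hand. The following is a minimal sketch, assuming the scores are transcribed correctly from the participant table; the function name `pearson_r` is illustrative and not part of the original workbook, which used a spreadsheet correlation tool.

```python
import math

# Scores transcribed from the participant table in this problem.
stress = [11, 25, 19, 7, 23, 6, 11, 22, 25, 10]
satisfaction = [7, 1, 4, 9, 2, 8, 8, 3, 3, 6]


def pearson_r(xs, ys):
    """Pearson correlation: S_xy / sqrt(S_xx * S_yy)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    s_xx = sum((x - mean_x) ** 2 for x in xs)
    s_yy = sum((y - mean_y) ** 2 for y in ys)
    return s_xy / math.sqrt(s_xx * s_yy)


r = pearson_r(stress, satisfaction)
print(round(r, 4))  # -0.9573
```

The sign of r gives the direction of the relationship (negative here) and its magnitude, close to 1, is what supports the "strong" label.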

SUMMARY OUTPUT

Regression Statistics
Multiple R          0.957273896112683
R Square            0.916373312178756
Adjusted R Square   0.905919976201101
Standard Error      0.872953717427839
Observations        10

ANOVA
             df   SS         MS         F          Significance F
Regression   1    66.80361   66.80361   87.66324   1.385E-05
Residual     8    6.096386   0.762048
Total        9    72.9

                   Coefficients         Standard Error   t Stat      P-value     Lower 95%   Upper 95%   Lower 95.0%   Upper 95.0%
Intercept          10.7831325301205     0.666811         16.17119    2.149E-07   9.245463    12.3208     9.245463      12.3208
Stress Score (X)   -0.357429718875502   0.038175         -9.362865   1.385E-05   -0.445462   -0.269397   -0.445462     -0.269397

Regression Equation Line

y=10.78-0.36x

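
The regression coefficients, R Square, and F for the stress data can be checked with a few lines of arithmetic. This is a sketch under the assumption that the scores are transcribed correctly from the participant table; `fit_line` is a hypothetical helper name, and the workbook itself produced these figures with a spreadsheet regression tool rather than with code.

```python
# Scores transcribed from the participant table in this problem.
stress = [11, 25, 19, 7, 23, 6, 11, 22, 25, 10]
satisfaction = [7, 1, 4, 9, 2, 8, 8, 3, 3, 6]


def fit_line(xs, ys):
    """Return (intercept, slope) of the least-squares line y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    s_xx = sum((x - mean_x) ** 2 for x in xs)
    slope = s_xy / s_xx
    intercept = mean_y - slope * mean_x
    return intercept, slope


intercept, slope = fit_line(stress, satisfaction)

# Split the variation in Y into the ANOVA rows: regression, residual, total.
n = len(stress)
mean_y = sum(satisfaction) / n
fitted = [intercept + slope * x for x in stress]
ss_total = sum((y - mean_y) ** 2 for y in satisfaction)
ss_residual = sum((y - f) ** 2 for y, f in zip(satisfaction, fitted))
ss_regression = ss_total - ss_residual

r_square = ss_regression / ss_total
# F is MS regression over MS residual, with 1 and n - 2 degrees of freedom.
f_stat = ss_regression / (ss_residual / (n - 2))

print(f"intercept = {intercept:.4f}, slope = {slope:.4f}")
print(f"R Square = {r_square:.4f}, F = {f_stat:.2f}")
```

With a single predictor, R Square is the square of the correlation coefficient, which is why Multiple R equals the absolute value of r.
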
At Mosang Academy, students often have a lot of homework. The table below indicates the number of hours students studied, and how they performed on an exam in two of their classes. (1) Find the correlations between hours spent studying and how students performed in Marites 101 and Mosa 102 (2) Which class was more strongly correlated with studying?

              Marites 101                Mosa 102
Participant   Study Hours   Exam Score   Study Hours   Exam Score
1             3             75           4             70
2             15            95           12            98
3             6             65           9             85
4             8             70           6             80
5             4             85           2             65
6             2             80           3             75
7             10            65           10            92

[Scatter plot: Marites 101, Exam Score against Study Hours]

[Scatter plot: Mosa 102, Exam Score against Study Hours]

Marites 101
              Study Hours    Exam Score
Study Hours   1
Exam Score    0.2686677417   1
A very weak positive linear relationship

Mosa 102
              Study Hours    Exam Score
Study Hours   1
Exam Score    0.9697606254   1
A strong positive linear relationship

Therefore, Mosa 102 is more strongly correlated with studying.
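
The comparison between the two classes can be sketched as follows, assuming the hours and scores are transcribed correctly from the table. The helper `pearson_r` and the variable names are illustrative and do not appear in the original workbook.

```python
import math

# Study hours and exam scores transcribed from the table, one list per class.
marites_hours = [3, 15, 6, 8, 4, 2, 10]
marites_scores = [75, 95, 65, 70, 85, 80, 65]
mosa_hours = [4, 12, 9, 6, 2, 3, 10]
mosa_scores = [70, 98, 85, 80, 65, 75, 92]


def pearson_r(xs, ys):
    """Pearson correlation: S_xy / sqrt(S_xx * S_yy)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    s_xx = sum((x - mean_x) ** 2 for x in xs)
    s_yy = sum((y - mean_y) ** 2 for y in ys)
    return s_xy / math.sqrt(s_xx * s_yy)


r_marites = pearson_r(marites_hours, marites_scores)
r_mosa = pearson_r(mosa_hours, mosa_scores)

# Strength is judged by the magnitude of r, so compare absolute values.
stronger = "Mosa 102" if abs(r_mosa) > abs(r_marites) else "Marites 101"
print(f"Marites 101: r = {r_marites:.4f}")
print(f"Mosa 102:    r = {r_mosa:.4f}")
print(f"More strongly correlated with studying: {stronger}")
```

Comparing absolute values matters in general because a strongly negative r would also indicate a strong relationship; here both correlations happen to be positive.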


In a biology experiment a number of cultures were grown in the laboratory. The number of bacteria, in millions, and their ages, in days, are presented below. (1) Create a scatter diagram (2) Compute the correlation

Age (X)   # of Bacteria (Y)
1         34
2         106
3         135
4         181
5         192
6         231
7         268
8         300

                    Age (X)    # of Bacteria (Y)
Age (X)             1
# of Bacteria (Y)   0.989139   1
A strong positive linear relationship

[Scatter plot: # of Bacteria (Y) against Age (X)]

SUMMARY OUTPUT

Regression Statistics
Multiple R          0.989139
R Square            0.978396
Adjusted R Square   0.974795
Standard Error      13.90536
Observations        8

ANOVA
             df   SS                 MS         F          Significance F
Regression   1    52540.7202380952   52540.72   271.7261   3.177E-06
Residual     6    1160.15476190476   193.3591
Total        7    53700.875

            Coefficients   Standard Error     t Stat     P-value     Lower 95%   Upper 95%   Lower 95.0%   Upper 95.0%
Intercept   21.71429       10.834971749469    2.004093   0.091904    -4.797935   48.22651    -4.797935     48.22651
Age (X)     35.36905       2.14564413119168   16.48412   3.177E-06   30.11885    40.61925    30.11885      40.61925

Regression Equation Line
y=21.71+35.37x

Age       Predicted # of Bacteria (Y), in millions
10 days   375.41
15 days   552.26
20 days   729.11
25 days   905.96
30 days   1082.81
50 days   1790.21

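
The predicted bacteria counts follow from substituting each age into the regression equation. The sketch below assumes the two-decimal coefficients from the equation line and, for comparison, the longer coefficients from the regression output; the names `ROUNDED`, `FULL`, and `predict` are hypothetical. It also flags which ages lie beyond the observed data, which run from day 1 to day 8.

```python
# Coefficients as (intercept, slope), taken from the regression output above.
ROUNDED = (21.71, 35.37)      # as written in the equation line
FULL = (21.71429, 35.36905)   # as listed in the coefficients table

OBSERVED_MAX_AGE = 8  # the cultures were observed from day 1 to day 8


def predict(coeffs, age):
    """Evaluate y = intercept + slope * age."""
    intercept, slope = coeffs
    return intercept + slope * age


for age in (10, 15, 20, 25, 30, 50):
    rounded = predict(ROUNDED, age)
    full = predict(FULL, age)
    note = "extrapolated" if age > OBSERVED_MAX_AGE else "within data"
    print(f"{age:>2} days: {rounded:8.2f} (rounded)  {full:8.2f} (full)  {note}")
```

Every age in the list exceeds day 8, so all of these values are extrapolations: they assume the linear trend continues, which the data cannot confirm.
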
The yield of a batch process in the chemical industry is known to be approximately linearly related to the
temperature, at least over a limited range of temperatures. Two measurements of the yield are made at each of eight
temperatures within this range, with the following results: (1) Create a Scatter Plot (2) For each temperature,
calculate the mean of the 2 yields. (3) Calculate the equation of the regression line of this mean yield on the
temperature. (4) Predict, from the regression line, the yield of a batch at each of the following temperatures: 175,
185, and 300

Temp C (X)   Yield in Tonnes (Y)   Average
180          136.2     136.9       136.6
190          147.5     145.1       146.3
200          153.0     155.9       154.5
210          161.7     167.8       164.8
220          176.6     164.4       170.5
230          194.2     183.0       188.6
240          194.3     175.5       184.9
250          196.5     219.3       207.9

[Scatter plot: the two yield measurements (tonnes) against Temp C (X)]
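
Step (2), the mean of the two yields at each temperature, is a simple average of each pair. A minimal sketch, assuming the yields are transcribed correctly from the table; the variable names are illustrative.

```python
# Temperatures (deg C) and the two yield measurements (tonnes) at each one.
temps = [180, 190, 200, 210, 220, 230, 240, 250]
yields = [
    (136.2, 136.9),
    (147.5, 145.1),
    (153.0, 155.9),
    (161.7, 167.8),
    (176.6, 164.4),
    (194.2, 183.0),
    (194.3, 175.5),
    (196.5, 219.3),
]

# Mean yield per temperature; these means are what the regression is fitted to.
mean_yields = [(first + second) / 2 for first, second in yields]

for temp, mean_yield in zip(temps, mean_yields):
    print(f"{temp} C: {mean_yield:.2f} tonnes")
```

The table shows these means to one decimal place; the unrounded means are the ones that reproduce the regression output in this problem.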

SUMMARY OUTPUT

Regression Statistics
Multiple R          0.982854
R Square            0.966001
Adjusted R Square   0.960335
Standard Error      4.731022
Observations        8

ANOVA
             df   SS         MS         F          Significance F
Regression   1    3815.717   3815.717   170.4772   1.244E-05
Residual     6    134.2954   22.38256
Total        7    3950.012

             Coefficients   Standard Error   t Stat      P-value     Lower 95%   Upper 95%   Lower 95.0%   Upper 95.0%
Intercept    -35.68452      15.78415         -2.260783   0.064479    -74.30694   2.93789     -74.30694     2.93789
Temp C (X)   0.953155       0.073001         13.05669    1.244E-05   0.774527    1.131782    0.774527      1.131782

Regression Equation Line

y=-35.68+0.95x

Temp C (X)   Predicted Yield in Tonnes (Y)
175.0        130.57
185.0        140.07
300.0        249.32
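
The yield regression and its predictions can be sketched as follows, assuming the unrounded mean yields computed from the two measurements at each temperature; `fit_line` is a hypothetical helper, not something from the original workbook. The predicted values listed in this problem match the equation with coefficients rounded to two decimals, so unrounded coefficients give slightly different numbers, most visibly at 300.

```python
# Temperatures (deg C) and unrounded mean yields (tonnes) from the data table.
temps = [180, 190, 200, 210, 220, 230, 240, 250]
mean_yields = [136.55, 146.3, 154.45, 164.75, 170.5, 188.6, 184.9, 207.9]


def fit_line(xs, ys):
    """Return (intercept, slope) of the least-squares line y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    s_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    s_xx = sum((x - mean_x) ** 2 for x in xs)
    slope = s_xy / s_xx
    return mean_y - slope * mean_x, slope


intercept, slope = fit_line(temps, mean_yields)
print(f"y = {intercept:.5f} + {slope:.6f}x")

predictions = {}
for temp in (175, 185, 300):
    predictions[temp] = intercept + slope * temp
    # The line is only supported by data between 180 and 250 deg C.
    in_range = min(temps) <= temp <= max(temps)
    note = "within data range" if in_range else "outside data range"
    print(f"{temp} C: {predictions[temp]:.2f} tonnes ({note})")
```

Of the three temperatures, only 185 lies inside the observed range of 180 to 250. The value at 300 in particular relies on the linear relationship holding far beyond the range over which the problem says it is known to apply.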
