0 ratings0% found this document useful (0 votes) 345 views10 pagesS6 Correlation and Regression Tutorial
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content,
claim it here.
Available Formats
Download as PDF or read online on Scribd
RAFPLES INSTEPUTION
H2 Mathematics 9758
2017 Year 6
nd Regressic
Correlat
Section A (Basic Questions)
1
w wy ©
Three sets of bivariate data have been plotted on scatter diagrams, as illustrated. In each
diagram the product moment correlation coefficient takes one of the values -1, -0.8, 0, 0.8,
1. State the appropriate value of the correlation coefficient corresponding to the scatter
diagrams (A), (B) and (C).
2 Explain why it is advisable to draw a scatter diagram before interpreting a correlation
coefficient calculated for a sample drawn from a bivariate distribution nu
Sketch a scatter diagram that shows an obvious relation between two variables but which
yields a linear product-moment correlation coefficient that is close to zero. i}
3 ns, of catalyst used in a chemical reaction and the resulting times, y
hours, taken to complete the reaction were recorded. The results are given in the following
table.
30 35 40 45 50
62.1 51.2 44.1 39.2 35.0 37.3 33.0
‘a
Calculate the value of the product moment correlation coefficient for the data.
State what your value indicates about the relation between and y, and what you
expect about the imple.
catter diagram from thi
[ (api) -0.925]
Tutorial S6: Correlation and Regression
Page | of 102017 Year 6
Raffles Institution 112 Mathematics
4 The following data is collected from a tial of a weedhiller
jv [1 | 20] x0 | 40 | so | oo | 7 | x0 | 90
[oe fae fe far fortis fas pu te fa]
Trial field areas, each of area | hectare, were treated with different volumes of weedkiller
The volume of weedkiller applied, is v litres, and v is the number of weeds found in the
corresponding hectare after the weedkiller is applied
e of «ony inthe form x= a+ by
Calculate the equation of the regression |
Use your answer to estimate the number of weeds that would be found in an area of 1
hectare treated with 35 litres of weedkiller
Interpret the coetticient 4 in the conteat of volume of weedkiller and the number of weeds.
Give a reason why a may not necessarily give the expected number of weeds per hectare
when no weedkiller is used, e
Explain why in this context a linear model would probably not be appropriate for large
values of ¥ [x =43.4-0.4lv; 29]
5 The variables x and y are believed to be linearly related and a set of values of x and y
are obtained experimentally. Both variables are subject to experimental error. State
circumstances under which the regression line of x ony should be used, rather than the
regression line of y on x, when the value of one variable is to be estimated from a given
value of the other.
suitable ut
The heart rate (r) and diastolic blood pressure (y ), bot . Were measured
for each of 10 hospital patients after being given a certain drug. The results were as
follows. eet
x 49 1 54 58 63 64 68 70 75 78
90 88 85 91 82 85 76 77 70 71
(i) By referring to the scatter diagram of the data obtained using your GC, state what e
the scatter diagram indicates about the correlation between x and y'.
Gi) Calculate the equation of the estimated least square regression line of yon x in
the form y=a+be, giving the values of a and b correct to 3 significant figures.
Use a suitable regression line to estimate the heart rate when the diastolie blood
pressure
ea reason why it might be unwise to use either of the regression lines to estimate
the diastolic blood pressure when the heart rate is 90.
(iv) Find the product moment correlation coefficient for the data and state, giving a
reason, whether ‘your statement in part (i).
[ i)» =126-0.704.. (iii) 64.8, (iv) -0.919]
value ¢
relation and Regression
Page 2 of 10Raffles Institution H2 Mathematies
6
2017 Year 6
. ts given by the formula
The radiation intensity 7 at ime ¢, from a
P= Ae", where /, and & are constants. Show that the relation between In/- and ¢ is
linear.
The following data were obtained from a particular source. The values of ¢ may be
act, while the values of / are subject to experimental error
considered to be
02 04 06 08 10
3.22 1.03 0.89 OAL 0.36
nv of the estimated regression line of In/ on ¢ and hence give
wo
estimates for 1, and A
Calculate the radiation intensity that would be expected at time £= 0.5
Calculate the product moment correlation coefficient between ¢ and In
(iy) Explain why it is appropriate to use the regression equation obtained in (i) to
7 =1.5. Obtain this value.
estimate the value of ¢ whe
[Inf =-2.88/+1.65, fy =5.23, k =2.88, (if) 1.24, (iii) 0.984, (iv) 0.433 |
For a random sample of 12 observations of pairs of values (x,y), the equation of the
regression line of y on x is y=4.82-2.25x. The sum of the 12 values of x is 20.64
and the product moment correlation coefficient for the sample is -0.3.
ji Find the sum of the 12 values of y .
(ii) Find the estimated value of y when x=2.8 and comment on the reliability of this
estimate,
1@ 114, (i) -1.48]
6: Correlation and Regression
Page 3 of 10Raffles Institution 12 Mathematic 2017 Year 6
8 9740/2008/02/Q8
A certain metal discolours when exposed te air Lo protect the
it iy treated with a chemical In
were applied to standard samples of the metal, and the
discolour were measured. The results are given in th
tal against discolouring,
experiment, different quantities, «inl, of the chemical
for the metal to
[2 | 20 | a7 [aw | ae | se | oo
[+ 2.2 45 | 58 | 73 76 | 90 | 99
(Calculate the product moment correlation coefficient between «and ¢ and explain
whether your answer suggests that linear model is appropriate BI
(ii) Draw a seater diayraim for the data (1
One of the values of 1 appears to be incorreet
Indicate the corresponding point on your diagram by labelling it, and explain
why the seater diagram for the t ing points may be consistent with a model of
the form f= a+ bInx (2)
(iv) Omitting?’ calculate least squares estimates of and b for the mode f= a+b In.x
(21
(vy) Assume that the value of wat P is correct. Estimate the value of ¢ for this value of
x ty
(vi) Comment on the use of the model in part (iv) in predicting the value of ¢ when
x=80 uy
10.970, Gii) P(4.8,7.6), (iv) a= 1.42, b=4.40,(v) 83 |
9 8863/2008/01/Q11
An engineering company makes cranes. The numbers, x, sold in each three-month period
for two years, together with the profits, y thousand dollars, on the sale of these cranes are
given in the following table.
[x 15 7 [13 [21 J 16 | 2 14 18
[oy] 200 | 350 [270 | 430 | 340 | 410 | 300 [360
(Give a sketch of the scatter diagram for the data as shown in your calculator. [2]
(ii) Find ¥ and j, and mark the point (¥, 7) on your scatter diagram. (2)
(ii) Calculate the equation of the regression line of y on x, and draw this line on your
seatter diagran 2
(iv) Calculate the product moment correlation, and comment on its value in relation to
your scatter diagram. 2
(v) For the next three-month period, the sales target is 20 cranes. Estimate the
corresponding profit. (2)
(vi) The company’s sales dir in (iii) to predict the profit if
40 cranes were to be sold in a three-month period. Comment on the validity of this
prediction. 21
[¥=17,7 =344 (iii) y =17.1e+53.3 (iv) 0.969 (v) $395,000]
tor uses the regression
Tutorial S6: Correlation and Regression
Page 4 of 10Raffles Institution H2 Mathematics 2017 Year
10 9740/2010/02/Q10
A car is placed ina wind tunnel
appropriate units is recorded. The results are showt
I the dray force F for different wind speeds v, in
vf ofa [x | 2 fw | 2 | [ow] 2 | aw]
roo [2st si | xs | a2] is] v6 | 20] are fas |
i) Draw the seater diagram for these values, labelling the axes clearly Ry
Itis thought that the drag force F can be modelled by one of the formulae
Feathy or Feetdy
where a, b, © and dare constants.
Find, correct to 4 decimal places, the value of the product moment correlation
coefficient between
(a) v and F,
(b) Wand F 2)
Use your answers to parts (i) and (ii) to explain which of F = a+ by or B= 6-4 dv’
is the better model. a
(iv) Itis required to estimate the value of v for which F = 26.0. Find the equation of a
why neither
nd the required estimate. Expl
suitable regression line, and use it to
the regression line of v on F nor the regression line of v’ on F should be
used. 14)
{Gi () 0.9860 ,(b) 0.9907 , (ivy =30.7 |
i 9740/2013/02/Q10
(Sketch a scatter diagram that might be expected when x and y are related
approximately as given in each of the cases (A), (B) and (C) below. In each case
your diagram should include 6 points approximately equally spaced with respect to
x, and with all x- and y-values positive. The letters a, 6, c, d, ¢ and f represent
constants,
(A) y=a-bx*, where a is positive and b is negative,
(B) y=c+dinx, where c is positive and d is negative,
©) y=e+£, where e is positive and fis negative, BI
x
A motoring website gives the following about the distance travelled, y km, by a certain type
of car at different speeds, x kmh, ona _
a8 [96 120 128
Distance,y [148 [147 126 107
Draw the scatter diagram for thes
1)
Explain which of the three cases in part (i) is the most appropriate for modelling
these values, and calculate the product moment correlation coefficient for this case.
QI
(iy) tis required to estimate the distance travelled at a speed of 110 kmh. Use the case
that you identified in part (iii) to find the equation of a suitable regression line, and
uuse your equation to find the required estimate. BI
[dii) (A), -0.939 (iv) y = 190-0.00462x", 134km]
‘Tutorial $6: Correlation ai
Page 5 of 10