EXPERT CLASSES
Assignment of Statistics
Standard 12th, Chapter 3 (Linear Regression)
LIST OF FORMULAE
• The regression line is denoted by ŷ:
ŷ = a + bx
• The 5 steps to find the regression line are:
1. Table
2. x̄ and ȳ
3. Regression coefficient b:
b = [n∑uv − (∑u)(∑v)] / [n∑u² − (∑u)²] × (Cy / Cx), where u = (X − A) / Cx and v = (Y − B) / Cy
4. a = ȳ − b x̄
5. ŷ = a + bx
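The five steps above can be sketched in Python. This is only an illustrative sketch: the function name fit_line, the rounding of b to two decimals before finding a (which mirrors the answers in this sheet), and the choice A = 137, B = 54, Cx = Cy = 1 are assumptions made for the example. The data are the heights and weights of Long Sum 7.

```python
def fit_line(xs, ys, A=0.0, B=0.0, Cx=1.0, Cy=1.0, places=2):
    """Return (a, b) for y-hat = a + b*x using u = (x - A)/Cx and v = (y - B)/Cy."""
    n = len(xs)

    # Step 1: table of u, v, uv and u^2
    us = [(x - A) / Cx for x in xs]
    vs = [(y - B) / Cy for y in ys]
    sum_u, sum_v = sum(us), sum(vs)
    sum_uv = sum(u * v for u, v in zip(us, vs))
    sum_u2 = sum(u * u for u in us)

    # Step 2: means of x and y
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n

    # Step 3: regression coefficient b (rounded, as in a hand calculation)
    b = (n * sum_uv - sum_u * sum_v) / (n * sum_u2 - sum_u ** 2) * (Cy / Cx)
    b = round(b, places)

    # Step 4: intercept a = y-bar - b * x-bar
    a = round(y_bar - b * x_bar, places)

    # Step 5: the fitted line is y-hat = a + b*x
    return a, b


heights = [130, 135, 140, 143, 150, 132, 128, 138]  # X (cm), Long Sum 7
weights = [50, 52, 56, 56, 58, 54, 55, 51]          # Y (kg), Long Sum 7

a, b = fit_line(heights, weights, A=137, B=54)
estimate_150 = round(a + b * 150, 2)
print(f"y-hat = {a} + {b}x")  # y-hat = 21.12 + 0.24x
print(estimate_150)            # 57.12
```

Taking A and B equal to the means makes ∑u and ∑v zero, so step 3 reduces to ∑uv / ∑u².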
• Coefficient of determination R²:
R² = r² = [ (n∑uv − (∑u)(∑v)) / ( √(n∑u² − (∑u)²) × √(n∑v² − (∑v)²) ) ]²
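As a hedged illustration, the formula for R² can be evaluated directly from the six sums. The function name r_squared is an assumption made for this sketch, and the sums are those given in Long Sum 15. Squaring the bracket removes the square roots, so the code divides the squared numerator by the product of the two quantities under the roots.

```python
def r_squared(n, sum_u, sum_v, sum_uv, sum_u2, sum_v2):
    """Coefficient of determination R^2 = r^2 computed from the sums of u and v."""
    numerator = n * sum_uv - sum_u * sum_v
    term_u = n * sum_u2 - sum_u ** 2  # quantity under the first square root
    term_v = n * sum_v2 - sum_v ** 2  # quantity under the second square root
    return numerator ** 2 / (term_u * term_v)


# Sums from Long Sum 15
r2 = r_squared(n=6, sum_u=11, sum_v=12, sum_uv=22, sum_u2=139, sum_v2=30)
print(r2)  # 0.0
```

A value of R² near 1 indicates a reliable regression model, and a value near 0 an unreliable one.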
Short Sums
Long | Highlighters | Short
- | Cov(x, y) | b = Cov(x, y) / Sx²
- | r | b = r × Sy / Sx
b = ∑(x − x̄)(y − ȳ) / ∑(x − x̄)² | ∑(x − x̄)(y − ȳ) | b = ∑(x − x̄)(y − ȳ) / (n × Sx²)
b = [n∑xy − (∑x)(∑y)] / [n∑x² − (∑x)²] | ∑xy | b = (∑xy − n x̄ ȳ) / (n × Sx²)
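The short formulae for b can be sketched as small Python helpers, one per highlighter. The function names are hypothetical and chosen only for this illustration; the numbers come from Short Sums 8, 5, 15 and 7. The sketch assumes Sx² is the variance with divisor n, so that n × Sx² = ∑x² − n x̄².

```python
def b_from_cov(cov_xy, s_x):
    """Highlighter Cov(x, y): b = Cov(x, y) / Sx^2."""
    return cov_xy / s_x ** 2


def b_from_r(r, s_y, s_x):
    """Highlighter r: b = r * Sy / Sx."""
    return r * s_y / s_x


def b_from_deviation_sums(sum_dx_dy, n_var_x):
    """Highlighter sum of (x - x-bar)(y - y-bar): divide it by n * Sx^2."""
    return sum_dx_dy / n_var_x


def b_from_sum_xy(n, sum_xy, x_bar, y_bar, sum_x2):
    """Highlighter sum of xy: b = (sum_xy - n*x_bar*y_bar) / (n * Sx^2)."""
    n_var_x = sum_x2 - n * x_bar ** 2  # n * Sx^2
    return (sum_xy - n * x_bar * y_bar) / n_var_x


b8 = b_from_cov(-270, 10)                     # Short Sum 8
b5 = b_from_r(0.9, 5, 3)                      # Short Sum 5 (Sx : Sy = 3 : 5)
b15 = b_from_deviation_sums(200, 250)         # Short Sum 15
b7 = b_from_sum_xy(50, 10000, 13, 15, 8800)   # Short Sum 7

print(round(b8, 2), round(b5, 2), round(b15, 2), round(b7, 2))  # -2.7 1.5 0.8 0.71
```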
If it's Expert, you can trust it!
Answer in one sentence
1. What type of relation between two correlated variables is regression?
Ans: Regression is a functional relation between two correlated variables.
2. State the literal meaning of regression.
Ans: The literal meaning of regression is 'to revert' or 'to return to the average value'.
3. What is a model?
Ans: A set of one or more equations representing a relation or a problem is called a model.
4. What is an explanatory variable?
Ans: Of two variables having a cause-effect relationship, the causal variable is called the explanatory
variable.
5. What is a regression model?
Ans: A statistical model that describes the cause-and-effect relationship between two variables is called a
regression model.
6. What is the constant a called?
Ans: The constant a is called the intercept of the line of regression.
7. By what other name is the regression coefficient b known?
Ans: The regression coefficient b is also called the slope of the regression line of Y on X.
8. Define: Linear Regression.
Ans: A functional relation between two correlated variables in which the changes in the values of the variables
are in approximately constant proportion, so that the relationship can be represented by a straight line.
9. Define: Regression Coefficient.
Ans: The approximate change in the value of the dependent variable for a unit change in the value of the
independent variable. It is also known as the slope of the regression line.
10. What is fitting of a regression line?
Ans: The procedure of obtaining the estimated linear regression line is called fitting of the regression line.
11. State the methods of fitting a line of regression.
Ans: There are two methods of fitting a line of regression: (i) the method of scatter diagram and (ii) the
method of least squares.
12. By which method is the regression line obtained?
Ans: The regression line is obtained by the method of least squares.
13. Which statistician initiated the study of regression?
Ans: English statistician Sir Francis Galton initiated the study of regression.
14. What is meant by the best fitted line of regression?
Ans: Out of all the possible lines passing near the points on the scatter diagram, the line for which the
sum of squares of the errors of estimation e = y − ŷ is minimum is called the best fitted line of
regression.
15. What is coefficient of determination?
Ans: The measure of reliability of the estimate of Y obtained by linear regression is called the coefficient of
determination. It is denoted by the symbol R².
16. What is the value of error if a sample point is on the fitted line?
Ans: The value of error is zero, if a sample point is on the fitted line
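The error of estimation e = y − ŷ from questions 14 and 16 can be illustrated with a short Python sketch. The function name estimation_error is an assumption for this example; the fitted lines and observations are those of Short Sums 1 and 2.

```python
def estimation_error(a, b, x, y_observed):
    """Error of estimation e = y - y-hat for the fitted line y-hat = a + b*x."""
    y_hat = a + b * x
    return y_observed - y_hat


e1 = estimation_error(80, 2.5, 12, 112)  # observation (12, 112)
e2 = estimation_error(80, 4.5, 20, 170)  # observation (20, 170)
print(e1)  # 2.0
print(e2)  # 0.0
```

The second error is zero because the point (20, 170) lies exactly on the fitted line.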
Short Sums
1. The fitted regression line of Y on X is ŷ = 80 + 2.5𝑥 and one of the observations used in fitting of the line
is (12, 112). Find the error in estimating Y for 𝑥 = 12
Ans: e = 2
2. The fitted regression line of Y on X is ŷ = 80 + 4.5𝑥 and one of the observations used in fitting of the line
is (20, 170). Find the error in estimating Y for 𝑥 = 20 and interpret it.
Ans: e = 0
3. If b = 1.2, r = 0.6 and the variance of Y is 144, find the standard deviation of X.
Ans: Sx = 6
4. If the slope of the regression line of Y on X is 0.5 and the variances of X and Y are 576 and 225
respectively, find the coefficient of determination.
Ans: r = 0.8, R² = 0.64
5. If the coefficient of determination is 0.81 and Sx : Sy = 3 : 5, find the value of the regression coefficient.
Ans: r = 0.9, b = 1.5
6. If b = 24/50, Sy² = 16 and Sx² = 25, find r.
Ans: r = 0.6
7. The following data are collected on variables X and Y:
n = 50, x̄ = 13, ȳ = 15, Σxy = 10000, Σx² = 8800 and Σy² = 11000
From the above data, obtain the regression line of Y on X and estimate Y for X = 20.
Ans: ŷ = 5.77 + 0.71x, ŷ = 19.97
8. Obtain the coefficient of determination and the regression line of Y on X from the following data:
n = 10, x̄ = 100, ȳ = 400, Sx = 10, Sy = 30 and Cov(x, y) = −270
Ans: ŷ = 670 − 2.7x, R² = 0.81
9. The regression line of Y on X is ŷ = a − 0.2x and x̄ = 85, ȳ = 100, Sx² = 225, Sy² = 900.
(i) Find a. (ii) Estimate Y for X = 90. (iii) Find r(x, y). (iv) Find R².
Ans: (i) a = 117 (ii) ŷ = 99 (iii) r(x, y) = −0.1 (iv) R² = 0.01
10. The regression line of Y on X is ŷ = a + bx.
If x̄ = 2, ȳ = 198, the variance of X = 0.40 and Cov(x, y) = 19, find the values of a and b.
Ans: a = 103, b = 47.5
11. For bivariate data, b_yx = 0.15 and Sx² : Sy² = 25 : 64. Find R² and interpret it.
Ans: R² = 0.0088. R² is near zero, hence the regression model is not reliable.
12. From the following data, obtain the estimate of the weight if the height is 150 cm.
Particulars | Height (cm) X | Weight (kg) Y
No. of observations | 8 | 8
Mean | 137 | 54
Sum of squares of deviations taken from the mean | 374 | 380
Sum of products of deviations taken from the means | 90
Ans: ŷ = 21.12 + 0.24x, ŷ = 57.12 kg
13. If the slope of the regression line of Y on X is 0.5 and the variances of X and Y are 576 and 225 respectively,
find the coefficient of determination.
Ans: r = 0.8, R² = 0.64
14. If the coefficient of determination is 0.81 and Sx : Sy = 3 : 5, find the value of the regression coefficient.
Ans: r = 0.9, b = 1.5
15. If Σ(x − 40) = 0, Σ(y − 50) = 0, Σ(x − 40)(y − 50) = 200, Σ(x − 40)² = 250 and Σ(y − 50)² = 400,
find the values of a and b for the regression line of Y on X.
Ans: b = 0.8, a = 18
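Several of the short sums rest on the same relations: b = r × Sy / Sx, R² = r² and a = ȳ − b x̄. A hedged Python sketch follows, checked against Short Sums 4, 6 and 10; the function names are illustrative and not part of the sheet.

```python
from math import sqrt


def r_from_b(b, var_x, var_y):
    """Rearranged b = r * Sy / Sx, giving r = b * Sx / Sy (Sx, Sy are roots of the variances)."""
    return b * sqrt(var_x) / sqrt(var_y)


def line_from_cov(x_bar, y_bar, var_x, cov_xy):
    """Return (a, b) with b = Cov(x, y) / Sx^2 and a = y-bar - b * x-bar."""
    b = cov_xy / var_x
    a = y_bar - b * x_bar
    return a, b


r4 = r_from_b(0.5, 576, 225)                # Short Sum 4
r6 = r_from_b(24 / 50, 25, 16)              # Short Sum 6
a10, b10 = line_from_cov(2, 198, 0.40, 19)  # Short Sum 10

print(round(r4, 2), round(r4 ** 2, 2))  # 0.8 0.64
print(round(r6, 2))                      # 0.6
print(round(a10, 2), round(b10, 2))      # 103.0 47.5
```

Because Sx and Sy are positive, r always carries the same sign as b.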
Long Sums
1. Marks scored by 9 candidates in a 10-mark test before the training and after the training are given below:
Marks before the training (X) 2 3 5 4 3 6 8 5 7
Marks after the training (Y) 3 3 4 6 5 7 9 9 8
From the data, obtain the best fitted line of regression of marks after the training (Y) on marks before the training (X).
Ans: ŷ = 1.21 + 0.98x
2. The following data give the information regarding the years of experience and the monthly salary of 10
employees of a firm:
Years of experience 2 3 5 4 7 6 9 1 10 8
Monthly salary (Rs. 1000) 2 4 4 4 6 6 7 2 9 7
From that, obtain the best fitted line of regression of monthly salary (Y) on years of experience (X).
Estimate the monthly salary of an employee having experience of 12 years.
Ans: ŷ = 1.08 + 0.73x, ŷ = 9.84 (thousand Rs.), i.e. ₹9,840
3. From the following data on length (X) and weight (Y) of seven articles, obtain the best fitted line of
regression of weight (Y) on length (X). Find the estimated weight of an article with length 15 cm.
Length (cm) 5 6 10 7 9 12 8
Weights (gms) 11 13 12 9 17 18 19
Ans: ŷ = 6.38 + 0.9x, ŷ = 19.88 ≈ 20 g
4. In order to study the relationship between the proportions of people having higher education (X) given
in terms of the percentage of population and the per capita income (Y) of a state, a random sample of
some states was taken and the sample inquiry revealed the following data:
State | Proportion of people having higher education (in % of population) | Per capita income (in thousand Rs.)
A | 20 | 3
B | 25 | 4
C | 10 | 3
D | 5 | 2
E | 30 | 4
F | 10 | 2
G | 15 | 2
Fit the line of regression of Y on X by the method of least squares from the data. Find the coefficient of
determination.
Ans: ŷ = 1.38 + 0.09x, R² = 0.73
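As an illustrative check of this sum, the sketch below works directly with Σx, Σy, Σxy, Σx² and Σy² computed from the table. The variable names, and the rounding of b to two decimals before finding a (which matches the answer above), are assumptions of this example.

```python
education = [20, 25, 10, 5, 30, 10, 15]  # X: % of population with higher education
income = [3, 4, 3, 2, 4, 2, 2]           # Y: per capita income (thousand Rs.)

n = len(education)
sum_x, sum_y = sum(education), sum(income)
sum_xy = sum(x * y for x, y in zip(education, income))
sum_x2 = sum(x * x for x in education)
sum_y2 = sum(y * y for y in income)

# Building blocks shared by b and R^2
s_xy = n * sum_xy - sum_x * sum_y
s_xx = n * sum_x2 - sum_x ** 2
s_yy = n * sum_y2 - sum_y ** 2

# Regression coefficient b by the method of least squares
b = round(s_xy / s_xx, 2)

# Intercept a = y-bar - b * x-bar
a = round(sum_y / n - b * sum_x / n, 2)

# Coefficient of determination R^2 = r^2
r2 = round(s_xy ** 2 / (s_xx * s_yy), 2)

print(f"y-hat = {a} + {b}x, R^2 = {r2}")  # y-hat = 1.38 + 0.09x, R^2 = 0.73
```

If b is not rounded before step 4, a comes out slightly larger (about 1.46), so the rounding convention matters when comparing with printed answers.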
5. From the following information on production (X) and total value (Y), obtain the best fitted line of
regression of total value on production. Estimate the total value when the production is 15 thousand tons.
Production (thousand tons) 3 4 11 6 8 5 12
Total value (Rs. 1000) 230 310 870 670 800 350 900
Ans: ŷ = 49.46 + 77.22x, ₹12,07,760
6. From the following data on monthly wages and cost of living index number, obtain the best fitted line of
regression of monthly wages (Y) on cost of living index number (X). Estimate the monthly wages when the
cost of living index number is 480.
Cost of living index 180 215 220 245 250 280 300 340 400
Monthly wages (Rs. 100) 25 30 30 32 32 35 36 37 40
Ans: ŷ = 16.8 + 0.06x, ŷ = 45.6 (hundred Rs.), i.e. ₹4,560
7. From the following data, obtain the estimate of the weight of a student if his height is 150 cm:
Height X (cm) 130 135 140 143 150 132 128 138
Weights Y (kg) 50 52 56 56 58 54 55 51
Ans: ŷ = 21.12 + 0.24x, ŷ = 57.12 kg, e = 0.88
8. From the following data on the demand (Y) and the price (X) of an item obtain the best fitted line of
regression of demand on price. Estimate the demand when price is 40.
Price of the items (Rs.) 32 33 35 37 36 35 30
Demand of the items (Pieces) 68 56 50 51 55 61 62
Ans: ŷ = 158.96 – 2.94x, ŷ = 41
9. The following table gives the price of TV sets and their demand during a certain period of time:
Price of TV sets (Rs. 1000) 9.5 10 10.5 11 13.5 14 16.5 19
Demand of TV sets (Pieces) 95 80 100 90 70 90 65 50
Obtain the best fitted line of regression of demand of TV sets (Y) on price of TV sets (X). Find the estimated
demand of TV sets when the price is Rs. 12,500.
Ans: ŷ = 136.16 – 4.32x, ŷ = 82
10. The following table gives the marks of 10 students of standard 12 in Commercial Mathematics and
Language:
Marks in Commercial Mathematics 45 55 56 58 60 65 68 70 80 85
Marks in Language 40 60 48 52 50 55 45 40 50 70
Obtain the best fitted line of regression of marks in Language (Y) on marks in Commercial Mathematics (X).
Estimate the marks in Language when the marks in Commercial Mathematics are 75.
Ans: ŷ = 25.39 + 0.43x, ŷ = 58
11. Two judges A and B have given marks independently to 5 plays as below. When the sixth play was
enacted, judge B was absent and Judge A gave 37 marks to that play. Obtain the best fitted line of
regression of the marks given by Judge B on the marks given by Judge A and estimate the marks that would
be given to the sixth play by judge B:
Serial no. of play 1 2 3 4 5
Marks given by judge A (X) 46 44 43 42 40
Marks given by judge B (Y) 42 36 39 38 35
Ans: ŷ = -2.85 + 0.95x, ŷ = 32 marks
12. The data on the production (X) and the expense (Y) of an item are tabulated below:
Production (lakh Rs.) 48 52 60 40 68 50 75 70 58
Expense (lakh Rs.) 15 15 20 10 22 15 25 22 20
Obtain the linear regression model ŷ = 𝑎 + 𝑏𝑥 representing the relation between the expense (Y) and the
production (X). Find the error in estimating expense (Y) (lakh rupees) for production (X) = 50 (lakh rupees)
Ans: ŷ = -4.94 + 0.4x, Error e = -0.06
13. Verify the reliability of the regression model and find the intercept of the regression line from the
following data:
x̄ = 20.6, Σx = 103, Σ(y − 18) = 1, Σxy = 3448, Σx² = 3675, Σy² = 3250
Ans: R² = 1
14. The following results are obtained for a data:
n = 8, Σx = 56, Σy = 122, Σxy = 868, Σx² = 404
Later on, it was found that one pair (5, 20) was wrongly taken as (8, 15). By correcting the above
measures, obtain the regression line of Y on X. Estimate Y for X = 8.
Ans: ŷ = 12.7 + 0.48x, ŷ = 16.54
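The correction in this sum amounts to removing the wrong pair's contribution from each total and adding the correct pair's. A minimal Python sketch follows. The variable names and the helper round2 are illustrative, and the decimal module is used so that rounding to two places behaves like a hand calculation (half rounds up), which is an assumption about how the answer above was rounded.

```python
from decimal import Decimal, ROUND_HALF_UP


def round2(value):
    """Round a Decimal to two places, halves rounding up as in hand calculation."""
    return value.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


n = 8
sum_x, sum_y, sum_xy, sum_x2 = 56, 122, 868, 404

wrong_x, wrong_y = 8, 15  # pair that was wrongly used
right_x, right_y = 5, 20  # pair that should have been used

# Remove the wrong pair's contribution and add the correct pair's
sum_x += right_x - wrong_x
sum_y += right_y - wrong_y
sum_xy += right_x * right_y - wrong_x * wrong_y
sum_x2 += right_x ** 2 - wrong_x ** 2

# b from the corrected sums, then a = y-bar - b * x-bar
b = round2(Decimal(n * sum_xy - sum_x * sum_y) / Decimal(n * sum_x2 - sum_x ** 2))
a = round2(Decimal(sum_y) / n - b * Decimal(sum_x) / n)
estimate = round2(a + b * 8)

print(sum_x, sum_y, sum_xy, sum_x2)  # 53 127 848 365
print(a, b, estimate)                # 12.70 0.48 16.54
```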
15. The following results are obtained for a data: n = 6, Σu = 11, Σv = 12, Σuv = 22, Σu² = 139, Σv² = 30
Verify the reliability of the regression model.
Ans: R² = 0