1
Notes and Practice Problem Set on Module 4
Module-4: Probability Distribution and Sampling Theory
4.1 Probability Distribution: Poisson and Normal distribution
4.2 Sampling distribution Test of Hypothesis, Level of Significance, Critical
region, One-tailed, and two-tailed test, Degree of freedom.
4.3 Students’ t-distribution (Small sample). Test the significance of mean
and Difference between the means of two samples. Chi-Square Test: test of
goodness of fit and independence of attributes, Contingency table.
In a Random Experiment, all the possible outcomes are known, however,
its exact outcome cannot be precisely predicted in advance. There are
certain terms associated with random experiments that are given as follows:
Outcome: An outcome is a possible result of the random experiment.
Sample space: A sample space can be defined as the list of all possible
outcomes of a random experiment.
Event: An event is a possible outcome of an experiment and forms a subset
of the sample space.
Trial: When a random experiment is repeated many times each one is
known as a trial.
An example of random experiment: Suppose a coin is tossed. The two
possible outcomes are getting a head or a tail. The outcome of this
experiment cannot be predicted before it has been performed. Furthermore,
it can be conducted many times under the same conditions. Thus, tossing a
coin is an example of a random experiment.
Random variable is a variable that is used to quantify the outcome of a
random experiment.
Example-1: Suppose 2 dice are rolled and the random variable, X, is used to
represent the sum of the numbers. Then, the smallest value of X will be
equal to 2 (1 + 1), while the highest value would be 12 (6 + 6). Thus, X could
take on any value between 2 to 12 (inclusive)
Prof. Divesh Singh KJSIT, Sion
2
Types of Random Variables
Examples of discrete variables include:
Years of schooling
Number of goals made in a soccer match
The number of workers in an office
The number of steps you take in a day
The number of babies born each day
Votes for a particular politician
Number of times a coin lands on heads after ten coin tosses.
Examples of continuous variables include:
Prof. Divesh Singh KJSIT, Sion
3
The time it takes for office employees to commute to work
The distance you walk in a day
The weight of baby elephants
The body temperature of patients with the flu
Probability distribution for a discrete random variable
The probability distribution for a discrete random variable X is a formula,
table, or graph that gives all the possible values of x and the probability P (x)
associated with each value of X.
Poisson Distribution
𝑒 −𝑚 (𝑚)𝑥
𝑃(𝑋 = 𝑥) = , where m is mean
𝑥!
Problems
1. The mean and variance of the Poisson distribution is 2. Write down
𝑒 −2 (2)𝑥
the distribution [Ans: 𝑃(𝑋 = 𝑥) = , 𝑥 = 0,1,2,3 ….]
𝑥!
2. In a Poisson distribution P(x=3) is 2/3 of P(x=4) . Find the mean and
standard deviation. [Ans: 6,√6]
3. If X is Poisson variate such that 𝑃(𝑋 = 1) = 𝑃(𝑋 = 2), find 𝐸(𝑋 2 ).
[Ans: 6]
4. If the mean of Poisson distribution is 2. Find the probabilities of
x=1,2,3,4 from the recurrence relation of probability.
5. Assume that the probability of an individual coal miner being killed in
a mine accident during a year is 1/2400 . Use appropriate statistical
distribution to calculate the probability that in a mine employing 200
miners there will be at least one fatal accident every year.
[Ans:1/12,0.08]
6. Suppose that a local appliances shop has found from experience that
the demand for the tube lights is roughly distributed as Poisson with
mean of 4 tube lights per week. If the shop keeps 6 tube lights during
a particular week. What is the probability that the demand will exceed
the supply during that week. [Ans: 0.0183]
7. If the probability that an individual suffers a bad reaction from a
particular injection is 0.001, determine the probability that out of
2000 individuals (i) exactly three (ii) more than 2 individuals will
suffer a bad reaction. [Ans: (i) 0.1804 (ii) 0.3233]
8. Fit a Poisson distribution to the following data
X: 0 1 2 3 4 Total
f: 192 100 24 3 1 320
[Ans: m=0.5, Frequencies: 194,97,24,4,1]
Prof. Divesh Singh KJSIT, Sion
4
9. Fit a Poisson distribution to the following data
X: 0 1 2 3 4 Total
f: 211 90 19 5 0 325
[Ans: m=0.44, Frequencies: 209,92,20,3,1]
10. Fit a Poisson distribution to the following data
X 0 1 2 3 4 5 6 7 8
f 56 156 132 92 37 22 4 0 1
[Ans: Frequencies: 70,137,135,89,44,17,6,2,0]
11. Fit a Poisson distribution to the following data
X 0 1 2 3 4 5
f 142 156 69 27 5 1
[Ans: Frequencies: 147,147,74,24,6,2]
Normal Distribution
The normal distribution, also known as the Gaussian distribution is a
continuous probability distribution that is symmetrical around its mean.
Normal probability density function
𝟏 𝟏 𝒙−𝒎 𝟐
𝒆 𝟐 𝝈 )
− (
𝑓(𝑥) =
√𝟐𝝅𝝈𝟐
Where the parameter m is the mean and the parameter σ is the standard
deviation
Properties
The normal distribution is a continuous distribution that is:
bell-shaped
unimodal
symmetric
asymptotic
area under the curve is equal to 1
The mean, median, and mode are all equal
Half of the population is less than the mean and half is greater than
the mean
Prof. Divesh Singh KJSIT, Sion
5
X∼N (100, 15): Random variable X is normally distributed with mean 100
and standard deviation 15.
Normal curve with two different means and same standard deviation
Normal curves with two different standard deviations and same mean
Prof. Divesh Singh KJSIT, Sion
6
There are an infinite number of normal distribution models, as m can take
on any real number and σ any positive real number. In order to make
finding probabilities associated with the normal distribution easier, we
generally compute what is known as z -score.
Calculation of z-score
𝒙−𝒎
𝒛= 𝝈
Suppose random variable X which is normally distributed has mean 20 and
variance 5. If X takes two values 28 and 35 then corresponding z-scores are
x − m 28 − 20 8
z= = = = 1.6
σ 5 5
Prof. Divesh Singh KJSIT, Sion
7
x − m 35 − 20 15
z= = = =3
σ 5 5
about 68% of a normal distribution is within 1 standard deviation of the
mean
about 95% of a normal distribution is within 2 standard deviations of
the mean
Almost the entire curve (99.7%) of the normal distribution is within 3
standard deviations of the mean.
Prof. Divesh Singh KJSIT, Sion
8
Prof. Divesh Singh KJSIT, Sion
9
Problems:
1. For a normal variate with mean 2.5 and standard deviation 3.5 , find
the probability that
(i) 2 ≤ 𝑋 ≤ 4.5 (ii) −1.5 ≤ 𝑋 ≤ 5.5 [Ans: (i) 0.2714 (ii) 0.6610]
2. If X is a normal variate with mean 10 and standard deviation 4 , find
(ii) 𝑃(5 ≤ 𝑋 ≤ 18) (ii) 𝑃(𝑋 ≤ 12) (iii) 𝑃(|𝑋 − 14| < 1)
[Ans: (i) 0.8716 (ii) 0.6914 (iii) 0.6826]
3. If Z is standard normal variate , find c such that
(i) 𝑃(−𝑐 ≤ 𝑧 ≤ 𝑐)=0.95 (ii) ) 𝑃(|𝑧| > 𝑐) = 0.01
If X is a normal variate with mean 120 and standard deviation 10,
find c such that (𝑖𝑖𝑖)𝑃(𝑋 > 𝑐) = 0.02 (iv) 𝑃(𝑋 < 𝑐) = 0.05
[Ans: (i) 1.96 (ii) 2.58 (iii) 140.5 (iv) 103.6]
4. If X is normal variate with mean 5 and standard deviation 3. What is
the probability that the deviation from the mean of an item taken at
random will be negative? [Ans: 0.0575]
5. If X is normally distributed with mean and standard deviation 4 , find
(iii) 𝑃(5 ≤ 𝑋 ≤ 10) (ii) 𝑃(𝑋 ≥ 15) (iii) 𝑃(10 ≤ 𝑋 ≤ 15) (iv) 𝑃(𝑋 ≤ 5)
[Ans: (i) 0.3326 (ii) 0.003 (iii) 0.1557 (iv) 0.05987]
6. The daily sales of a firm are normally distributed with mean ₹ 8000
and variance of ₹ 10,000 (i) what is the probability that on a certain
day the sales will be less than ₹8210? (ii) What is % of days on which
the sales will be between ₹ 8100 and ₹ 8200?
[Ans: (i) 0.5832 (ii) 14%]
7. The mean height of soldiers is 68.22” with variance 10.8”. Find the
expected number of soldiers in a regiment of 1000 whose height will
be more than 6” (Area from z=0 to z=1.15 is 0.3749)[Ans: 125]
8. A sample of 100 dry battery cell is tested to find the length of life ,
produced the following results. 𝑋̅ = 12 ℎ𝑜𝑢𝑟𝑠 and 𝜎 = 3 ℎ𝑟𝑠 .
Assuming normal distribution what percentage of cells is expected to
have life (i) more than 15 hrs (ii) between 10 and 14 hrs . [Ans: (i)
15.87% (ii) 49.72%]
9. The income distribution of workers in a certain factory was found to
normal with mean of ₹ 500 and standard deviation ₹ 50. There were
228 persons above ₹600. How many persons were there in all? (Area
Prof. Divesh Singh KJSIT, Sion
10
under the S.N curve between height at 0 and 2 is 0.4772) [Ans:
10,000]
10. The life of army shoes is normally distributed with mean 8
months and standard deviation 2 months . If 5000 pairs are issued ,
how many pairs would be expected to need replacement after 12
months . [Ans: 2386]
11. In an intelligence test administered to 1000 students the
average was 42 and standard deviation was 24. Find the number of
students (i) exceeding 50 (ii) between 30 and 54 (iii) the least score of
top 100 students. [Ans: (i) 371 (ii)383 (iii) 72.72 ]
12. The marks obtained by students in a class are normally
distributed with mean 75 and standard deviations. If top 5% got grade
A and bottom 25% got grade B, what are the marks of the lowest of A
and what are the marks of highest of B? Also find the percentage of
the students who got marks between 60 and 70.
[Ans:(i) 83(ii)72(iii)15.74 ]
13. The local authorities in a certain city installed 10,000 electric
lamps in the streets of the city. If these lamps have average life of
1000 burning hrs with a standard deviation of 200 hrs , what number
of lamps might be expected to fail (i) in the first 800 hrs (ii) between
800 and 1200 hrs ?
After what period of burning hrs would you expect that (i) 10% of lamp
would fail (ii) 10% of lamp would be still burning? (The area between
the ordinates corresponding to S.N.V z=0 and z=1 is 0.34134 and 80%
of the area lies between the ordinate corresponding to𝑧 = ±1.25)
[Ans: (i) 1587 (ii) 6827 (iii) 750 hrs 9 (iv) 1250 hrs ]
14. In a test of 2000 electric bulbs , it was found that the life of a
particular make was normally distributed with an average life of 2040
hrs and standard deviation of 60 hrs . Estimate the number of bulbs
likely to burn for (i) more than 2150 hrs (ii) less than 1950 hrs
[Ans: (i)67 (ii) 184 ]
15. Assuming that the diameter of 100 brass plugs taken
consecutively from a normal distribution with mean 0.7515cm. and
standard deviation 0.0020 cm . how many plugs are likely to be
rejected if the approved diameter is 0.752±0.004 cms ? [Ans: 53]
16. The marks of 1000 students of a university are found to be
normally distributed with mean 70 and standard deviation 5 .
Estimate the number of students whose marks will be (i) between 60
and 75 (ii) more than 75 (iii) less than 68 [Ans: (i) 910 (ii) 23 (iii) 37]
17. In a normal distribution 31 % items are under 45 and 8% are
over 64. Find the mean and standard deviation. Find also the
percentage of items lying between 30 and 75.(Given : S.N.V.Z area
from z=0 to z=0.5 is 0.19 and that from z=0 to z=1.4 is 0.42 )
Prof. Divesh Singh KJSIT, Sion
11
[Ans: m=50 , σ = 10: 0.957]
18. Marks obtained by students in an examination follow a normal
distribution. If 30% of students got below 35 marks and 10% got
above 60 marks. Find the mean and variance of the normal
distribution.% of students who got marks between 40 and 50 .
[Ans: m=42.25 , σ = 13.81: 28%]
19. Suppose that the length in hrs , say X of light bulbs
manufactured by accompany A are normally distributed with mean
800 hrs and standard deviation of 120 hrs and those of B with mean
850 hrs and standard deviation of 50 hrs . One bulb is selected from
the production of each company and is burned till death .Find the
probability that the length of life of the bulb from company A exceeds
the length of the bulb from the company B at least by 15 hrs . [Ans:
0.2979]
20. The marks obtained by students in a college are normally
distributed with mean 65 and variance 25. If 3 students are selected
at random from this college what is the probability that at least one of
them would have scored more than 75 marks? [Ans: 0.07]
21. A manufacturer knows from his experience that the resistant of
resisters he produces is normal with mean 𝜇 = 100 𝑜ℎ𝑚𝑠 and standard
deviation σ = 2 𝑜ℎ𝑚𝑠 .What percentage of resisters will have
resistance between 98 𝑜ℎ𝑚𝑠 and 102 𝑜ℎ𝑚𝑠 ?[Ans: 68.26%]
22. If 𝑋1 and 𝑋2 are two independent random variates with mean 30
and 25 and variance 16 and 12 and if 𝑌 = 3𝑋1 − 2𝑋2 ,
find 𝑃(60 ≤ 𝑌 ≤ 80) [Ans: 0.0730]
23. In an examination marks obtained by students in Mathematics,
Physics and Chemistry are normally distributed with mean 51,53,and
46 with SD 15,12,16 respectively . Find the probability of securing
total marks (i) 180 or above (ii) 80 or below. [Ans: 0.1151 ,0.0082]
24. In a competitive examination the top 15% of the students
appeared will get grade A, while the bottom 20% will be declared fail. If
the grades are normally distributed with mean % of marks 75 and SD
10 , determine the lowest % of marks to receive grade A and the lowest
% of marks that passes . [85.4,66.6]
25. In a distribution exactly normal 7% of items are under 35 and
89% of the items are under 63. Find the probability that an item
selected at random lies between 45 and 56 . [Ans: 0.4038]
Prof. Divesh Singh KJSIT, Sion
12
Statistics: When we select a random sample from a population, the
numerical descriptive measures [mean, median, mode, variance, SD etc.] we
calculate from the sample are called statistics.
Parameters are numbers that describe the numerical descriptive measures
of entire populations.
Sampling Distribution:
The sampling distribution of a statistic is the probability distribution for
the possible values of the statistic that results when random samples of size
n are repeatedly drawn from the population.
The sampling distribution of a statistic is the probability distribution of that
statistic.
Hypothesis testing:
Making a decision about the value of a parameter based on some
preconceived idea about what its value might be.
A statistical test of hypothesis consists of five parts:
1. The null hypothesis, denoted by 𝑯𝟎
2. The alternative hypothesis, denoted by 𝑯𝒂 or 𝑯𝟏
3. The test statistic and its p-value
4. The significance level and the rejection region
5. The conclusion
Prof. Divesh Singh KJSIT, Sion
13
Level of Significance:
Prof. Divesh Singh KJSIT, Sion
14
t- test was developed by William Sealy Gosset in 1908
Assumptions for t-test:
1. Sample size should be small i.e 𝑛 ≤ 30.
2. Population is normally distributed
3. Sample should be random.
Student’s t-distribution (Small Sample)
If 𝑋̅ is the sample mean and 𝜇 is the population mean then
̅ −𝝁
𝑿
𝒕 = 𝒔/
√𝒏
̅ : Sample mean
𝑿
𝝁: Population mean
𝒔: sample standard deviation
n: sample size
Standard deviation of population can be calculated by
̅ )𝟐
∑(𝑿𝒊 −𝑿
𝒔𝟐 = , where n-1 is degree of freedom
𝒏−𝟏
Type-1 Testing the hypothesis that the population mean is 𝝁
Prof. Divesh Singh KJSIT, Sion
15
Problems:
1. Vanaspati oil is marketed in tins of 10 kg., a sample of 20 tins showed
the mean weight as 9.5 kg. with standard deviation of 3kg. . Does the
sample justify the claim that the mean weight is 10kg ? Mention the
level of significance you use.[Ans:t=0.726,Yes at 5% level of sign.]
2. A random sample of 16 observations has mean 103.75 cm. The sum of
the squares of the deviation from the mean is 843.75 cm. Can this
sample be regarded as coming from population having 108.75 cm as
the mean?
3. A company supplies tooth-paste in a packing of 100 gm. A sample of
10 packing gave the following weights in gms.
100.5, 100.3, 100.1, 99.8, 99.7, 99.7, 100.3, 100.4, 99.2, 99.3
Does the sample support the claim of the company that the packing
weighs 100 gms? .[Ans: ̅ X = 99.93, 𝑠 2 = 0.2112,t=0.48,Yes ]
4. A machine is designed to produce insulating washers for electrical
devices of average thickness of 0.025 cms. A random sample of 10
washers was found to have average thickness of 0.024 cms., with SD
of 0.002 cms . Test the significance of the deviation.
[Ans: |𝑡| = 1.5: 𝐴𝑐𝑐𝑒𝑝𝑡 𝐻0 ]
5. A certain drug administered to 12 patients resulted in the following
changes in their blood pressure.
5, 2, 8, -1, 3, 0, 6, -2, 1, 5, 0, 4
Can we conclude that the drug increases the blood pressure?
[Ans: t=2.89,One tailed test X ̅ > 𝜇 is to be accepted. There is increase
in B.P]
6. Nine items of a sample had the following values
45, 47, 50, 52, 48, 47, 49, 53, 51
Does the mean of 9 items differ significantly from the assumed
population mean 47.5? [Ans: |𝑡| = 1.84: 𝐴𝑐𝑐𝑒𝑝𝑡 𝐻0 ]
7. Ten individuals are chosen at random from a population and their
heights are found to be 63, 63, 64, 65, 66, 69, 69, 70, 70, 71 inches.
Discuss the suggestion that the mean height of the universe is 65
inches. [Ans: |𝑡| = 2.02: 𝐴𝑐𝑐𝑒𝑝𝑡 𝐻0 ]
8. Tests made on breaking strength of 10 pieces of a metal wire gave the
following results.
578, 572, 570, 568, 572, 570, 570, 572, 596 and 584 in kgs. Test if
the breaking strength of the metal wire can be assumed to be 577kg. ?
[Ans: |𝑡| = 0.65: 𝐴𝑐𝑐𝑒𝑝𝑡 𝐻0 ]
Prof. Divesh Singh KJSIT, Sion
16
Type-2 Testing the difference between means (Independent samples)
1. Two independent sample of size 8 and 7 gave the following results.
Sample 1: 19 17 15 21 16 18 16 14
Sample 2: 15 14 15 19 15 18 16
Is the difference between sample means significant ? [Ans: t=0.93 ,
accept 𝐻0 ]
2. The mean and standard deviation of height of 8 randomly chosen
soldiers are 166.9 cm and 8.29 cm respectively. The corresponding
values of 6 randomly chosen sailors are 170.3 cm and 8.5 cm
respectively. Based on this data can we conclude that, in general , the
soldiers are shorter than the sailors ? Find 95% confidence limits for
the static used. [Ans: |𝑡| = 0.6967: 𝑁𝑜 ]
3. Samples of electric tubes of two companies were tested for lengths of
their life and the following information was obtained ,
Company A Company B
No. of sample 8 7
Mean life in hrs 1210 1314
Standard deviations in hrs 36 42
Test at 5% level of significance whether the difference in the
sample means is significant. [Ans: |𝑡| = 4.81: 𝑅𝑒𝑗𝑒𝑐𝑡 ]
4. Two types of anti-biotics were tested on two groups of patients for
curing a particular disease and the following data were obtained.
Type-A Type-B
No. of patients 6 6
Mean period in days 13.55 10.10
Unbiased SD in days 3.2 2.8
Use 5% level of significance to test the null hypothesis that the difference
in the mean period of the two drugs is significant. [Ans: |𝑡| = 1.40: 𝑅𝑒𝑗𝑒𝑐𝑡 ]
5. Two kind of manures were used in 17 plots of the same size other
conditions being the same. The yields in the quintals are given below.
Manure I 35, 42, 40, 42, 34, 24, 42
Manure II 34, 44, 32, 40, 52, 41, 50, 40, 42, 45
Test at 5% level of significance whether the two manures differ as
regards their mean yields. [Ans: |𝑡| = 1.68: 𝑑𝑖𝑓𝑓𝑒𝑟𝑒𝑛𝑐𝑒 𝑖𝑠 𝑛𝑜𝑡 𝑠𝑖𝑔𝑛𝑖𝑓𝑖𝑐𝑎𝑛𝑡 ]
6. The following are the gain in weights of cows fed on two types of diets
X and Y.
Diet X: 30, 37, 35, 37, 29, 19, 37
Diet Y: 29, 39, 27, 35, 47, 37, 45, 35, 37, 40
Test at 5% level of significance whether the diets differ as regards their
Prof. Divesh Singh KJSIT, Sion
17
effect on mean increase in weight. [Ans: |𝑡| = 1.62: 𝑑𝑜 𝑛𝑜𝑡 𝑑𝑖𝑓𝑓𝑒𝑟 ]
7. The mean of two random samples of size 9 and 7 are 196 and 199
respectively. The sum of the squares of the deviations from the mean
are 27 and 19 respectively . Can the samples be regarded to have been
drawn from the same normal population? [Ans: |𝑡| = 3.30: 𝑁𝑜 ]
8. A sample of 8 students of 16 years each shown up a mean systolic BP
of 118.4 mm of Hg and SD of 12.17mm. While a sample of 10 student
of 17 years each showed the mean systolic B.P of 121.0 mm with SD
of 12.88 during an investigation. The investigator feels that the
systolic BP is related to age. Do you think that the data provides
enough reasons to support investigators feeling at 5% LOS(Assume
the distribution of systolic BP to be normal [Ans: |𝑡| = 0.14: 𝑁𝑜 ]
9. The means of two random samples of size 9 and 7 are 196.42 and
198.82 respectively. The sums of squares of the deviations from the
means are 26.94 and 18.73 respectively. Can the samples be
considered to have been drawn from the sample population?
[Ans: |𝑡| = 2.64: 𝑁𝑜]
10. Six guinea pigs injected with 0.5 mg of a medication took on an
average 15.4 secs. to fall asleep with an unbiased standard deviation
2.2 secs. , while six other guinea pigs injected with 1.5 mg of the
medication took on an average 11.2 secs. to fall asleep with an
unbiased standard deviation 2.6 cms. Use 5% los to test the null
hypothesis that the difference in dosage has no affect.
[Ans: : |𝑡| = 5.288 , the difference is significant]
11. The height of six randomly chosen sailors are in inches : 63, 65,
68, 69, 71, 72. The heights of ten randomly chosen soldiers are: 61,
62, 65, 66, 69, 69, 70, 71, 72, 73.
Discuss in the light that these data throw on the suggestion that the
soldiers on an average are taller than sailors.
[Ans: |𝑡| = 0.099, suggestion cannot be accepted . ]
Type-3 Testing the difference between means (Dependent samples)
1. A drug was administered to 5 persons and the systolic blood
pressure before and after was measured. The result are given below
Candidates: I II III IV V
B.P. before: 140 130 132 150 140
B.P after: 132 126 133 144 133
Test whether the drug is affective in lowering the systolic blood
pressure . [Ans: |𝑡| = 3,the drug is not affective.
Prof. Divesh Singh KJSIT, Sion
18
2. A drug was administered to 10 patients and the changes in the
sugar content in the blood was recorded as under 10, -6, -4, 2, -8,
6, -5, -3, -6 . Is it reasonable to believe that the drug has no effect
on change of sugar? [Ans: |𝑡| = 0.289,Drug has no affect ]
3. The sales-data of an item in six shops before and after a special
promotional campaign are as under
Shops: A B C D E F
Before campaign: 53 28 31 48 50 42
After campaign: 58 29 30 55 56 45
Can the campaign be judged to be a success at 5% level of
significance? [Ans: |𝑡| = 3.14,Yes ]
4. The following data relates to the marks obtained by 11 students in
two tests, one held at the beginning of the year and the other at the
end of the year after giving intensive coaching.
Test-I 19 23 16 24 17 18 20 18 21 19 20
Test-II 17 24 20 24 20 22 20 20 18 22 18
Do the data indicate that the students are benefited by coaching?
[Ans: |𝑡| = 4,Yes]
5. An IQ test was administered to 5 persons before and after training .
The results are given below.
1 2 3 4 5
IQ before training 110 120 123 132 125
IQ after training 120 118 125 136 121
Test whether there is any change in IQ after training programme. Use
1% level of significance.
[Ans: |𝑡| = 0.82, The value of t for 𝜈 = 4 at 1% level of significance =4.6.
,Null hypothesis accepted]
6. A certain injection administered to 12 patients resulted in the
following changes of blood pressure:
5, 2, 8, -1, 3, 0, 6, -2, 1, 5, 0, 4
Can it be concluded that the injection will be in general
accompanied by an increase in blood pressure? [Ans: |𝑡| = 2.89, 𝐻0
rejected]
7. Ten school boys were given a test in Mathematics and their scores
were recorded. They were given a month special coaching and
second test was given to them in the same subject at the end of the
coaching period. Test if the marks given below give evidence to the
fact that the students are benefitted by coaching,
Marks in Test-I 70 68 56 75 80 90 68 75 56 58
Marks in Test-II 68 70 52 73 75 78 80 92 54 55
Prof. Divesh Singh KJSIT, Sion
19
[Ans: |𝑡| = 0.036, 𝐻0 accepted]
8. In a certain experiment to compare two types of pig-tools A and B ,
the following results of increasing weights were obtained .
Pig No. 1 2 3 4 5 6 7 8
Increase in weight X kg by A: 49 53 51 52 47 50 52 53
Increase in weight X kg by B: 52 55 52 53 50 54 54 53
(i) Assuming that the two sample of pigs are independent, can
we conclude that food B is better than food A.
(ii) Examine the case if the same set of pigs were used in both
the cases.
[Ans:(i)|𝑡| = 2.17, B is superior to food A(ii) |𝑡| = 4.17, B is superior to food A]
𝝌𝟐 𝒕𝒆𝒔𝒕
(𝑶 − 𝑬)𝟐
𝝌𝟐 = ∑
𝑬
O- Observed frequency, 𝑬 – expected frequency
Type-I : Independence of attributes
1. In an experiment on immunisation of cattle from Tuberculosis
the following results were obtained.
Affected Not-affected Total
Inoculated 267 27 294
Not Inoculated 757 155 912
Total 1024 182 1206
Use 𝝌 − test to determine the efficacy of vaccine in preventing
𝟐
tuber-culosis [Ans: Calculated 𝝌𝟐 =10.19 , 𝝌𝟐 table is 3.84 ,effective]
2. Based on following data determine if there is relation between
literacy and smoking.
Smokers Non-smokers
Literates 83 57
Illiterates 45 68
[Ans: Calculated 𝝌 =9.19, Yes]
𝟐
Prof. Divesh Singh KJSIT, Sion
20
3. A total of 3759 individuals were interviewed in a public opinion
survey on a political proposal. Of them 1872 were men and the
rest were women. A total of 2257 individuals were in favour of
the proposal and 917 were opposed to it. A total of 243 men
were undecided and 442 women were opposed to it. Do you
justify or contradict the hypothesis that there is no association
between sex and attitude at 5% level of significance.
[Ans: Calculated 𝝌𝟐 =18.76, Sex and attitude are associated]
4. A sample of 400 students of under-graduate and 400 of post-
graduate classes was keen to know their opinion about
autonomous colleges. 290 of the undergraduate and 310 of the
post-graduate students favoured the autonomous status.
Present these facts in the form of a table and test at 5% level ,
that the opinion regarding autonomous status of colleges is
independent of the level of classes of students .
[Ans: Calculated 𝝌𝟐 =2.66, Opinions are independent]
5. Investigate the association between the darkness of eye colour
in father and son from the following data.
Colour of father’s eye
Colour Dark Not-dark total
of son’s Dark 48 90 138
eyes Not-dark 80 782 862
total 128 872 1000
[Ans: Calculated 𝝌𝟐 =66.88, Yes there is association]
6. The following table gives the number of accounting clerks not
committing errors among trained and untrained clerks working
in an organisation.
No. of clerks No. of clerks not Total
Committing errors Committing errors
Trained 70 530 600
Untrained 155 745 900
Total 225 1275 1500
Test the effectiveness of training in preventing errors.
[Ans: Calculated 𝝌𝟐 =8.7, training is effective in preventing errors]
7. In a survey of 200 boys of which 75 were intelligent, 40 had
educated fathers while 85 of the unintelligent boys had
uneducated father. Do this figures support the hypothesis that
educated fathers have intelligent boys.
[Ans: Calculated 𝝌𝟐 =11.89, Yes there is association]
Prof. Divesh Singh KJSIT, Sion
21
Problems on Goodness of Fit
1. The following table gives the number of accidents in a city during a
week. Find whether the accidents are uniformly distributed over a
week.
Day: Sun, Mon, Tue, Wed, Thu, Fri, Sat Total
No of accidents: 13, 15, 9, 11, 12, 10, 14, 84.
[Ans: Calculated 𝝌𝟐 =2.33, occurs equally on all working days]
2. A die was thrown 132 times and following frequencies were observed.
No. Obtained 1 2 3 4 5 6 Total
Frequency 15 20 25 15 29 28 132
Test the hypothesis that the die is unbiased. [Ans: Calculated 𝝌𝟐 =8.91,
the die is unbiased]
3. The number of car accidents in a metropolitan city was found to be
20,17,12,6,7,15,8,5,16,14 per month respectively. Use 𝝌𝟐 -test to
check whether these frequencies agreement with the belief that
occurrence of accidents was the same during 10 months period. Test
at 5% level of significance.
[Ans: Calculated 𝝌𝟐 =20.33, do not occur equally on all months]
4. 300 digits were chosen at random from a table of random numbers.
The frequencies of digits were as follows.
Digit: 0 1 2 3 4 5 6 7 8 9 Total
Frequency: 28 29 33 31 26 35 32 30 31 25 300
Use 𝝌 -test examine the hypothesis that the digits were distributed in
𝟐
equal numbers in the table.
[Ans: Calculated 𝝌𝟐 =2.87, digits are equally distributed in the table]
5. Theory predicts that proportion of beans in the four groups A, B, C, D
should be 9:3:3:1. In an experiment among 1600 beans the number in
the four groups were 882,313,287 and 118 . Do the experimental
results support the theory? [Ans: Calculated 𝝌𝟐 =4.72, the proportion
9:3:3:1 is correct.]
6. In an experiment of pea breeding the following frequencies were
obtained
Round and Wrinkled Round and Wrinkled Total
yellow and yellow green and green
315 101 108 32 556
Prof. Divesh Singh KJSIT, Sion
22
Theory predicts that the frequencies should be in proportion of
9:3:3:1. Examine the correspondence between theory and experiment
using Chi-square test. [Ans: Calculated 𝝌𝟐 =0.51, the proportion
9:3:3:1 is correct .]
7. The figures given below are (a) the observed frequencies of a
distribution , (b) the frequencies of the normal distribution having the
same mean , SD and the total frequency as in (a) .
(a) 1,12,66,220,495,792,924,792,495,220,66,12,1.
(b) 2,15,66,210,484,799,943,799,484,210,66,15,2.
Apply 𝝌𝟐 test of goodness of fit. [Ans: Calculated 𝝌𝟐 =3.84, the fit is good]
8. The following figures show the distribution of digits in numbers
chosen at random from a telephone directory.
Digit: 0 1 2 3 4 5 6 7 8 9
Freq: 1026 1107 997 966 1075 933 1107 972 964 853
Test at 5% level whether digits may be taken to occur equally
frequently in the directory. [Ans: Calculated 𝝌𝟐 =59.36, No]
Prof. Divesh Singh KJSIT, Sion