Understanding Sampling and Population Errors
Understanding Sampling and Population Errors
1B SAMPLING THEORY
1. Population or Universe
Population in statistics means the whole of the information which comes under the
purview of statistical investigation. It is the totality of all the observations of a statistical
experiment or enquiry.
A population may be finite or infinite according as the number of observations or items in
it are finite or infinite. The population of weights of students of class XII in a government
school is an example of a finite population. The population of pressure at different points
in the atmosphere is an example of an infinite population.
Types of Population:
a) Finite Population: When the items in the population are fixed and limited.
Example : No. of students in the class
b) Infinite Population: If a population consist of infinite no. of items its an infinite
population. If a sample is known to have been drawn from a continuous probability
distribution, then the population is infinite. Example : Population of all real numbers
lying between 5 and 20.
c) Real Population: A Population consisting of the items which are all present physically
is termed as real population.
d) Hypothetical Population: The Population consists of the results of the repeated
trails is named as hypothetical population The tossing of a coin repeatedly results
into a hypothetical population of heads and tails.
2. Sample
A part of the population selected for study is called a sample. In other words, the
selection of a group of individuals or items from a population in such a way that this
group represents the population, is called a sample.
1. Sampling is a process whereby we judge the characteristics or draw inference
about the totality or Universe (known as population) on the basis of judging the
characteristics of a selected portion taken from that totality (known as sample).
2. Sample: Sample is the part of population selected on some basis it is a finite subset
70
CA FOUNDATION STATISTICS
of the population.
3. Sample Units : Units forming the samples are called Sample Units.
4. Sample Frame : A complete list of sampling units is called Sample Frame
n
5. Sample Faction : is called Sampling Fraction where n = Sample Size and N =
N
Population Size.
6. Complete enumeration or census : In case of enumeration, information is collected
for each and every unit. The aggregate of all the units under consideration is called
the ‘population’ or the ‘universe’. The results are more accurate and reliable but it
involves lot of time, money and man power
Notations
Statistical Measure Population Sample
Mean µ x
Standard deviation σ s
Proportion P p
Size N n
Related MCQ’s:
1. The aggregate or totality of statistical data forming a subject of investigation is
known as :
a) Sample b) Population
c) Both a) and b) above d) None of the above
71
CA FOUNDATION STATISTICS
72
CA FOUNDATION STATISTICS
11. A complete list of all the units in a finite population, properly numbered for
identification, is called a:
a) Universe b) Sampling Data
c) Sampling Units d) Sampling Frame
13. A border patrol checkpoint which stops every passenger van is utilizing :
a) simple random sampling. b) systematic sampling
c) systematic sampling. d) complete enumeration
14. A population consisting of all the items which are physically present is called :
a) hypothetical b) normal population
c) existent population d) none of the above
73
CA FOUNDATION STATISTICS
Related MCQ’s:
19. Law of Statistical Regularity states that:
a) A sample of reasonably small size when selected at random, is almost not
sure to represent the characteristics of the population
b) A sample of reasonably large size when selected, is almost not sure to represent
the characteristics of the population.
c) A sample of reasonably large size when selected at random, is almost sure to
represent the characteristics of the population, on an average
d) None of the above
74
CA FOUNDATION STATISTICS
22. Increase in reliability and accuracy of results from a sampling study with the increase
in sample size is known as the principle of:
a) statistical regularity b) optimization.
c) law of increasing returns. d) inertia of large numbers.
23. Sampling error increases with an increase in the size of the sample.
a) The above statement is true.
b) The above statement is not true.
c) Sampling error do not depends upon the sample size
d) None of the above
75
CA FOUNDATION STATISTICS
Related MCQ’s:
25. How many different kind of errors can one find in sampling process?
a) One b) Two
c) Three d) Many
33. “Sampling errors are present both in census as well as a sample survey.’’ -State
whether the given statement is correct or not.
a) Correct b) Incorrect
c) Nothing cannot be said d) None of the above
34. There are more chances of non-sampling errors than sampling errors in case of :
a) studies of large samples. b) complete enumeration.
c) inefficient investigators. d) all of the above
From a population of size N, number of samples of size n can be drawn. These samples
will give different values of a statistic. E.g. if different samples of size n are drawn from
a population, different values of sample mean are obtained. The various values of a
statistic thus obtained, can be arranged in the form of a frequency distribution known as
Sampling Distribution. Thus we can have sampling distribution of sample mean x ,
sampling distribution of sample proportion p etc.
Errors in Sampling
Any statistical measure say, mean of the sample, may not be equal to the corresponding
statistical measure (mean) of the population from which the sample has been drawn.
Thus there can be discrepancies in the statistical measure of population, i.e., parameter
and the statistical measures of sample drawn from the same population i.e., statistic.
These discrepancies are known as Errors in Sampling.
Standard Error of a Statistic
Standard error is used to measure the variability of the values of a statistic computed from
the samples of the same size drawn from the population, whereas standard deviation is
used to measure the variability of the observations of the population itself.
The standard deviation of the sample statistics is called standard error of that statistic.
E.g. if different samples of the same size n are drawn from a population, we get different
values of sample mean x . The S.D. of x . is called standard error of x . . It is obvious that
77
CA FOUNDATION STATISTICS
the standard error of x . will depend upon the size of the sample and the variability of
the population.
σ
i) Standard error of sample mean SE ( x ) =
σ=Population S.D
and s=Sample S.D
ii) Standard error of proportion SE (p) =
The following table will provide us a better understanding of the situations while
calculating SE ( x )
Summary
Concept of Sampling Distribution of Statistic and Standard Error:
Samples can be drawn with or without replacement
78
CA FOUNDATION STATISTICS
Related MCQ’s:
36. Values of a particular statistic with their relative frequencies will constitute the of
the concerned statistic.
a) Probability Distribution b) Sampling Distribution
c) Theoretical Distribution d) None of these
38. In general mean of the sampling distribution is _ as the mean of the population.
a) more than b) less than
c) same d) none of the above
79
CA FOUNDATION STATISTICS
40. The population standard deviation describes the variation among elements of the
universe, whereas, the standard error measures the:
a) variability in a statistic due to universe
b) variabillity in a statistic due to sampling
c) variablity in a parameter due to universe
d) variablity in a statistic due to parameter
41. As the units selected in two or more samples drawn from a population are not
the same, the value of a varies from sample to sample, but the always remains
constant.
a) mean, standard deviation b) statistic, standard deviation
c) statistic, parameter d) parameter, statistic
80
CA FOUNDATION STATISTICS
46. The standard error of the is the standard deviation of sample means.
a) Population b) Sample c) Mean d) Median
7. Types of Sampling
A sample can be selected from a population in various ways. Different situations call for
different methods of sampling. There are three methods of Sampling:
1. Random Sampling or Probability Sampling Method
3. Mixed Sampling
81
CA FOUNDATION STATISTICS
basis of random numbers which are available from the random number
tables. The various random number tables available are:
a. Trippet’s Random Number Series;
b. Fisher’s and Yales Random Number Series;
c. Kendall and Badington Random Number Series;
d. Rand Corporation Random Number Series;
One major disadvantage of random sampling is that all the members of the
population must be known and be serially numbered. It will entail a lot of
difficulties in case the population is of large size and will be impossible in case
the population is of infinite size.
(b) Restricted Random Sampling:
It is of three types
• Stratified Sampling
• Systematic Sampling
• Multi-stage Sampling
Stratified Sampling: In stratified random sampling, the population is divided into
strata (groups) before the sample is drawn. Strata are so designed that they do
not overlap. An elementary unit from each stratum is drawn at random and
the units so drawn constitute a sample. Stratified sampling is suitable in those
cases where the population is hetrogeneous but there is homogeneity within
each of the groups or strata.
Advantages
(i) It is a representative sample of the hetrogeneous population.
(ii) It lessens the possibility of bias of one sidedness.
Disadvantages
(i) It may be difficult to divide population into homogeneous groups.
(ii) There may be over lapping of different strata of the population which will
provide an unrepresentative Sample.
Systematic Sampling: In this method every elementary unit of the population is
arranged in order and the sample units are distributed at equal and regular
intervals. In other words, a sample of suitable size is obtained (from the orderly
arranged population) by taking every unit say tenth unit of the population.
One of the first units in this ordered arrangement is chosen at random and the
sample is computed by selecting every tenth unit (say) from the rest of the lot.
If the first unit selected is 4, then the other units constituting the sample will
be 14, 24, 34, 44, and so on.
82
CA FOUNDATION STATISTICS
Advantages: It is most suitable where the population units are serially numbered
or serially arranged.
Disadvantages: It may not provide a desirable result due to large variation in
the items selected.
Multi-stage Sampling: In this sampling method, sample of elementary units is
selected in stages. Firstly a sample of cluster is selected and from among them
a sample of elementary units is selected. It is suitable in those cases where
population size is very big and it contains a large number of units.
83
CA FOUNDATION STATISTICS
Related MCQ’s:
48. Simple random sampling is very effective if
(a) The population is not very large
(b) The population is not much heterogeneous
(c) The population is partitioned into several sections.
(d) Both (a) and (b)
84
CA FOUNDATION STATISTICS
51. Which sampling provides separate estimates for population means for different
segments and also an over all estimate?
(a) Multistage sampling (b) Stratified sampling
(c) Simple random sampling (d) Systematic sampling
53. Which sampling is affected most if the sampling frame contains an undetected
periodicity?
(a) Simple random sampling (b) Stratified sampling
(c) Multistage sampling (d) Systematic sampling
While selecting the units for a sample, when a unit of sample selected is replaced before
the next unit is selected then it is called sampling with replacement.
In this case the total number of samples that can be drawn = (N)n For E.g.: Let Population
= {a, b, c}
N = 3, let n = 2
No. of samples = (N)n = (3)2 = 9
No. of samples = {(a, b) (a, c) (b, c) (b, a) (c, a) (c, b) (a, a) (b, b) (c, c)}
While selecting the units for a sample, when a unit of sample is selected but not replaced
before the next unit is selected then it is called Sampling Without Replacement.
In this case the total number of samples that can be drawn = For E.g.: Let population =
{a, b, c}
N = 3, let n = 2
85
CA FOUNDATION STATISTICS
No. of samples =
No. of samples = {(a, b), (a, c), (b, c)}
Related MCQ’s:
55. A population comprises 3 members 1, 5, 3. Draw all possible samples of size two (i)
with replacement (ii) without replacement
Find the sampling distribution of sample mean in both cases.
56. In simple random sampling with replacement, the total number of possible sample
with distinct permutation of member is:
(N = Size of Population, n = Sample size)
a) N x n b) Nn c) N d) n
57. In simple random sampling without replacement, the total number of possible
sample with distinct permutation of member is:
(N = Size of Population, n = Sample size)
a) Nn b) P(N, n) c) C(N,n) d) None of the above
58. If from a population with 20 members, a random sample without replacement of 2
members is taken, the number of all such samples is :
a) 400 b) 190 c) 210 d) 200
86
CA FOUNDATION STATISTICS
Theory Answers
1 B 11 D 21 A 31 C 41 C 51 B
2 A 12 B 22 D 32 C 42 C 52 D
3 A 13 D 23 B 33 B 43 C 53 D
4 C 14 C 24 B 34 D 44 A 54 C
5 C 15 A 25 B 35 C 45 C 55 Read
note
6 D 16 B 26 D 36 B 46 C 56 B
7 A 17 A 27 B 37 D 47 A 57 C
8 A 18 C 28 D 38 C 48 D 58 B
9 B 19 C 29 C 39 D 49 A 59 C
10 A 20 B 30 C 40 B 50 B 60 D
87
CA FOUNDATION STATISTICS
HOMEWORK SECTION
88
CA FOUNDATION STATISTICS
7. A parameter is a characteristic of
(a) Population (b) Sample
(c) Both (a) and (b) (d) (a) or (b)
8. A statistic is
(a) A function of sample observations (b) A function of population units
(c) A characteristic of a population (d) A part of a population.
89
CA FOUNDATION STATISTICS
15. A population comprises 5 members. The number of all possible samples of size 2
that can be drawn from it with replacement is
(a) 100 (b) 15 (c) 125 (d) 25
19. Which sampling provides separate estimates for population means for different
segments and also an over all estimate?
(a) Multistage sampling (b) Stratified sampling
(c) Simple random sampling (d) Systematic sampling
90
CA FOUNDATION STATISTICS
21. Which sampling is affected most if the sampling frame contains an undetected
periodicity?
(a) Simple random sampling (b) Stratified sampling
(c) Multistage sampling (d) Systematic sampling
23. If a random sample of size 2 with replacement is taken from the population
containing the units 3,6 and 1, then the samples would be
(a) (3,6),(3,1),(6,1)
(b) (3,3),(6,6),(1,1)
(c) (3,3),(3,6),(3,1),(6,6),(6,3),(6,1),(1,1),(1,3),(1,6)
(d) (1,1),(1,3),(1,6),(6,1),(6,2),(6,3),(6,6),(1,6),(1,1)
24. If a random sample of size two is taken without replacement from a population
containing the units a,b,c and d then the possible samples are
(a) (a, b),(a, c),(a, d)
(b) (a, b),(b, c), (c, d)
(c) (a, b), (b, a), (a, c),(c,a), (a, d), (d, a)
(d) (a, b), (a, c), (a, d), (b, c), (b, d), (c,d)
Answers
1 c 6 d 11 d 16 b 21 d
2 c 7 a 12 a 17 a 22 c
3 d 8 a 13 d 18 b 23 c
4 b 9 a 14 a 19 b 24 d
5 a 10 d 15 d 20 d
91