Worksheet on chapter three
Imagine you have a data set showing the age (in years) of 10 students in a class: {18, 19, 19, 20,
21, 22, 22, 23, 24, 25}
Instructions:
1. Central Tendency:
o Calculate the mean, median, and mode of the students' ages.
What do these values tell you about the "typical" age in the class?
2. Spread:
o Calculate the range, variance, and standard deviation of the students' ages.
What do these values tell you about how spread out the ages are?
3. Quartiles:
o Calculate the first quartile (Q1), second quartile (Q2), and third quartile (Q3) of
the students' ages.
How do Q1, Q2, and Q3 divide the data? What can you learn about the
distribution of ages from these quartiles?
4. Visualization (Optional):
o Create a histogram or boxplot to visually represent the distribution of the students'
ages.
Does the visualization confirm what you learned from the numeric
measures?
5. Explain the difference between mean, median, and mode. In which scenario would each
measure be the most appropriate way to describe the "typical" value in a data set?
6. A data set contains exam scores for a class. Some students scored very high due to
exceptional performance, while the majority scored around the average. Would the mean,
median, or mode be the most representative of the "typical" student score in this case?
Why?
7. Imagine you are comparing the heights of two basketball teams. Team A has a higher
standard deviation than Team B. What does this suggest about the variation in heights
within each team?
8. A data set has a large range but a low standard deviation. What can you infer about the
distribution of the data?
9. Describe the first quartile (Q1), second quartile (Q2), and third quartile (Q3).
10. What is skewness?
11. Define the coefficient of variation (CV). Why might the CV be a more useful measure of
spread than standard deviation when comparing data sets with different units?
12. When summarizing a data set, why is it important to consider both measures of central
tendency and spread?
13. Consider the example of ten numbers
19, 21 20 20 34 22 24 27 27 27 then calculate mean,
median, mode, range, variance, SD, quartile (Q1, Q2, Q4) IQR ,QD, CQD.
14. The following are life expectancies of males in six European countries: 74, 77, 73, 75, 77
and 78. If the observed value of 74 is mistakenly recorded 740. What is the effect on each
of the following? . Mean, Median, SD, IQR
15. Birth weight (gm) data for 20 infants 2069, 2581, 2759, 2834, 2838, 2841, 3031, 3101,
3200, 3245, 3248, 3260, 3265, 3314, 3323, 3484, 3541, 3609, 3649, 4146
Question
a. Compute the Q3,
b. 10th and
c. 90th percentiles
16. Identify their skewness or distribution of the following data
o Data on height were collected from 100 adult individuals in Woreda “xx” and
summarized as Mean = 1.65cm, Median = 1.50cm & Mode = 1.60Cm
o body weights were measured for 100 adults aged 21 years and above in Woreda
“xx” and summarized as; Mean = 60Kg, Median = 70Kg & Mode = 80Kg