Probability and Statistics (PHM111s)-Lecture 3
Part I: Introduction to Statistical Methods.
Part II: Methods of Descriptive Statistics.
1- Collecting Data.
2- Organizing Data.
3- Presenting Data.
4- Summarizing Data.
Part III: Introduction to Probability.
Part IV: Methods of Inferential Statistics.
4. Summarizing Data.
Parameter for Population & Statistic for Sample
Descriptive
Statistics
Ungrouped Grouped
Data Data
Measures of Measures of Measures of
Central Variation Position
Tendency
1- Arithmetic Mean
Case 1: Ungrouped Data
a) The mean for sample with size n, is given as:
n
X1 + X 2 + X 3 + + X n ∑X i
=X = i =1
n n
b) For a population with size N, the mean is given as:
N
X1 + X 2 + X 3 + + X n ∑X i
=µ = i =1
N N
Example1: The data represent the number of days off per year for a sample of individuals selected from nine
different countries. Find the mean.
20, 26, 40, 36, 23, 42, 35, 24, 30
Solution:
X
= ∑=
X 20 + 26 + 40 + 36 + 23 + 42 + 35 + 24 + 30 276
= = 30.7 days
n 9 9
Case 2: Grouped Data
n
∑ f .X
1 n i i
=X =
i =1
n
n
∑ fi . X i
∑ fi i =1
i =1
Example 2: The following table gives the frequency distribution of the number of orders received each day during
the past 50 days at the office of a mail-order company. Calculate the mean.
Solution
Number of orders f
10 – 12 4
13 – 15 12
16 – 18 20
19 – 21 14
n = 50
Number of f X f. X
orders
10 – 12 4 11 44
13 – 15 12 14 168
16 – 18 20 17 340
19 – 21 14 20 280
n = 50 = 832
X ∑ = = 16.64 orders
f . X 832
=
n 50
2- The Median
Case 1: Ungrouped Data
Example 3: The number of rooms in the seven hotels in downtown Pittsburgh is 713, 300, 618, 595, 311, 401, and
292. Find the median.
Solution:
Step 1 Arrange the data in order.
292, 300, 311, 401, 595, 618, 713
Step 2 Select the middle value.
292, 300, 311, 401, 595, 618, 713
↑
Median
Hence, the median is 401 rooms.
Example 4: The number of cloudy days for the top 10 cloudiest cities is shown. Find the median.
209, 223, 211, 227, 213, 240, 240, 211, 229, 212
Solution:
Arrange the data in order.
209, 211, 211, 212, 213, 223, 227, 229, 240, 240
↑
Median
213 + 223
MD = = 218
2
Hence, the median is 218 days.
Case 2: Grouped Data
n
2 − Fm−1
Median = MD= L + i
fm
Example 5: Based on the grouped data below, find the median.
Solution
Seconds f
51 – 55 2
56 – 60 7
61 – 65 8
66 – 70 4
Construct the cumulative frequency distribution
Seconds f cf
51 – 55 2 2
56 – 60 7 9
61 – 65 8 17
66 – 70 4 21
n 21
= = 10.5 → class median is the 3rd class
2 2
So, Fm-1 = 9, fm = 8, i = 5 and L = 60.5
Therefore,
n
2 − Fm−1
MD= L + i
fm
10.5 − 9
= 60.5 + 5
8
=60.5 + 0.9375 =61.4375
Finding Median Graphically
(from Ogive)
n 200
= = 100
2 2
3- The Mode
Case 1: Ungrouped Data
Example 6: The data show the number of licensed nuclear reactors in the United States for a recent 15-year period.
Find the mode.
104 104 104 104 104
107 109 109 109 110
109 111 112 111 109
Solution
Since the values 104 and 109 both occur 5 times, the modes are 104 and 109. The data set is said to be bimodal.
Case 2: Grouped Data
f m − f m−1
Mode= L + i
( f m − f m−1 ) + ( f m − f m +1 )
Example 7: Based on the grouped data below, find the mode.
Solution
Seconds f
51 – 55 2
56 – 60 7
61 – 65 8
66 – 70 4
L = 60.5, fm-1 = 7, fm = 8, fm+1 = 4, i = 5
8−7
= 60.5 + 5
Mode
(8 − 7) + (8 − 4)
1
=60.5 + 5 =61.5
5
Finding Mode Graphically
(from Histogram)
4- The Midrange
lowest value + highest value
MR =
2
Example 8: Find the midrange of data:
18.0, 14.0, 34.5, 10, 11.3, 10, 12.4, 10
Solution
The smallest bonus is $10 million and the largest bonus is $34.5 million.
l0 + 34.5 44.5
MR
= = = $22.25 million
2 2
Notice that this amount is larger than seven of the eight amounts and is not typical of the average of the bonuses.
The reason is that there is one very high bonus, namely, $34.5 million.
5- The Weighted Mean
n
w1 X 1 + w2 X 2 + ... + wn X n ∑w X i i
=X = i =1
n
w1 + w2 + ... + wn
∑w i =1
i
Example 9: A student received an A in English Composition I (3 credits), a C in Introduction to Psychology
(3credits), a B in Biology I (4credits), and a D in Physical Education (2credits). Assuming A =4 grade
points, B =3 grade points, C =2 grade points, D =1 grade point, and F =0 grade points, find the
student’s grade point average.
Solution
Course Credits (w) Grade (X)
English Composition I 3 A (4 points)
Introduction to Psychology 3 C (2 points)
Biology I 4 B (3 points)
Physical Education 2 D (1 point)
n
∑w X i i
(3)(4) + (3)(2) + (4)(3) + (2)(1) 32
The grade point average (GPA) = =
X i =1
n = = = 2.7
3+ 3+ 4+ 2 12
∑ wi
i =1
Distribution Shapes
positively skewed or right-skewed distribution,
symmetric distribution,
negatively skewed or left-skewed.