Biomedical Engineering Department Dr.
Mohammed Jabbar Al-Salihy
3rd Stage 2022-2023
2nd Semester Engineering Statistics
Descriptive Statistics
In this lecture we learn several techniques for organizing and summarizing data
so that we may more easily determine what information they contain.
1-Ordered Array: An ordered array is a listing of the values of a collection
(either population or sample) in order of magnitude from the smallest value to the
largest value.
Example (1): value of 10 resistors (27,25,18,30,35,40,30,13,9,22)
Solution: Ordered array (9,13,18,22,25,27,30,30,35,40)
2-Dot plot
In a dot plot, each data entry is plotted, using a point, above a horizontal axis.
Example (2): Use a dot plot to display the ages of the 30 students in the statistics
class.
18 20 21 27 29 20
19 30 32 19 34 19
24 29 18 37 38 22
30 39 32 44 33 46
54 49 18 51 21 21
Solution:
From this plot, we can conclude that most of the values lie between 18 and 32.
1
Biomedical Engineering Department Dr. Mohammed Jabbar Al-Salihy
3rd Stage 2022-2023
2nd Semester Engineering Statistics
3-Scatter Plot
When each entry in one data set corresponds to an entry in another data set, the
sets are called paired data sets.
• In a scatter plot, the ordered pairs are graphed as points in a coordinate plane.
The scatter plot is used to show the relationship between two quantitative
variables.
• The following scatter plot represents the relationship between the number
of absences from a class during the semester and the final grade.
100
90 Absences Grade
80
8 78
70
60
2 92
50 5 90
40 12 58
30 15 43
20 9 74
10
6 81
0
0 2 4 6 8 10 12 14 16
From the scatter plot, you can see that as the number of absences increases, the
final grade tends to decrease.
4-Pareto Diagram
This display, orders each type of failure or defect according to its frequency,
can help engineers identify important defects and their causes.
When a company identifies a process as a candidate for improvement, the first
step is to collect data on the frequency of each type of failure. For example, the
performance of a computer‐controlled lathe is low, so the engineers record the
following causes of defects and their frequencies:
2
Biomedical Engineering Department Dr. Mohammed Jabbar Al-Salihy
3rd Stage 2022-2023
2nd Semester Engineering Statistics
Cause of Defect Frequency
Power fluctuations 6
Bad Maintenance 22
Operator error 13
Worn tool not replaced 2
Bad material 5
Cause of Defect Frequency Percentage
Bad Maintenance 22 (22/48) 100% = 45.8 %
Operator error 13 (13/48) 100% = 27.08 %
Power fluctuations 6 (6/48) 100% = 12.5 %
Bad material 5 (5/48) 100% = 10.4 %
Worn tool not replaced 2 (2/48) 100% = 4.17 %
48 100 %
3
Biomedical Engineering Department Dr. Mohammed Jabbar Al-Salihy
3rd Stage 2022-2023
2nd Semester Engineering Statistics
5-Frequency distribution
Organizing data into a table or a graph by putting them into non-overlapping
classes (intervals) or categories.
Example (3): Consider the following heights of 40 Nano-pillars were measured
in nanometers (µm) while fabricating a new transmission type electron multiplier
of a flat silicon membrane. Construct a frequency distribution using k=6 classes.
23 30 20 27 44 26 35 20 29 29
25 15 18 27 19 22 12 26 34 15
27 35 26 43 35 14 24 12 23 31
40 35 38 57 22 42 24 21 27 33
Solution:
𝑘𝑘 = 1 + 3.322 log10 𝑛𝑛,
𝑘𝑘 = 𝑁𝑁𝑁𝑁𝑁𝑁𝑁𝑁𝑁𝑁𝑁𝑁 𝑜𝑜𝑜𝑜 𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐𝑐
n=number of data (number of data sets under consideration) =40
𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅 𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙 − 𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠
𝑤𝑤 = width of the class = =
𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛 𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛
k=6 given
57−12
𝑤𝑤 = = 7.5 ≈ 8 we approximate results to the nearest integer number due
6
to the data are integer
20 − 19
𝑗𝑗𝑗𝑗𝑗𝑗𝑗𝑗 = = 0.5
2
𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓
𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅 𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓 =
𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛 𝑜𝑜𝑜𝑜 𝑜𝑜𝑜𝑜𝑜𝑜𝑜𝑜𝑜𝑜𝑜𝑜𝑜𝑜𝑜𝑜𝑜𝑜𝑜𝑜𝑜𝑜𝑜𝑜
𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃 = 𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅 𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓𝑓 ∗ 100%
4
Biomedical Engineering Department Dr. Mohammed Jabbar Al-Salihy
3rd Stage 2022-2023
2nd Semester Engineering Statistics
Class Frequency Cumulative Relative Cumulative True Mid-
limits Frequency Frequency Relative Class point
Frequency Interval
12-19 7 7 7/40=17.5% 7/40=17.5% 11.5-19.5 15.5
20-27 17 24 17/40=42.5% 24/40=60% 19.5-27.5 23.5
28-35 10 34 10/40=25% 34/40=85% 27.5-35.5 31.5
36-43 4 38 4/40=10% 38/40=95% 35.5-43.5 39.5
44-51 1 39 1/40=2.5% 39/40=97.5% 43.5-51.5 47.5
52-59 1 40 1/40=2.5% 40/40=100% 51.5-59.5 55.5
Total 40 100%
A-How many Nano-pillars at least 36 µm heights? 1+1+4=6
B-What’s the percentage of Nano-pillars that heights at most 27? (17+7)/40
6-Histogram (True Class Interval+ Frequency)
We may display a frequency distribution (or a relative frequency distribution)
graphically in the form of a histogram, which is a special type of bar graph. When
we construct a histogram the values of the variable under consideration are
represented by the horizontal axis, while the vertical axis has as its scale the
frequency (or relative frequency if desired) of occurrence
7-Frequency Polygon (Mid-point+ Frequency)
A frequency distribution can be portrayed graphically in yet another way by
means of a frequency polygon, which is a special kind of line graph.
5
Biomedical Engineering Department Dr. Mohammed Jabbar Al-Salihy
3rd Stage 2022-2023
2nd Semester Engineering Statistics
Example (3): For the ages of people in the table below, use k=9 classes, and plot
a histogram and frequency polygon.
Solution:
82−30
W= = 5.778
9
It is apparent that a class interval width of 5 or 10 will be more convenient to
use, as well as more meaningful to the reader. Suppose we decide on 10.
6
Biomedical Engineering Department Dr. Mohammed Jabbar Al-Salihy
3rd Stage 2022-2023
2nd Semester Engineering Statistics
Homework-1: The following table shows rainfall measurements, use k=6.
7 10 12 4 8 7 3 8 5
12 11 3 8 1 1 13 10 4
455877323
8 13 1 7 17 3 4 5 5
3 1 17 10 4 7 7 11 8
From these data construct:
A. A frequency distribution
B. A relative frequency distribution
C. A histogram
D. A frequency polygon
8-Stem-and-Leaf Displays
Another graphical device that is useful for representing quantitative data sets is
the stem-and-leaf display. To construct a stem-and-leaf display we partition each
measurement into two parts. The first part is called the stem, and the second part
is called the leaf.
7
Biomedical Engineering Department Dr. Mohammed Jabbar Al-Salihy
3rd Stage 2022-2023
2nd Semester Engineering Statistics
Example (4): The following are humidity readings. Construct a stem-and-leaf
display from these data.
23 41 26 34 35 14
65 10 52 46 55 61
21 24 33 41 32 54
59 15 17 50 27 45
49 46 51 39 49 41
Solution:
10 14 15 17 21 23
24 26 27 32 33 34
35 39 41 41 41 45
46 46 49 49 50 51
52 54 55 59 61 65
Stem Leaf
1 0457
2 13467
3 23459
4 11156699
5 012459
6 15
Range=max.-min.=65-10=55
Mode=41
Median= (41+41)/2=41
Mean=38.5
Homework-2: The height of 20 students in cm is given below:
143 160 154 159 172 165 162 171 146 165
176 145 165 182 175 186 160 158 167 172
Plot stem and leaf, Find the range of height, what is the mode of height, and find
the median of height