0% found this document useful (0 votes)

45 views37 pages

Week 6+7+8

FDGS

Uploaded by

phuonggliinh.work

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

45 views37 pages

Week 6+7+8

FDGS

Uploaded by

phuonggliinh.work

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 37

STATISTICS

IN ECONOMICS AND BUSINESS

Nguyen Huyen Trang

Faculty of Statistics - National Economics University
[email protected]
LECTURE 6: DATA MEASUREMENT

Summary
Measures

Central Measures of
Tendency Dispersion

Standard
Mean Median Variance
Deviation

Coefficient of
Mode Variation
Quartile
OUTLINE

• Central Tendency
• Percentiles - Quartile
• Measures of Dispersion
CENTRAL TENDENCY

A summary measure that attempts to describe a whole

set of data with a single value that represents the middle
or center of its distribution.
• Mean
• Median
• Mode
MEAN

• The most common measure of central tendency

• Apply for quantitative only

• Have the same unit as original data

• Denote for the population mean: μ, for the sample mean: xത

• Formula:
➢ Arithmetic mean
➢ Geometric mean
ARITHMETIC MEAN

• Example: Student A’s grade in some courses

Course Grade Points
Algebra 3.63
Introduction to Logic 4.20 GPA???
Microeconomics 3.46
Statistics 4.00

 xi  xi
= x=
N n
WEIGHTED ARITHMETIC MEAN

• Example: Any difference if know more information about the

number of credits?
Course Number of Credits Grade Points
Algebra 3 3.63
Introduction to Logic 2 4.20
Microeconomics 3 3.46
Statistics 3 4.00

Weight wi Value xi
Each data is given a weight that reflects its importance
WEIGHTED ARITHMETIC MEAN

Number Grade Grade Points x

Course
of Credits Points Credits
Algebra 3 3.63 10.89
Introduction to Logic 2 4.20 8.40
Microeconomics 3 3.46 10.38
Statistics 3 4.00 12.00
Total 11 x 41.67

In general, for weighted data:

σ x i wi where:
xത = xi = value of observation i
σ wi wi = weight for observation i
GROUPED DATA

• The weighted mean computation can be used to obtain

approximations of the mean, variance, and standard deviation for the
grouped data.
• To compute the weighted mean, we treat the midpoint of each class
as though it were the mean of all items in the class.
• We compute a weighted mean of the class midpoints using the class
frequencies as weights.
• Similarly, in computing the variance and standard deviation, the class
frequencies are used as weights.
σ x i fi where:
xത = xi = midpoint of each class
σ fi fi = class frequencies
MEAN FOR GROUPED DATA

Example: SCCoast, an Internet provider in the Southeast, developed

the following frequency distribution on the age of Internet users.
Age Frequency
Number (fi)
of users xi x if i
10 up to 20 3 15 45
20 up to 30 7 25 175
30 up to 40 18 35 630
40 up to 50 20 45 900
50 up to 60 12 55 660
Total 60 2410
THE MEAN

• Compare the mean of following data:

– Data 1: {10, 10, 11, 12, 12}

– Data 2: {2, 3, 4, 6, 40}

• The mean is easily affected by the extreme values

or outliers → lead to biased comparison

• Use the other measure

MEDIAN

• The median of a data set is the value in the middle

when the data items are arranged in ascending
order.
• For an odd number of observations, the median is
the middle value.
• For an even number of observations, the median is
the average of the two middle values.
MEDIAN

• Median is the ‘cutoff point’ of lower 50% - upper 50% parts

• Denoted as Me

Lower
50%
Upper
50%

Median
MEDIAN

Example:

• Data: { 5, 6, 9, 5, 6}

Ordered data: { 5, 5, 6, 6, 9 }: Median = 6

• Ordered Data {6, 6, 7, 8, 9, 11} :

7+8
Median = = 7.5
2
MEDIAN

• Compare the mean and median of following data:

Data 1: {10, 10, 11, 12, 12}
Data 2: {2, 3, 4, 6, 40}
• The median is independent from the outliers
• Depends on the position
• Apply for quantitative variable only
MODE

• Could be applied for both quantitative and qualitative

variable
• The mode of a data set is the value that occurs with greatest
frequency
• Denoted as Mo
• Find the Mode:
➢ Qualitative Data
➢ Quantitative Data
MODE

• Qualitative Data
➢ Data: { Yellow, Yellow, Red, Blue, Green}
→ Mode is the category having the largest frequency
• Quantitative Data
➢ Data 1: { 5, 6, 6, 7, 7, 7, 9 }
➢ Data 2: { 5, 6, 7, 8, 9 }
➢ Data 3: { 5, 6, 9, 5, 6 }
➢ Data 4: { 5, 5, 5, 5, 5 }
→ Mode is the value having the largest frequency
There may be no mode or several modes
MEAN, MODE, MEDIAN

Negatively skewed Positively skewed

Left skewed Symmetric Right skewed

Mean
Median
Mean < Median < Mode Mode Mode < Median < Mean
PERCENTILES

❑ A percentile provides information about how the data are spread

over the interval from the smallest value to the largest value.
❑The pth percentile is a value
that divides the data into two
parts:
At least p% of the observations
are equal or less than the pth
percentile
At least (100 – p)% of the
observations are equal or
greater than the pth percentile
PERCENTILES

80% of people are shorter than you and your height is 1.85m

You are at the 80th percentile

Approximately 80% people shorter than (1.85m) and
20% people taller than 1.85m
PERCENTILES

A total of 10,000 people visited the shopping mall over 12 hours:

Time Cumulative 0 000 eople
(hours) Freq. 000
0 0 000
2 350 000

4 1100 000
5 000
6 2400
000
8 6500
000
10 8850 000
12 10000 000
i e in ours
0
• Estimate the 30th percentile 0 5 0

• Estimate what percentile of visitors had arrived after 11 hours

QUARTILES

Quartiles are specific percentiles, divides the data into 4 equal parts
by 3 cut-off points
• First Quartile Q1 = 25th Percentile
• Second Quartile Q2 = 50th Percentile = Median
• Third Quartile Q3 = 75th Percentile

25% 25% 25% 25%

Q1 Q2 Q3
MEASURES OF VARIABILITY

Firm A Firm B Mean A = Mean B = 1500

Worker 1 400 1480
Worker 2 400 1485
Worker 3 600 1486 Which firm’s worker salary is more
Worker 4 600 1488 fluctuated/stable?
Worker 5 700 1490
Worker 6 800 1503 Central Tendency may not provide
Worker 7 900 1505 efficient information of the data.
Worker 8 2000 1520 Data may have the same Mean,
Worker 9 2600 1521 Median, but differ in variability
Worker 10 6000 1522 (dispersion, spread)
MEASURES OF VARIABILITY

Tells about the spread of the data. Help us to compare the spread
in two or more distributions

▪ Range
▪ Variance
▪ Standard Deviation
▪ Coefficient of Variation
RANGE

• The difference between the largest and the smallest value in a

data set.
Firm A Firm B
R = xmax - xmin Worker 1 400 1480
Worker 2 400 1485
• Example:
Worker 3 600 1486
Range (A) = 6000 – 400 = 5600 Worker 4 600 1488
Worker 5 700 1490
Range (B) = 1522 – 1480 = 52 Worker 6 800 1503
Worker 7 900 1505
• Pros: simple Worker 8 2000 1520
Worker 9 2600 1521
• Cons: affected by outliers
Worker 10 6000 1522
INTERQUARTILE RANGE

• Interquartile Range is range between 3rd quartile and 1st quartile

• IQR is the width of 50% middle value of data

• It overcomes the sensitivity to extreme data values
VARIANCE

• Overcome the weakness of the range by using all the

values
➢ Data: x1, x2,…, xn → the mean
➢ Difference between the value of each observation (xi) and

the mean (x for a sample, μ for a population): xi - x

ത
VARIANCE

• Formula:
 ( x −  ) 2
➢ Population Variance: 2 = i
N

➢ Sample Variance: s2 =  ( xi − x )
2

n −1
• If 𝑠𝑥2 > 𝑠𝑦2 then:

• x is more dispersed, widespread, fluctuated than y

• y is more stable, concentrated than x

VARIANCE FOR GROUPED DATA

• Formula:
 f ( M −  ) 2
➢ Population Variance:  2
= i i
N

➢ Sample Variance: s2 =  f i ( M i − x ) 2

n −1
STANDARD DEVIATION

• Is the square root of the variance

• It is measured in the same units as the data, making it more

easily comparable, than the variance, to the mean

• Formula:

➢ Population Standard Variance: σ = σ2

➢ Sample Standard Variance: s = s2

COEFFICIENT OF VARIATION

• Indicates how large the standard deviation is in relation to the mean

• This is the ratio of the standard deviation to the mean

SD
CV =  100
mean

Business Decision Making – Nguyen Minh Thu – [email protected]

COEFFICIENT OF VARIATION

An investor is considering the relative risks associated with two

projects:
• The first project has a mean expected profit of £5000 with a
standard deviation of £707.11
• The second project has a mean expected profit of £500 with a
standard deviation of £112.13
Use the measures of dispersion to establish which project has the
lowest degree of risk.
Business Decision Making – Nguyen Minh Thu – [email protected]
EXPLORATORY DATA ANALYSIS

• Five-Number Summary
• Box Plot
• Detecting Outlier
FIVE-NUMBER SUMMARY

• Smallest Value
• First Quartile
• Median
• Third Quartile
• Largest Value
=> use to draw box plot
BOX PLOT

• A box is drawn with its ends located at the first and third
quartiles.
• A vertical line is drawn in the box at the location of the
median.
• Limits are located (not drawn) using the interquartile range
(IQR).
✓ The lower limit is located 1.5(IQR) below Q1.
✓ The upper limit is located 1.5(IQR) above Q3.
✓ Data outside these limits are considered outliers.
(Value < Q1 – 1.5 IQR or Value > Q3 + 1.5 IQR)
BOX AND WHISKER PLOT

▪ Boxplot 1
min max
Q1 Q2 Q3

▪ Boxplot 2 IQR = Q3 – Q1
outlier
Q1 – 1.5IQR Q3 + 1.5IQR

Lower limit: the maximum of Upper limit: the minimum of

(min, Q1-1.5*IQR) (max, Q3+1.5*IQR)
BOX AND WHISKER PLOT

A B C D E F

Max 6 6 7 9 6 4

Q3 5 4 6 6 4 3

Q2 4.5 2.5 5.5 4.5 2.5 2.5

Q1 3 2 4 4 1 2

Min 1 1 1 3 -1 1

4.2 2.8 5.16 4.84 2.5 2.5

Intro to Descriptive Statistics
No ratings yet
Intro to Descriptive Statistics
68 pages
Basic 1
No ratings yet
Basic 1
60 pages
Dsbda Unit 2
No ratings yet
Dsbda Unit 2
155 pages
Descriptive Statistics 1
No ratings yet
Descriptive Statistics 1
63 pages
Descriptive Statistics W25
No ratings yet
Descriptive Statistics W25
41 pages
Central Tendency Variation Outliers
No ratings yet
Central Tendency Variation Outliers
59 pages
STAT241 - Business Statistics (Day 3)
No ratings yet
STAT241 - Business Statistics (Day 3)
32 pages
Measusres of Locations
No ratings yet
Measusres of Locations
52 pages
Measures of Location and VARIATION For 1 Variable
No ratings yet
Measures of Location and VARIATION For 1 Variable
44 pages
2 Measures of Location - Dispersion
No ratings yet
2 Measures of Location - Dispersion
61 pages
Lecture Slides - Capítulo 02
No ratings yet
Lecture Slides - Capítulo 02
21 pages
Measures of Central Tendency and Spread: Chapter 1, Section 2
No ratings yet
Measures of Central Tendency and Spread: Chapter 1, Section 2
36 pages
Statistics for Business Analysis
No ratings yet
Statistics for Business Analysis
29 pages
2 Descriptives
No ratings yet
2 Descriptives
43 pages
Lecture 04
No ratings yet
Lecture 04
88 pages
Stats
No ratings yet
Stats
109 pages
Part 2-Chapter 3 - Describing Data - Edit
No ratings yet
Part 2-Chapter 3 - Describing Data - Edit
46 pages
Lecture 2b - Describing Data-Numerical
No ratings yet
Lecture 2b - Describing Data-Numerical
47 pages
Discriptive Statistics
No ratings yet
Discriptive Statistics
23 pages
EECM3724 Unit 1 Ch3 Slides 2022
No ratings yet
EECM3724 Unit 1 Ch3 Slides 2022
48 pages
Central Tendency - Lecture Notes
No ratings yet
Central Tendency - Lecture Notes
34 pages
4 Numerical Methods For Describing Data
No ratings yet
4 Numerical Methods For Describing Data
50 pages
ch03 Ver3
No ratings yet
ch03 Ver3
25 pages
ch03 Ver3
No ratings yet
ch03 Ver3
25 pages
Math264 Numerical Measures Apaydın
No ratings yet
Math264 Numerical Measures Apaydın
64 pages
Ken Black QA ch03
0% (1)
Ken Black QA ch03
61 pages
Lecture 3 Numerical Measures of Data
No ratings yet
Lecture 3 Numerical Measures of Data
36 pages
Understanding Measures of Dispersion
No ratings yet
Understanding Measures of Dispersion
42 pages
Chapter 3
No ratings yet
Chapter 3
17 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
59 pages
Slides Week2
No ratings yet
Slides Week2
43 pages
Statistical Data
No ratings yet
Statistical Data
41 pages
P3measure of Dispersion
No ratings yet
P3measure of Dispersion
25 pages
Lecture 06-Describing Data Visual Information
No ratings yet
Lecture 06-Describing Data Visual Information
49 pages
Introductory of Statistics - Chapter 3
No ratings yet
Introductory of Statistics - Chapter 3
7 pages
Basic Business Statistics: Concepts & Applications: Activity 4+ 5 + 6 Descriptive Statistics and Graphical Analysis
No ratings yet
Basic Business Statistics: Concepts & Applications: Activity 4+ 5 + 6 Descriptive Statistics and Graphical Analysis
33 pages
Exploring Numerical Data - Students
No ratings yet
Exploring Numerical Data - Students
97 pages
Chapter 3, Part A Descriptive Statistics: Numerical Measures
No ratings yet
Chapter 3, Part A Descriptive Statistics: Numerical Measures
7 pages
Session 2 Descriptive Statistics
No ratings yet
Session 2 Descriptive Statistics
33 pages
Numerical Descriptive Measures 1
No ratings yet
Numerical Descriptive Measures 1
39 pages
RMBS BPT402
No ratings yet
RMBS BPT402
103 pages
Quantitative Methods For Management
No ratings yet
Quantitative Methods For Management
118 pages
Lecture 3 - Stat HO
No ratings yet
Lecture 3 - Stat HO
21 pages
2 - Unit-Ii-2
No ratings yet
2 - Unit-Ii-2
66 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
26 pages
9 MMW Data Management UNgrouped N Grouped FM1B
No ratings yet
9 MMW Data Management UNgrouped N Grouped FM1B
42 pages
Variability Final
No ratings yet
Variability Final
53 pages
Student Notes 1.3 New
No ratings yet
Student Notes 1.3 New
6 pages
03 Numerical Description
No ratings yet
03 Numerical Description
52 pages
Descriptive Statistics Overview
No ratings yet
Descriptive Statistics Overview
30 pages
Session 2 Inferential Statistics Slides
100% (1)
Session 2 Inferential Statistics Slides
93 pages
DSILYTC Session 5 - Descriptive Statistics
No ratings yet
DSILYTC Session 5 - Descriptive Statistics
99 pages
FDSA Unit 2
No ratings yet
FDSA Unit 2
44 pages
Statistics I Chapter 2: Univariate Data Analysis
No ratings yet
Statistics I Chapter 2: Univariate Data Analysis
27 pages
3 Descriptive Statistics - Numerical
No ratings yet
3 Descriptive Statistics - Numerical
82 pages
Bus. Statt. Chapter-Lecture 2+3
No ratings yet
Bus. Statt. Chapter-Lecture 2+3
43 pages
Lec7 16.1 BM14 - CH16
No ratings yet
Lec7 16.1 BM14 - CH16
34 pages
Lec7 BM14 - CH15
No ratings yet
Lec7 BM14 - CH15
55 pages
Accounting Mock Exam 6 Ok
No ratings yet
Accounting Mock Exam 6 Ok
10 pages
Lec1 BM14 - CH06
No ratings yet
Lec1 BM14 - CH06
28 pages
Lec1 BM14 - CH08
No ratings yet
Lec1 BM14 - CH08
40 pages
Lecture 3
No ratings yet
Lecture 3
21 pages
Lecture 2
No ratings yet
Lecture 2
39 pages
Chapter 8 9 Extra
No ratings yet
Chapter 8 9 Extra
11 pages
BCTC 2022 Eng
No ratings yet
BCTC 2022 Eng
42 pages
Desing Thinking Lab Record
No ratings yet
Desing Thinking Lab Record
65 pages
AKG1212 CNC Router Specifications
No ratings yet
AKG1212 CNC Router Specifications
9 pages
Testing Rate at RUET 5-9-18
100% (4)
Testing Rate at RUET 5-9-18
5 pages
Barangay Nutrition Profile or Situation Analysis 1
No ratings yet
Barangay Nutrition Profile or Situation Analysis 1
2 pages
R481200E
No ratings yet
R481200E
2 pages
MBSM Dot Pro Private PDF NBM1119Y
No ratings yet
MBSM Dot Pro Private PDF NBM1119Y
1 page
Hymn of Grateful Praise Lyrics
No ratings yet
Hymn of Grateful Praise Lyrics
2 pages
2.3-2 Settiing-Up Wireless Access Point
100% (2)
2.3-2 Settiing-Up Wireless Access Point
19 pages
Dynamics Newton's Laws of Motion - Part 1
No ratings yet
Dynamics Newton's Laws of Motion - Part 1
27 pages
Mels Constuction Limitada: Commercial Management Mechanical Completion Certificate
No ratings yet
Mels Constuction Limitada: Commercial Management Mechanical Completion Certificate
1 page
LS English 7 Workbook Answers PDF Ellipsis Homeschooling 2
No ratings yet
LS English 7 Workbook Answers PDF Ellipsis Homeschooling 2
3 pages
Jsae Jaso M305-1988
100% (1)
Jsae Jaso M305-1988
25 pages
Model 88 System: High Speed. High Use Applications
No ratings yet
Model 88 System: High Speed. High Use Applications
2 pages
Idan's Thesis - Final
No ratings yet
Idan's Thesis - Final
32 pages
Solenoides Ss Series Parker
No ratings yet
Solenoides Ss Series Parker
34 pages
Chemistry Separation Techniques
No ratings yet
Chemistry Separation Techniques
14 pages
Organic Chemistry 6th Edition (Ebook PDF) Available Any Format
100% (5)
Organic Chemistry 6th Edition (Ebook PDF) Available Any Format
159 pages
Canon 2545i Error Codes
No ratings yet
Canon 2545i Error Codes
4 pages
Mobile and Cellular Communications Course Notes
No ratings yet
Mobile and Cellular Communications Course Notes
183 pages
Harnish New Scope-106650-CC-3880-1724722551
No ratings yet
Harnish New Scope-106650-CC-3880-1724722551
45 pages
DF-Cartex-SN Fermenter EN
No ratings yet
DF-Cartex-SN Fermenter EN
2 pages
Spectrum Wallboard Installation Manual V2
No ratings yet
Spectrum Wallboard Installation Manual V2
13 pages
User Manual For Model MPR-514-PA MPR-514R-PA Pharmaceutical Refrigerators 1440163966
No ratings yet
User Manual For Model MPR-514-PA MPR-514R-PA Pharmaceutical Refrigerators 1440163966
45 pages
RFA - SD 028 RevB 11kV 400V Substation Single Line Diagram
No ratings yet
RFA - SD 028 RevB 11kV 400V Substation Single Line Diagram
4 pages
Dat Questionaire
No ratings yet
Dat Questionaire
1 page
Datasheet - Type K Thermocouple
No ratings yet
Datasheet - Type K Thermocouple
2 pages
Creamy Carbonara Recipe Jamie Oliver Pasta Recipes
No ratings yet
Creamy Carbonara Recipe Jamie Oliver Pasta Recipes
1 page
Throwers Ten Exercise Program: What You Will Need
No ratings yet
Throwers Ten Exercise Program: What You Will Need
7 pages
"Home of The Wildcats": Thornton Township High School
No ratings yet
"Home of The Wildcats": Thornton Township High School
5 pages
Database Concurrency Techniques
No ratings yet
Database Concurrency Techniques
31 pages