Introduction To Statistics

The document provides an overview of statistics, focusing on measures of central tendency such as mean, median, and mode, as well as measures of dispersion like range, standard deviation, and variance. It emphasizes the importance of understanding both descriptive and inferential statistics, along with the potential pitfalls of using mean values affected by outliers. Additionally, it discusses the significance of graphing data for better visualization and understanding of statistical information.

Uploaded by

Shalini Kakkar

Introduction to Statistics

Measures of Central Tendency

Two Types of Statistics
• Descriptive statistics of a POPULATION
• Relevant notation (Greek):
– $\mu$ mean
– $N$ population size
– $\Sigma$ sum
• Inferential statistics of SAMPLES from a
population.
– Assumptions are made that the sample reflects
the population in an unbiased form. Roman
notation:
– $\bar{X}$ mean
– $n$ sample size
– $\Sigma$ sum
• Be careful though because you may
want to use inferential statistics even
when you are dealing with a whole
population.

• Measurement error or missing data may
mean that treating a population as
complete gives us inefficient
estimates.
– It depends on the type of data and project.
– Example of Democratic Peace.
• Also, be careful about the phrase
“descriptive statistics”. It is used
generically in place of measures of
central tendency and dispersion for
inferential statistics.

• Another name is “summary statistics”,
which are univariate:
– Mean, Median, Mode, Range, Standard
Deviation, Variance, Min, Max, etc.
Measures of Central Tendency
• These measures describe the center, or
typical value, of a set of scores or values in
the data.
– Mean
– Median
– Mode
What do you “Mean”?
The “mean” of some data is the average
score or value, such as the average
age of an MPA student or average
weight of professors that like to eat
donuts.

Inferential mean of a sample: $\bar{X} = \frac{\sum X}{n}$

Mean of a population: $\mu = \frac{\sum X}{N}$
Problem of being “mean”
• The main problem associated with the
mean value of some data is that it is
sensitive to outliers.

• For example, the average weight of political
science professors might be affected if
there were one in the department who
weighed 600 pounds.
Donut-Eating Professors
Professor       Weight   Weight (with outliers)
Schmuggles      165      165
Bopsey          213      213
Pallitto        189      410
Homer           187      610
Schnickerson    165      165
Levin           148      148
Honkey-Doorey   251      251
Zingers         308      308
Boehmer         151      151
Queenie         132      132
Googles-Boop    199      199
Calzone         227      227
Mean            194.6    248.3
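
The effect of the two inflated weights on the mean can be checked with a short sketch. The variable names are made up for illustration; the numbers are copied from the table.

```python
from statistics import mean

# Weights in pounds, copied from the first column of the table.
weights = [165, 213, 189, 187, 165, 148, 251, 308, 151, 132, 199, 227]

# Second column: same data, but Pallitto (189 -> 410) and
# Homer (187 -> 610) are replaced with extreme values.
weights_outliers = list(weights)
weights_outliers[2] = 410
weights_outliers[3] = 610

mean_original = mean(weights)
mean_outliers = mean(weights_outliers)

print(f"mean without outliers: {mean_original:.2f}")
print(f"mean with outliers:    {mean_outliers:.2f}")
```

Changing only two of the twelve values moves the mean by more than 50 pounds, which is the sensitivity to outliers described above.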
The Median (not the cement in the middle of
the road)

• Because the mean can be
sensitive to extreme values, the median is
sometimes a more useful measure of the typical value.
• The median is simply the middle value
among the rank-ordered scores of a variable. (no
standard formula for its computation)
What is the Median?
Professor       Weight   Weight (rank ordered)
Schmuggles      165      132
Bopsey          213      148
Pallitto        189      151
Homer           187      165
Schnickerson    165      165
Levin           148      187
Honkey-Doorey   251      189
Zingers         308      199
Boehmer         151      213
Queenie         132      227
Googles-Boop    199      251
Calzone         227      308
Mean            194.6

Rank order and choose middle value. If even then average between two in the middle.
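
A minimal sketch of the rank-and-pick-the-middle procedure, using the weights from the table. The variable names are illustrative, and the standard library's `statistics.median` is used only as a cross-check.

```python
from statistics import median

weights = [165, 213, 189, 187, 165, 148, 251, 308, 151, 132, 199, 227]

# Step 1: rank order the scores.
ranked = sorted(weights)
n = len(ranked)

# Step 2: pick the middle value; with an even count,
# average the two values in the middle.
if n % 2 == 1:
    manual_median = ranked[n // 2]
else:
    manual_median = (ranked[n // 2 - 1] + ranked[n // 2]) / 2

print(ranked)
print(manual_median)  # 188.0, the average of 187 and 189
```

With twelve professors the count is even, so the median is the average of the sixth and seventh ranked weights.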
Percentiles
• The median is the 50th percentile. From it we can go up
or down and rank the data as being above
or below other thresholds.
• You may be familiar with percentiles from standardized
tests: at the 90th percentile, your score was
higher than 90% of the rest of the sample.
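
As a rough sketch of the idea, the function below computes the percent of the rest of the sample that a given score beats. The name `percentile_rank` is hypothetical, and this is only one of several percentile definitions in use; statistical packages may interpolate differently.

```python
def percentile_rank(score, scores):
    """Percent of the *other* scores that fall strictly below `score`.

    Assumes `score` appears in `scores`; list.remove raises
    ValueError otherwise.
    """
    others = list(scores)
    others.remove(score)  # compare against the rest of the sample
    below = sum(1 for s in others if s < score)
    return 100 * below / len(others)


weights = [165, 213, 189, 187, 165, 148, 251, 308, 151, 132, 199, 227]

heaviest = percentile_rank(308, weights)
lightest = percentile_rank(132, weights)
print(heaviest, lightest)
```

Under this definition the heaviest professor is above everyone else in the sample and the lightest is above no one.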
The Mode (hold the pie and the ala)
(What does ‘ala’ taste like anyway??)

• The most frequent response or value
for a variable.
• Multiple modes are possible: bimodal
or multimodal.
Figuring the Mode
Professor       Weight
Schmuggles      165
Bopsey          213
Pallitto        189
Homer           187
Schnickerson    165
Levin           148
Honkey-Doorey   251
Zingers         308
Boehmer         151
Queenie         132
Googles-Boop    199
Calzone         227

What is the mode? Answer: 165

Important descriptive information that may help inform your research and diagnose problems like lack of variability.
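
One way to find the mode in code is sketched below with the standard library's `statistics.multimode` (Python 3.8 or later), which returns every value tied for most frequent. The second list is made-up data to show a bimodal case.

```python
from statistics import multimode

weights = [165, 213, 189, 187, 165, 148, 251, 308, 151, 132, 199, 227]

# Only 165 appears more than once, so it is the single mode.
modes = multimode(weights)
print(modes)  # [165]

# Hypothetical bimodal data: two values tie for most frequent.
bimodal = multimode([1, 1, 2, 2, 3])
print(bimodal)  # [1, 2]
```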
Measures of Dispersion (not something
you cast…)

• Measures of dispersion tell us about
variability in the data. Also univariate.
• Basic question: how much do the values of
a variable differ, from the min to the max, and
how far apart are the scores in between? We
use:
– Range
– Standard Deviation
– Variance
• Remember that we said in order to glean
information from data, i.e. to make an
inference, we need to see variability in
our variables.

• Measures of dispersion give us
information about how much our
variables vary from the mean, because if
they don’t, it is difficult to infer
anything from the data. Dispersion is
also known as the spread or range of
variability.
The Range (no Buffalo roaming!!)
• r = h – l
– Where h is the highest value and l is the lowest value
• In other words, the range gives us the
distance between the minimum and maximum
values of a variable.
• Understanding this statistic is important in
understanding your data, especially for
management and diagnostic purposes.
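
Applied to the professors' weights, the range formula can be sketched as follows. The variable names mirror the r, h and l of the formula above.

```python
weights = [165, 213, 189, 187, 165, 148, 251, 308, 151, 132, 199, 227]

h = max(weights)  # highest value
l = min(weights)  # lowest value
r = h - l         # range

print(h, l, r)  # 308 132 176
```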
The Standard Deviation
• A standardized measure of distance from
the mean.

• Very useful and something you will read
about when making predictions or other
statements about the data.
Formula for Standard Deviation

S = ( X  X ) 2

(n - 1)
=square root
=sum (sigma)
X=score for each point in data
_
X=mean of scores for the variable
n=sample size (number of
observations or cases
Professor       X       X - mean   (X - mean) squared
Schmuggles      165     -29.6      875.2
Bopsey          213     18.4       339.2
Pallitto        189     -5.6       31.2
Homer           187     -7.6       57.5
Schnickerson    165     -29.6      875.2
Levin           148     -46.6      2170.0
Honkey-Doorey   251     56.4       3182.8
Zingers         308     113.4      12863.3
Boehmer         151     -43.6      1899.5
Queenie         132     -62.6      3916.7
Googles-Boop    199     4.4        19.5
Calzone         227     32.4       1050.8

Mean = 194.6   Variance = 2480.1   Standard deviation = 49.8
We can see that the Standard Deviation equals 49.8
pounds. The weight of Zingers is still likely skewing this
calculation (indirectly through the mean).
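
The table's columns can be reproduced step by step. This is a sketch of the sample formula with n - 1 in the denominator; the variable names are illustrative, and `statistics.stdev` is included only as a cross-check.

```python
from math import sqrt
from statistics import mean, stdev

weights = [165, 213, 189, 187, 165, 148, 251, 308, 151, 132, 199, 227]

x_bar = mean(weights)                      # mean of the scores
deviations = [x - x_bar for x in weights]  # the "X - mean" column
squared = [d ** 2 for d in deviations]     # the "squared" column

# Sum the squared deviations, divide by n - 1, take the square root.
s = sqrt(sum(squared) / (len(weights) - 1))

print(round(s, 1))                  # 49.8
print(abs(s - stdev(weights)) < 1e-9)
```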
Example of S in use
• Boehmer-Sobek paper.
– One standard deviation increase in
the value of X variable increases the
Probability of Y occurring by some
amount.
Table 2: Development and Relative Risk of Territorial Claim

                            Probability*   % Change
Baseline                    0.0401
development                 0.0024         -94.3
pop density                 0.0332         -17.3
pop growth                  0.0469         16.8
Capability                  0.0813         102.5
Openness                    0.0393         -2
Capability and pop growth   0.0942         134.8

* Change in prob after 1 sd change in given x variable, holding others at their means
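
The "% Change" column appears to be the change in probability relative to the baseline. The sketch below assumes that reading; `percent_change` is a hypothetical helper, not something from the paper. Because the probabilities are shown rounded to four decimals, the recomputed values differ from the published column by a few tenths of a point.

```python
baseline = 0.0401

# Probabilities copied from Table 2.
probabilities = {
    "development": 0.0024,
    "pop density": 0.0332,
    "pop growth": 0.0469,
    "Capability": 0.0813,
    "Openness": 0.0393,
    "Capability and pop growth": 0.0942,
}


def percent_change(p, base):
    """Percent change of probability p relative to the baseline."""
    return 100 * (p - base) / base


changes = {name: percent_change(p, baseline) for name, p in probabilities.items()}
for name, change in changes.items():
    print(f"{name:<28}{change:8.1f}")
```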
Let’s go to computers!
• Type in data in the Excel sheet.
Variance

$S^2 = \frac{\sum (X - \bar{X})^2}{n - 1}$
• Note that this is the same equation as the standard deviation, except that
no square root is taken.
• It is not often reported directly in research
but instead is a building block for other statistical
methods
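
A quick check of the relationship between the two statistics, sketched with the standard library's `statistics.variance` and `statistics.stdev`, both of which use the sample (n - 1) denominator:

```python
from statistics import stdev, variance

weights = [165, 213, 189, 187, 165, 148, 251, 308, 151, 132, 199, 227]

s2 = variance(weights)  # sample variance, no square root taken
s = stdev(weights)      # sample standard deviation

print(round(s2, 1))     # 2480.1
print(abs(s ** 2 - s2) < 1e-6)
```

Squaring the standard deviation recovers the variance, which is why the variance is reported in squared units (here, pounds squared).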
Organizing and Graphing Data
Goal of Graphing?
1. Presentation of Descriptive Statistics
2. Presentation of Evidence
3. Some people understand subject matter better with visual aids
4. Provide a sense of the underlying data generating process (scatter-plots)
What is the Distribution?
• Gives us a picture of
the variability and
central tendency.
• Can also show the
amount of skewness
and kurtosis.
Graphing Data: Types
Creating Frequencies
• We create frequencies by sorting data
by value or category and then
summing the cases that fall into those
values.
• How often do certain scores occur?
This is a basic descriptive data
question.
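
Counting how often each score occurs can be sketched with `collections.Counter`, here applied to the professors' weights from the earlier tables:

```python
from collections import Counter

weights = [165, 213, 189, 187, 165, 148, 251, 308, 151, 132, 199, 227]

# Tally the number of cases for each distinct value.
frequencies = Counter(weights)

# Print the values in sorted order with their counts.
for value in sorted(frequencies):
    print(value, frequencies[value])
```

With raw weights nearly every value occurs once, which is why grouping the data into classes usually comes before graphing.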
Ranking of Donut-eating Profs.
(most to least)
Zingers         308
Honkey-Doorey   251
Calzone         227
Bopsey          213
Googles-Boop    199
Pallitto        189
Homer           187
Schnickerson    165
Schmuggles      165
Boehmer         151
Levin           148
Queenie         132
Here we have placed the professors into
weight classes and depicted the counts with a histogram in
columns.
[Figure: column histogram, "Weight Class Intervals of Donut-Munching Professors". Y-axis: Number (0 to 3.5). X-axis weight classes: 130-150, 151-185, 186-210, 211-240, 241-270, 271-310, 311+]
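
The counts behind the histogram can be rebuilt from the ranked weights. The sketch below assumes the class bounds are inclusive, as the axis labels suggest, and prints a plain-text version of the chart; the `classes` list and variable names are illustrative.

```python
weights = [165, 213, 189, 187, 165, 148, 251, 308, 151, 132, 199, 227]

# (label, low, high) with inclusive bounds; None marks the open-ended top class.
classes = [
    ("130-150", 130, 150),
    ("151-185", 151, 185),
    ("186-210", 186, 210),
    ("211-240", 211, 240),
    ("241-270", 241, 270),
    ("271-310", 271, 310),
    ("311+", 311, None),
]

counts = {label: 0 for label, _, _ in classes}
for w in weights:
    for label, low, high in classes:
        if w >= low and (high is None or w <= high):
            counts[label] += 1
            break  # each weight belongs to exactly one class

# One row per class, one '#' per professor.
for label, count in counts.items():
    print(f"{label:>8} {'#' * count} ({count})")
```

The tallest classes hold three professors each, consistent with a vertical axis that runs to 3.5.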
Here is the same histogram depicted
as a horizontal bar graph.

[Figure: horizontal bar graph, "Weight Class Intervals of Donut-Munching Professors". Vertical axis weight classes: 130-150, 151-185, 186-210, 211-240, 241-270, 271-310, 311+. Horizontal axis: Number (0 to 3.5)]

Pie Charts:
[Figure: pie chart, "Proportions of Donut-Eating Professors by Weight Class". Slices: 130-150, 151-185, 186-210, 211-240, 241-270, 271-310, 311+]
Actually, why not use a donut
graph. Duh!
[Figure: donut chart, "Proportions of Donut-Eating Professors by Weight Class". Segments: 130-150, 151-185, 186-210, 211-240, 241-270, 271-310, 311+]

See Excel for other options!!!!

Line Graphs: A Time Series
[Figure: time-series line graph with series "Approval" and "Economic approval". Y-axis: Approval. X-axis: Month]
Scatter Plot (Two variable)
[Figure: scatter plot, "Presidential Approval and Unemployment". Y-axis: Approval (0 to 100). X-axis: Unemployment (0 to 12)]
