Advanced Statistics
Review
Dr. Bhongybz '14
Why Statistics
Quantitative research will generate masses of
numerical raw data
its not in a suitable form to draw any conclusions its not easily digested!
It requires summarising and analysis or testing
before the research question can be answered or
hypotheses supported or rejected.
Statistical analysis is the method for achieving this.
Dr. Bhongybz '14
2 types of statistics
Descriptive Statistics
Summarise and describe the data
Inferential Statistics
which are for testing the data so we can
draw conclusions
Dr. Bhongybz '14
Descriptive statistics
Mean
the average
Median
mid-point, divides values in to two halves
Mode
the most frequently occurring value
These are measures of Central tendency:
how the data is clustered together
Dr. Bhongybz '14
Descriptive Statistics
Range
The difference between lowest to highest
value
Standard Deviation
The average deviation from the mean
These are Measures of dispersion, how spread
out the data is
Dr. Bhongybz '14
Inferential Statistics
Are used to test for differences
between groups, or
test for associations (correlations) in
the data
It allows the researcher to test
hypotheses that these differences or
associations exist
Dr. Bhongybz '14
Inferential Statistics
There are many inferential statistical
tests
They are designed for different sorts
of data, and
Different experimental designs, and
Have different rules (assumptions)
that have to be followed
Dr. Bhongybz '14
Inferential Statistics
Are divided into Parametric and NonParametric tests, e.g:
Chi Square = non-parametric
T-test = parametric
The parametric tests are more powerful,
but
They require higher level data and have
stricter rules (assumptions)
Dr. Bhongybz '14
Levels of Data
Nominal
(for non-parametric)
Naming, categories, e.g. gender
Ordinal
(for non-parametric)
Ranked data, e.g. nurses grade
Interval data
(for parametric)
On a scale with equal intervals, e.g.
temperature in centigrade
Ratio
(for parametric)
On a scale with a true zero, e.g. temperature in
Kelvin
Dr. Bhongybz '14
Probability
Inferential statistical tests are reported
with a probability that the result is due
to chance alone (the alpha level)
Usually this is expressed as p 0.05
Meaning that there is a 0.05 probability
that the result was mere chance, or a
95% certainty that it was a real effect
Dr. Bhongybz '14
10
Levels of Probability:
The 0.05 level and below are the conventions used in research
p=
1.0
= % chance
100%
or:
1 in 1
(dead cert!)
0.5
50%
1 in 2
(like toss of a coin)
0.05
5%
5 in 100, or 1 in 20
0.01
1%
1 in 100
0.001
0.1%
Dr. Bhongybz '14
1 in 1000
11
Hypotheses
Are used in experiments
They are statements of predicted
relationships between two or more
variables
Eg:
Back massage reduces anxiety
Dr. Bhongybz '14
12
Variables
In this example Back Massage is the
INDEPENDENT variable (IV)
This is manipulated / controlled by the
researcher
Anxiety is the DEPENDENT variable (DV)
This is measured to observe for changes
Dr. Bhongybz '14
13
Testing Hypotheses
We assume that there will be no effect or
difference in our test so we actually test
what is called
The Null Hypothesis
So, in our example, the null hypothesis
(H0) is:
There is no difference between back
massage and control groups anxiety levels
Dr. Bhongybz '14
14
Testing hypotheses
If the result is significant (p 0.05), the
the Null Hypothesis is rejected,
And the research (H1), or alternative
(Halt) hypothesis is accepted
Its like the principal of innocent
until proven guilty
Dr. Bhongybz '14
15
Statistical Significance
If we reject the null hypothesis (at say
the 0.05 level) this is like saying we are
95% certain that the findings did not occur
due to chance,
in other words, the measured effect is real
(at least we are 95% sure)
There is still a 5% (or 1 in 20) chance we
are drawing the wrong conclusion
Dr. Bhongybz '14
16
Type 1 & Type 2 errors
A type 1 error is a false positive - the
researcher incorrectly rejects the null
hypothesis - and declares a significant
finding
A type 2 error is a false negative when
the researcher incorrectly supports the null
hypothesis - and reports that there is no
effect / difference
Dr. Bhongybz '14
17
Type 1 errors
Risk of Type 1 errors is reduced by adopting a
more stringent alpha level (eg requiring
p 0.01 or p 0.001 instead of p 0.05
One may wish to reduce this risk if the
consequences of a false positive (type 1) error
are serious, such as in a drug trial
As one reduces the risk of Type 1 errors, the risk
of Type 2 errors increases, unless steps are
taken to prevent this
Dr. Bhongybz '14
18
Type 2 errors
The best way of reducing type 2 errors is to
increase the sample size in a study
This will increase the power of a study, so that
it is more likely to detect differences that exist
Power Analysis is a method for determining
adequate sample size
The convention is that power (beta) should be
set at 0.8
that is the probability of making a type 2 error is
1-0.8 = 0.2 or 20%
Dr. Bhongybz '14
19
Type I and Type II Errors
Dr. Bhongybz '14
20
Summary
There are Descriptive & Inferential Statistics
Inferential = Parametric OR Non-Parametric
Choice depends on Level of Data & Assumptions
Inferential Statistics are for testing hypotheses
Findings are reported as a probability that they are due to
chance
We say they are statistically significant if p 0.05
We may make type 1 or type 2 errors when drawing
conclusions
Dr. Bhongybz '14
21