UNIVERSITY OF GONDAR
DEPARTMENT OF STATISTICS
Training on Research Methodology and
Data Management Using Statistical
Software
April 4-8, 2016,
Bahirdar, Ethiopia
Research Methodologies
Overview of Presentation
Definition of Statistics
Classification of Statistics (Descriptive Statistics
and Inferential Statistics)
Definition of Some Statistical Words
T test
ANOVA
Chi Square Test
Regression and Correlation
Definition of Statistics
In the plural sense, statistics are the raw data themselves, such as
statistics of births, deaths, students, imports and exports, etc.
In the singular sense, statistics is the subject that deals with the
collection, organization, presentation, analysis and
interpretation of data.
Descriptive statistics
The major concern of descriptive statistics is to
present information in a convenient, usable, and
understandable form. Examples:
Frequency tables,
Graphs,
Measures of central tendency
(mean, median, and mode),
Measures of dispersion (variance
and standard deviation).
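The measures listed above can be sketched in Python using only the standard library. The scores are made-up values chosen for illustration and are not data from the training.

```python
import statistics
from collections import Counter

# Hypothetical test scores (illustrative only).
scores = [12, 15, 15, 18, 20]

# Frequency table: how often each score occurs.
frequency = Counter(scores)

# Measures of central tendency.
mean = statistics.mean(scores)
median = statistics.median(scores)
mode = statistics.mode(scores)

# Measures of dispersion (sample versions, dividing by n - 1).
variance = statistics.variance(scores)
std_dev = statistics.stdev(scores)

print("Frequency:", dict(frequency))
print("Mean:", mean, "Median:", median, "Mode:", mode)
print("Variance:", variance, "Std. deviation:", round(std_dev, 3))
```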
Inferential Statistics
It is not concerned with just describing the
obtained data.
Rather, it addresses the problem of making
broader generalizations or inferences from
sample data to the population.
Definition of words
Population
Sample
Parameter
Statistic
Variable
T-Test for Independent Groups
It is used for testing the differences between the means of
two independent groups.
For example:
“Do males and females differ in performance on a
standardized achievement test?”
“What is the effect of drug vs. no drug on blood pressure
of patients?”
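As a rough illustration of the drug vs. no drug example, the t statistic for two independent groups can be computed by hand. This sketch assumes equal variances in the two groups (the pooled-variance form of the test); the function name `independent_t` and the blood pressure readings are hypothetical. It returns only the statistic and its degrees of freedom, since the p-value needs the t distribution, which a package such as SPSS supplies.

```python
import math
import statistics

def independent_t(group_a, group_b):
    """Pooled-variance t statistic and degrees of freedom for two independent groups."""
    n1, n2 = len(group_a), len(group_b)
    mean1, mean2 = statistics.mean(group_a), statistics.mean(group_b)
    var1, var2 = statistics.variance(group_a), statistics.variance(group_b)
    # Pool the two sample variances, weighting each by its degrees of freedom.
    pooled_var = ((n1 - 1) * var1 + (n2 - 1) * var2) / (n1 + n2 - 2)
    std_error = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / std_error, n1 + n2 - 2

# Hypothetical systolic blood pressure readings (illustrative only).
drug = [120, 125, 130, 135, 140]
no_drug = [130, 135, 140, 145, 150]

t_stat, df = independent_t(drug, no_drug)
print("t =", round(t_stat, 3), "df =", df)
```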
Paired-Samples T-Test
It is used in repeated-measures or correlated-groups
designs, in which each subject is tested twice on the same
variable.
A common experiment of this type involves the before-and-after
design.
The test can also be used for the matched group design in
which pairs of subjects that are matched on one or more
characteristics (e.g., IQ, grades, and so forth) serve in the
two conditions.
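A before-and-after comparison can be sketched as follows. The paired test works on the difference within each pair, so the statistic is the mean difference divided by its standard error. The function name `paired_t` and the scores are hypothetical, chosen only to illustrate the calculation.

```python
import math
import statistics

def paired_t(first, second):
    """Paired-samples t statistic and degrees of freedom."""
    # One difference per subject (or per matched pair).
    differences = [a - b for a, b in zip(first, second)]
    n = len(differences)
    mean_diff = statistics.mean(differences)
    std_error = statistics.stdev(differences) / math.sqrt(n)
    return mean_diff / std_error, n - 1

# Hypothetical scores for the same five subjects (illustrative only).
before = [80, 75, 90, 85, 70]
after = [78, 70, 88, 80, 69]

t_stat, df = paired_t(before, after)
print("t =", round(t_stat, 3), "df =", df)
```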
One-Way Analysis of Variance (ANOVA)
It is an extension of the independent t-test.
It is used when the researcher is interested in
whether the means from several (> 2) independent
groups differ.
For example, if a researcher is interested in
investigating whether four ethnic groups differ in
their IQ scores, the one-way ANOVA can be used.
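The F statistic behind a one-way ANOVA compares the variation between group means with the variation within groups. The sketch below computes it directly; the function name `one_way_anova` and the three small groups are hypothetical and serve only to show the arithmetic.

```python
def one_way_anova(groups):
    """Return (F, df_between, df_within) for a list of independent groups."""
    all_values = [x for group in groups for x in group]
    grand_mean = sum(all_values) / len(all_values)
    group_means = [sum(group) / len(group) for group in groups]

    # Variation of the group means around the grand mean.
    ss_between = sum(
        len(group) * (m - grand_mean) ** 2
        for group, m in zip(groups, group_means)
    )
    # Variation of the observations around their own group mean.
    ss_within = sum(
        (x - m) ** 2
        for group, m in zip(groups, group_means)
        for x in group
    )

    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within

# Hypothetical scores for three groups (illustrative only).
groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]
f_stat, df_between, df_within = one_way_anova(groups)
print("F =", round(f_stat, 3), "df =", (df_between, df_within))
```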
Linear Regression and Correlation
Regression Analysis is a statistical technique that can be
used to develop a mathematical equation showing how
variables are related.
Correlation Analysis deals with the measurement of the
closeness of the relationship that is described by the
regression equation.
Example: the relationship between a student's study hours and the student's mark.
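For the study hours and marks example, a least-squares line and the Pearson correlation coefficient can be computed as below. This is a minimal sketch for one predictor; the function name `fit_line` and the data are hypothetical.

```python
import math

def fit_line(x, y):
    """Least-squares slope and intercept, plus Pearson's r, for paired data."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of squares and cross-products around the means.
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / math.sqrt(sxx * syy)
    return slope, intercept, r

# Hypothetical study hours and marks for five students (illustrative only).
hours = [1, 2, 3, 4, 5]
marks = [52, 55, 61, 64, 68]

slope, intercept, r = fit_line(hours, marks)
print("mark = %.1f + %.1f * hours, r = %.3f" % (intercept, slope, r))
```

The slope estimates the change in mark per additional study hour, while r (between -1 and 1) measures how closely the points follow the line.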
Test of association
The chi-square test is used to test the hypothesis of
independence of two attributes.
For instance, we may be interested in:
Whether the presence or absence of hypertension is independent
of smoking habit or not.
Whether the size of the family is independent of the level of
education attained by the mothers.
Whether there is association between father and son regarding
boldness.
Whether there is association between stability of marriage and
period of acquaintanceship prior to marriage.
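The hypertension and smoking example can be sketched as a 2 x 2 table. Under independence, the expected count in each cell is (row total x column total) / grand total, and the chi-square statistic sums the squared gaps between observed and expected counts, each divided by the expected count. The function name `chi_square_independence` and the counts are hypothetical.

```python
def chi_square_independence(observed):
    """Chi-square statistic and degrees of freedom for a table of counts."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    total = sum(row_totals)

    chi2 = 0.0
    for i, row in enumerate(observed):
        for j, count in enumerate(row):
            # Count expected in this cell if the two attributes were independent.
            expected = row_totals[i] * col_totals[j] / total
            chi2 += (count - expected) ** 2 / expected

    df = (len(observed) - 1) * (len(observed[0]) - 1)
    return chi2, df

# Hypothetical counts (illustrative only).
# Rows: smoker, non-smoker. Columns: hypertension present, absent.
observed = [[30, 20],
            [20, 30]]

chi2, df = chi_square_independence(observed)
print("chi-square =", chi2, "df =", df)
```

With 1 degree of freedom, the 5% critical value is about 3.84, so a statistic larger than that leads to rejecting independence at that level.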
What Would You Like to Test in Your
Office Using These Methods?
1) Independent t-test
2) Paired t-test
3) Chi-square test
4) ANOVA
5) Regression and Correlation
Thank You!!!
Introduction to SPSS
What is SPSS?
SPSS stands for “Statistical Package for the Social Sciences”.
It is a package of programs for manipulating,
analyzing, and presenting data.
SPSS Windows
Data Editor: the window that is open at start-up and
is used to enter and store data in a spreadsheet format.
Output Viewer: opens automatically when you run
an analysis or create a graph, using either a dialog box or
command syntax.
Data Editor Window
It is arranged in a spreadsheet format that contains variables in
columns and cases in rows. There are two sheets in the window.
The Data View is the sheet that is visible when you first
open the Data Editor and contains the data.
The Variable View is the second sheet; it contains information
about each variable that is stored with the dataset.
A variable name should not begin with a numeral or a special character
(such as a comma or an inequality symbol).
Inserting new variables:
In Variable View, click on the row below the position where you wish
to insert the new variable, click on Edit on the menu bar,
and then click on Insert Variable from the pull-down
menu.
Deleting Cases and Variables
To delete a variable: In Variable View, select the row number
of the variable that you wish to delete, click on Edit, and
then on Clear.
To delete a case: In Data View, click on the case
number that you wish to delete, click on Edit from the
menu, and then on Clear.
Computing New Variables
You may want to create new variables in your datasets.
Such operations can be performed using the Compute
option available from the menu in the Data Editor:
Transform > Compute...
Variables can also be computed conditionally by using
the If option.
Recoding Variables
You can also modify the values of existing variables in
your dataset.
Transform > Recode
Sorting cases
Sorting cases allows you to organize rows of data in
ascending or descending order on the basis of one or
more variables.
Data > Sort Cases...
DATA DESCRIPTION
Descriptive Statistics
Analyze > Descriptive Statistics > Descriptives...
Analyze > Descriptive Statistics > Frequencies...
Graphs > Legacy Dialogs
Finally, we can export our output in several
ways.
Cross tabulation
While frequencies show the numbers of cases in each level
of a categorical variable, they do not give information about
the relationship between categorical variables.
The Crosstabs procedure is useful for investigating this type
of information because it can provide information about the
intersection of two variables.
Analyze > Descriptive Statistics > Crosstabs...
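To see what a cross tabulation contains, independent of SPSS, the cell counts can be built by counting each combination of two categorical variables. The variables (sex and smoking status) and the cases below are hypothetical.

```python
from collections import Counter

# Hypothetical cases as (sex, smoker) pairs (illustrative only).
cases = [
    ("male", "yes"), ("male", "no"), ("female", "yes"),
    ("female", "yes"), ("male", "no"), ("female", "no"),
]

# Count how many cases fall in each combination of the two variables.
cell_counts = Counter(cases)

row_levels = sorted({sex for sex, _ in cases})
col_levels = sorted({smoker for _, smoker in cases})

# Print one row per level of the first variable, one column per level of the second.
print("sex".ljust(8) + "".join(level.ljust(6) for level in col_levels))
for row in row_levels:
    cells = "".join(str(cell_counts[(row, col)]).ljust(6) for col in col_levels)
    print(row.ljust(8) + cells)
```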
Exercise:
Create different variables from your field
Enter your data in SPSS
Produce summary statistics using:
Frequency distribution
Descriptive statistics
Graph/diagram
Chi-square test
The Chi-square test for independence is used in
situations where you have two categorical
variables.
Analyze > Descriptive Statistics > Crosstabs... (then select Chi-square under Statistics...)
Compare Means- T tests
The t test is a useful technique for comparing the mean
values of two sets of numbers.
T tests can be used to compare:
Two independent groups (independent-samples t test), or
Observations from two measurement
occasions for the same group (paired-samples t test).
Analyze > Compare Means > Independent-Samples T Test...
Analyze > Compare Means > Paired-Samples T Test...
One-Way ANOVA
It is a generalization of the independent-samples t
test that is used to compare the means of more than two
groups' scores on the same variable.
Analyze > Compare Means > One-Way ANOVA...
Correlation and Regression
Regression is a technique that can be used to investigate the
effect of one or more predictor variables on an
outcome variable; correlation measures the strength of the
relationship between variables.
Analyze > Regression > Linear...
Analyze > Correlate > ...
Thank you!!!