0% found this document useful (0 votes)

36 views25 pages

Chapter6 Tests Relation Variables

The document discusses various statistical tests for examining relationships between variables, including parametric and non-parametric tests. It provides information and examples of Pearson correlation tests, Spearman correlation tests, chi-square tests of independence, t-tests, ANOVA, and Kruskal-Wallis tests. Examples are given of applying these tests along with interpreting results and checking assumptions.

Uploaded by

Syed Shahzaib Asghar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

36 views25 pages

Chapter6 Tests Relation Variables

Uploaded by

Syed Shahzaib Asghar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 25

FACULTY OF ECONOMICS AND BUSINESS

CAMPUS BRUSSELS

Statistical Modelling
Tests for relation between 2 variables

1
Context

Parametric Non-parametric Non-parametric

(no normality or ordinal) (nominal)

1 sample t-test Sign test Binomial test

Wilcoxon signed-rank test
2 paired samples t-test differences /

2 independent samples t-test Mann-Whitney-Wilcoxon test Chi-square test

More than 2 ANOVA Kruskal-Wallis test Chi-square test

independent samples
Relation between 2 Pearson-correlation Spearman correlation Chi-square test
variables (linear relation two (relation between ordinal (relation two
quantitative variables) variables) qualitative
variables)

2
Chi-square test of independence
 Goal: Evaluate whether there is a statistical relation between two
qualitative variables.
o the two variables are independent
o the two variables are dependent
 Method: The chi-square test statistic is based on counts in the cross-table
of two variables. It measures the distance between
o observed counts
o expected counts if the two variables are statistically independent
number of rows in cross-table,
number of columns in cross-table

 If is true the chi-square statistic has a distribution with

degrees of freedom.
 Assumptions: (1) all , (2) not more than 20% cells with .

3
Chi-square test: approach
Example: Are the categorical variables education level and income
category related?

4
Chi-square test: approach
As we are at the boundary of violating the assumptions, we join the
categories college degree and post-undergraduate degree.

5
Pearson correlation test
 Goal: Evaluate whether two quantitative variables have a linear relation.
We also aim to assess the direction and strength of the linear relation.
 We distinguish
o The population correlation coefficient
o The sample correlation coefficient
 A correlation coefficient takes values between -1 and 1, i.e.
o means the variables are not related
o close to 0 means the variables have a weak relation
o means the variables have a perfect positive linear relation
o means the variables have a perfect negative linear relation

6
Pearson sample correlation
 Suppose we have a SRS of the variables and
The sample correlation between quantitative variables and is defined
as:

 A positive linear relation between and (see

top panel) means that observations with an -value
above average usually also have a -value above
average.
 A negative linear relation between and (see
bottom panel) means that observations with an -
value above average usually also have a -value
below average.

7
Pearson sample correlation

measures the size and direction of the linear relation between two
variables

8
Pearson sample correlation
measures the size and direction of the linear relation between and .
In this example but there is a strong non-linear (i.e., quadratic)
relation between and .

9
Pearson sample correlation
Outliers can have a very big effect on the sample correlation coefficient.

In this example one outlier increases the sample correlation from to

10
Sample Pearson correlation in SPSS
 We compute correlations between monthly wage, weekly working hours,
age for a sample of observations.
In SPSS: analyze/correlate/bivariate

Correlation
between age
monthly wage
= .302

11
Test
 there is no linear relation between and : .
 there is a linear relation between and : .
 If is true, and if has a bivariate Normal distribution (or if
than the test statistic is -distributed with degrees of
freedom:

12
Test
 If and have a bivariate normal distribution, the scatterplot has the
shape of an ellipse.
Bivariate normal distribution no bivariate normal distribution

Remark: if the Pearson correlation test is valid, even if and do

not have a bivariate Normal distribution.
13
Test in SPSS

14
Spearman correlation-test
 The non-parametric Spearman correlation test can be used
o to measure and test the relation between two ordinal qualitative
variables
o to measure and test the relation between two quantitative variables if
the assumptions of the Pearson correlation test are violated (i.e., small
sample and do not have a Bivariate Normal distribution).

The Spearman correlation (available in SPSS) is equivalent to the Pearson

correlation computed on the ranks of the observations.

We do not further discuss the test in this course.

15
Overview testing the relation between variables
(Parametric // non-parametric) test
2 quantitative variables:
Pearson correlation // Spearman correlation

 2 qualitative variables:
chi-square test

 1 quantitative variable and 1 qualitative variable

o qualitative variable with 2 categories:
independent samples -test // MWW-test
o qualitative variable with more than 2 categories
ANOVA // Kruskal Wallis-test

16
Exercise 1
 Suppose we have a sample of 4000 observations for the following
variables:
o Trust of a respondent in the government measured on a scale from 0 to 100.
o Country with categories 1=Belgium, 2=France, 3= the Netherlands
o Age measured in years
o Gender: nominal variable with categories 0=male, 1=female
Which test can you use to test whether there is a relation between
o Country and trust
o Gender and trust
o Country and gender
o Trust and age
 Formulate the null and alternative hypothesis for each test. Discuss
whether/when the proposed test is valid in the present context.

17
Exercise 2
 Consider the cross-table between two qualitative variables education level and
type of company for a sample of observations . The table contains
observed counts and expected counts if the variables are assumed to be
independent.

 Compute the expected counts for the first row of the table, compute the chi-
square test statistic and test (using ) the null hypothesis that education
level and type of company are statistically independent. Formulate a conclusion
about the result of the test.
18
Exercise 3
 We compute the Pearson sample correlation between household income
and years with current employer in a SRS of employees.
Correlations

1 ,625

N 850 850
,625 1

N 850 850

 Test whether the population correlation is positive (using ) and

draw a conclusion. Indicate whether the assumptions of the test are
satisfied.
19
Solution Exercise 1
Relation country and trust

 the null hypothesis is wrong
 To test , we can use a one-way ANOVA with as dependent variable
trust and as factor country.
 If the assumptions of the ANOVA are violated (residuals not normally
distributed, population variances not equal), the non-parametric Kruskal-
Wallis test could be used.
Relation gender and trust


 To test , we can use an independent samples t-test with as dependent
variable trust and as factor gender.

20
Solution Exercise 1
 The sample is very large and hence the t-statistic has an
approximate t-distribution if population variances for males and females
are equal. If the null-hypothesis of equal population variances for
males/females is rejected, a Welch correction to the t-statistic can be used.
Country and Gender
 country and gender are statistically independent
 country and gender are statistically dependent
 To test we can use a Pearson chi-square test on the cross-table country
x gender. The assumptions are (1) that all expected counts are larger than 1,
(2) that not more than 20% of the cells in the cross-table are smaller than 5.
 Stated otherwise, the chi-square test tests the null hypothesis that the
proportion of males is the same in the three countries:
versus is wrong

21
Solution Exercise 1
Trust and age


 To test that there is no linear relation between age and trust in the
population we can use a Pearson correlation test. As the sample size is
large the test statistic has an approximate t-distribution and
hence the test is valid.

22
Solution Exercise 2

 the variables company size and diploma are statistically independent

 the variables company size and diploma are statistically dependent
 Expected counts
o Small company and low education level: (794)(734)/1476=394.8
o Small company and average education level (794)(505)/1476=271.7
o Small company and high education level: (794)(237)/1476=127.5

23
Solution Exercise 2


 Let
 and hence we reject . We conclude with 95% confidence that
company size and diploma are statistically related.
 The assumptions of the test are satisfied:
o All expected counts are larger than or equal to 1
o There are no cells with an expected count smaller than 5, hence the
proportion of cells with is smaller than 20%.

24
Solution Exercise 3
 We test against H A :   0 with


 and hence we reject . We conclude with 95% confidence that the
population correlation between household income and years with current
employer is positive.
 The scatterplot shows that the assumption of a bivariate Normal
distribution for the two variables is doubtful. However, as the sample size
is large , the test statistic will have an approximate t-distribution
and hence the test is valid.
 Remark: to reduce the influence of outliers it is recommended to apply a
natural log transformation to household income.

Pearson R-Chi Square-ANOVA
No ratings yet
Pearson R-Chi Square-ANOVA
92 pages
Test of Difference Correlational SP Es
No ratings yet
Test of Difference Correlational SP Es
6 pages
Pearson Correlation in Inferential Statistics
No ratings yet
Pearson Correlation in Inferential Statistics
76 pages
Non Parametrics
No ratings yet
Non Parametrics
72 pages
Testing Hypothesis
No ratings yet
Testing Hypothesis
6 pages
Wage Relationship Analysis Techniques
No ratings yet
Wage Relationship Analysis Techniques
86 pages
Pearson R Correlation: Test
No ratings yet
Pearson R Correlation: Test
5 pages
Statistical Tests
No ratings yet
Statistical Tests
55 pages
Inferential Statistics
No ratings yet
Inferential Statistics
79 pages
Data Analysis: Parametric vs. Non-Parametric Tests
No ratings yet
Data Analysis: Parametric vs. Non-Parametric Tests
19 pages
Chapter 1 - Presentation-1
No ratings yet
Chapter 1 - Presentation-1
26 pages
SPSS Pearson R
No ratings yet
SPSS Pearson R
20 pages
Lecture 4 Regression Analysis
No ratings yet
Lecture 4 Regression Analysis
51 pages
Correlation (Pearson, Kendall, Spearman)
100% (1)
Correlation (Pearson, Kendall, Spearman)
4 pages
Inbound 2323158544640608273
No ratings yet
Inbound 2323158544640608273
149 pages
SPSS Guide for Social Science Students
No ratings yet
SPSS Guide for Social Science Students
27 pages
Week 11 - Correlational Research
No ratings yet
Week 11 - Correlational Research
36 pages
Spss Tutorials: Pearson Correlation
No ratings yet
Spss Tutorials: Pearson Correlation
10 pages
SPSS Def + Job Description
No ratings yet
SPSS Def + Job Description
54 pages
06 Correlational Statistics
No ratings yet
06 Correlational Statistics
32 pages
Biostatistics: Correlation & Regression
No ratings yet
Biostatistics: Correlation & Regression
60 pages
SPSS Training Program & Introduction To Statistical Testing: Variance and Variables
No ratings yet
SPSS Training Program & Introduction To Statistical Testing: Variance and Variables
13 pages
Add1 Bivariate Analysis
No ratings yet
Add1 Bivariate Analysis
4 pages
Chi Square
No ratings yet
Chi Square
12 pages
Main Concepts For Choosing The Right Statistical Treatment
No ratings yet
Main Concepts For Choosing The Right Statistical Treatment
19 pages
Pearson Correlation Coefficient Explained
No ratings yet
Pearson Correlation Coefficient Explained
6 pages
Statistical Tests and Types Explained
No ratings yet
Statistical Tests and Types Explained
31 pages
Correlations and Regssions
No ratings yet
Correlations and Regssions
4 pages
Understanding Correlation Analysis
No ratings yet
Understanding Correlation Analysis
102 pages
Short Term Training Programme On Data Analytics Using SPSS and RCMDR
No ratings yet
Short Term Training Programme On Data Analytics Using SPSS and RCMDR
20 pages
Recal5 RelationAnalysis
No ratings yet
Recal5 RelationAnalysis
83 pages
Understanding Correlation Coefficients
No ratings yet
Understanding Correlation Coefficients
44 pages
Chap2 Bivariate ANALYSIS
No ratings yet
Chap2 Bivariate ANALYSIS
19 pages
Module9-Correlation and Regression (Business)
No ratings yet
Module9-Correlation and Regression (Business)
15 pages
8614.educational Statitics Unit 7
No ratings yet
8614.educational Statitics Unit 7
39 pages
Correlation and Regression Original
No ratings yet
Correlation and Regression Original
44 pages
Psychstat Semifinals Reviewer
No ratings yet
Psychstat Semifinals Reviewer
5 pages
DAF1212 Business Statistics II
No ratings yet
DAF1212 Business Statistics II
67 pages
BUP-06-Correlation, Regression and Logistic
No ratings yet
BUP-06-Correlation, Regression and Logistic
27 pages
Psychstat Semifinals Reviewer (Bundalian)
No ratings yet
Psychstat Semifinals Reviewer (Bundalian)
8 pages
SPSS Explained 2nd Edition-312-339
No ratings yet
SPSS Explained 2nd Edition-312-339
28 pages
NonParametrics pt1
No ratings yet
NonParametrics pt1
13 pages
t-Tests and Correlations Explained
No ratings yet
t-Tests and Correlations Explained
2 pages
Correlation and Regression Analysis
No ratings yet
Correlation and Regression Analysis
5 pages
Research Paper
No ratings yet
Research Paper
20 pages
Statistical Tests Overview
No ratings yet
Statistical Tests Overview
2 pages
Correlation Methods Explained
No ratings yet
Correlation Methods Explained
2 pages
9.3 Parametric Tests
No ratings yet
9.3 Parametric Tests
4 pages
Week8 Tutorial
No ratings yet
Week8 Tutorial
17 pages
Business Statistics II Course Outline
No ratings yet
Business Statistics II Course Outline
71 pages
Topic 7 - Sample
No ratings yet
Topic 7 - Sample
44 pages
Quantitative Software Slides
No ratings yet
Quantitative Software Slides
97 pages
Chapter 5 Hypothesis Testing
No ratings yet
Chapter 5 Hypothesis Testing
27 pages
Linear Correlation 1205885176993532 3
No ratings yet
Linear Correlation 1205885176993532 3
102 pages
Egression & Orrelation: Nalysis
0% (1)
Egression & Orrelation: Nalysis
48 pages
Statistical Analysis Methods Overview
No ratings yet
Statistical Analysis Methods Overview
40 pages
Lecture 5557
No ratings yet
Lecture 5557
15 pages
Bivariate Pearson Correlation Overview
No ratings yet
Bivariate Pearson Correlation Overview
10 pages
Inferential Statistics with SPSS & Stata
No ratings yet
Inferential Statistics with SPSS & Stata
86 pages