Correlation
A bit about Pearson’s r
Questions
• Why does the maximum value of r equal 1.0?
• What does it mean when a correlation is positive? Negative?
• What is the purpose of the Fisher r to z transformation?
• What is range restriction? Range enhancement? What do they do to r?
• Give an example in which data properly analyzed by ANOVA cannot be used to infer causality.
• Why do we care about the sampling distribution of the correlation coefficient?
• What is the effect of reliability on r?
Basic Ideas
• Nominal vs. continuous IV
• Degree (direction) & closeness
(magnitude) of linear relations
– Sign (+ or -) for direction
– Absolute value for magnitude
• Pearson product-moment correlation
coefficient
$r = \frac{\sum z_X z_Y}{N}$
Illustrations
[Scatterplot: Weight by Height]
[Scatterplot: Errors by Study Time]
[Scatterplot: SAT-V by Toe Size]
Examples of positive, negative, and zero correlations.
Simple Formulas
$r = \frac{\sum xy}{N S_X S_Y}$, where $x = X - \bar{X}$, $y = Y - \bar{Y}$, and $S_X = \sqrt{\frac{\sum (X - \bar{X})^2}{N}}$.

Use either N throughout or else N - 1 throughout (in the SDs and in the denominator); the result is the same as long as you are consistent.

$\mathrm{Cov}(X, Y) = \frac{\sum xy}{N}$

$r = \frac{\sum z_X z_Y}{N}$

Pearson's r is the average cross product of z scores: the product of (standardized) moments from the means.
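As a check on these formulas, a small SAS sketch (the height/weight numbers below are made up for illustration) that gets r from PROC CORR and also as the average cross product of z scores:

/* Sketch: made-up height/weight data just to check the formulas. */
data htwt;
  input ht wt;
  datalines;
60 110
63 135
66 140
69 155
72 170
75 185
78 200
;

proc corr data=htwt;                                 /* Pearson r reported directly */
  var ht wt;
run;

proc standard data=htwt mean=0 std=1 out=zscores;    /* convert to z scores */
  var ht wt;
run;

data _null_;
  set zscores end=last;
  sumzz + ht*wt;                     /* accumulate cross products of z scores */
  if last then do;
    r = sumzz/(_n_ - 1);             /* PROC STANDARD uses the N-1 SD, so divide by N-1 */
    put 'r as the average cross product of z scores: ' r 6.3;
  end;
run;

Either divisor works, as long as the SDs and the denominator use the same one.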
Graphic Representation
[Scatterplots: Weight by Height in raw units (means: 66.8 inches, 150.7 lbs) and the same data in z-scores, with the quadrants labeled by the sign of the cross product]
1. Conversion from raw scores to z scores.
2. Points & quadrants: positive & negative cross products.
3. Correlation is the average of the cross products. The sign & magnitude of r depend on where the points fall.
4. The products are at a maximum (average = 1) when the points fall on the line where $z_X = z_Y$.
Descriptive Statistics
N Minimum Maximum Mean Std. Deviation
Ht 10 60.00 78.00 69.0000 6.05530
Wt 10 110.00 200.00 155.0000 30.27650
Valid N (listwise) 10
r = 1.0 when all the points fall exactly on a straight line.
Leave X alone and add error to Y: r drops to .99.
Add more error: r drops to .91.
With 2 variables, the correlation is the z-score slope (the slope of the line relating z_Y to z_X).
Review
• Why does the maximum value of r
equal 1.0?
• What does it mean when a correlation is
positive? Negative?
Sampling Distribution of r
Statistic is r, parameter is ρ (rho). In general, r is slightly
biased.

[Plot: sampling distributions of r for ρ = -.5, ρ = 0, and ρ = .5]

The sampling variance is approximately $\sigma_r^2 \approx \frac{(1 - \rho^2)^2}{N}$.

Sampling variance depends both on N and on ρ.
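For example, with ρ = .5 and N = 100 the approximation gives (1 - .25)²/100 ≈ .0056, a standard error of about .075; with ρ = 0 and N = 100 it gives 1/100, a standard error of .10.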
Empirical Sampling Distributions of the Correlation Coefficient
[Box plots of r from repeated samples under four conditions: ρ = .5, N = 100; ρ = .5, N = 50; ρ = .7, N = 100; ρ = .7, N = 50. The spread of r is larger for smaller N and for smaller ρ.]
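A small SAS simulation sketch (illustrative settings, not the run behind the plot above) that builds this kind of empirical sampling distribution for ρ = .5 and N = 50:

/* Sketch: simulate 1000 samples of N = 50 from a bivariate normal with rho = .5 */
data sim;
  call streaminit(20240101);
  rho = 0.5;
  do sample = 1 to 1000;
    do i = 1 to 50;
      x = rand('normal');
      y = rho*x + sqrt(1 - rho**2)*rand('normal');   /* y correlates .5 with x */
      output;
    end;
  end;
run;

proc corr data=sim noprint outp=corrs;   /* one correlation matrix per sample */
  by sample;
  var x y;
run;

data rvals;                              /* keep just the x-y correlation */
  set corrs;
  if _TYPE_ = 'CORR' and _NAME_ = 'x';
  r = y;
  keep sample r;
run;

proc means data=rvals mean std min max;  /* mean should be near .5 */
  var r;
run;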
Fisher’s r to z Transformation
$z = .5\,\ln\!\left(\frac{1 + r}{1 - r}\right)$

r     z
.10   .10
.20   .20
.30   .31
.40   .42
.50   .55
.60   .69
.70   .87
.80   1.10
.90   1.47

[Plot: z (output) as a function of r (sample value input)]

The sampling distribution of z is approximately normal and becomes more nearly normal as N increases. The transformation stretches the short tail of the distribution of r, producing a better (more normal) distribution. The sampling variance of z is 1/(N - 3), which does not depend on ρ.
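The r-to-z table above can be reproduced with a short SAS data step (sketch):

data fisherz;                          /* z = .5*ln((1+r)/(1-r)) for r = .1 to .9 */
  do r = 0.1 to 0.9 by 0.1;
    z = 0.5*log((1 + r)/(1 - r));
    output;
  end;
run;

proc print data=fisherz noobs; run;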
Hypothesis test 1: $H_0: \rho = 0$

$t = \frac{r\sqrt{N - 2}}{\sqrt{1 - r^2}}$   The result is compared to t with (N - 2) df for significance.

Say r = .25, N = 100:

$t = \frac{.25\sqrt{98}}{\sqrt{1 - .25^2}} = \frac{.25(9.899)}{.968} = 2.56$, p < .05

t(.05, 98) = 1.984.
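A SAS data-step sketch of this test (values from the example above):

data test1;                            /* H0: rho = 0 */
  r = 0.25; n = 100;
  t = r*sqrt(n - 2)/sqrt(1 - r**2);    /* about 2.56 */
  p = 2*(1 - probt(abs(t), n - 2));    /* two-tailed p value */
  tcrit = tinv(0.975, n - 2);          /* about 1.984 */
run;

proc print data=test1 noobs; run;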
Hypothesis test 2: $H_0: \rho = $ a specified value

$z = \frac{.5\ln\left(\frac{1 + r}{1 - r}\right) - .5\ln\left(\frac{1 + \rho}{1 - \rho}\right)}{\sqrt{1/(N - 3)}}$

A one-sample z test where r is the sample value and ρ is the hypothesized population value.

Say N = 200, r = .54, and ρ is .30:

$z = \frac{.5\ln\left(\frac{1.54}{.46}\right) - .5\ln\left(\frac{1.30}{.70}\right)}{\sqrt{1/(200 - 3)}} = \frac{.60 - .31}{.07} = 4.13$

Compare to the unit normal: 4.13 > 1.96, so it is significant. Our sample was not drawn from a population in which ρ is .30.
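A SAS data-step sketch of this one-sample z test (values from the example above):

data test2;                                  /* H0: rho = .30 with N = 200, r = .54 */
  r = 0.54; rho0 = 0.30; n = 200;
  zr   = 0.5*log((1 + r)/(1 - r));           /* Fisher z of the sample r   */
  zrho = 0.5*log((1 + rho0)/(1 - rho0));     /* Fisher z of the null value */
  z = (zr - zrho)/sqrt(1/(n - 3));           /* about 4.13                 */
  p = 2*(1 - probnorm(abs(z)));              /* two-tailed p value         */
run;

proc print data=test2 noobs; run;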
Hypothesis test 3: $H_0: \rho_1 = \rho_2$

Testing the equality of correlations from 2 INDEPENDENT samples.

$z = \frac{.5\ln\left(\frac{1 + r_1}{1 - r_1}\right) - .5\ln\left(\frac{1 + r_2}{1 - r_2}\right)}{\sqrt{1/(N_1 - 3) + 1/(N_2 - 3)}}$

Say N1 = 150, r1 = .63, N2 = 175, r2 = .70:

$z = \frac{.5\ln\left(\frac{1.63}{.37}\right) - .5\ln\left(\frac{1.70}{.30}\right)}{\sqrt{1/(150 - 3) + 1/(175 - 3)}} = \frac{.74 - .87}{.11} = -1.18$, n.s.
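A SAS data-step sketch of the two-sample test (values from the example above):

data test3;                            /* H0: rho1 = rho2, independent samples */
  r1 = 0.63; n1 = 150;
  r2 = 0.70; n2 = 175;
  z1 = 0.5*log((1 + r1)/(1 - r1));
  z2 = 0.5*log((1 + r2)/(1 - r2));
  z  = (z1 - z2)/sqrt(1/(n1 - 3) + 1/(n2 - 3));   /* about -1.1 with unrounded z's
                                                      (the slide's -1.18 uses rounded
                                                      values); n.s. either way */
  p  = 2*(1 - probnorm(abs(z)));
run;

proc print data=test3 noobs; run;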
Hypothesis test 4: $H_0: \rho_1 = \rho_2 = \ldots = \rho_k$

Testing the equality of any number of independent correlations.

$\bar{z} = \frac{\sum_{i=1}^{k}(n_i - 3)\,z_i}{\sum_i (n_i - 3)}$    $Q = \sum_i (n_i - 3)(z_i - \bar{z})^2$

Compare Q to chi-square with k - 1 df.

Study   r    n     z     (n-3)z   zbar   (z-zbar)^2   (n-3)(z-zbar)^2
1       .2   200   .20    39.94   .41    .0441         8.69
2       .5   150   .55    80.75   .41    .0196         2.88
3       .6    75   .69    49.91   .41    .0784         5.64
Sum          425          170.6                        17.21 = Q

Chi-square at .05 with 2 df = 5.99. Not all the ρ's are equal.
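A SAS data-step sketch that reproduces z-bar and Q for the three studies in the table:

data qtest;
  array rr[3] _temporary_ (0.2 0.5 0.6);     /* r's from the three studies */
  array nn[3] _temporary_ (200 150  75);     /* sample sizes               */
  sumwz = 0; sumw = 0; Q = 0;
  do i = 1 to 3;
    z = 0.5*log((1 + rr[i])/(1 - rr[i]));    /* Fisher z for each study    */
    sumwz = sumwz + (nn[i] - 3)*z;
    sumw  = sumw  + (nn[i] - 3);
  end;
  zbar = sumwz/sumw;                         /* weighted mean z, about .41 */
  do i = 1 to 3;
    z = 0.5*log((1 + rr[i])/(1 - rr[i]));
    Q = Q + (nn[i] - 3)*(z - zbar)**2;       /* Q about 17                 */
  end;
  p = 1 - probchi(Q, 2);                     /* chi-square with k-1 = 2 df */
  keep zbar Q p;
run;

proc print data=qtest noobs; run;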
Hypothesis test 5: dependent r's

$H_0: \rho_{12} = \rho_{13}$    (Hotelling-Williams test)

$t(N - 3) = (r_{12} - r_{13})\sqrt{\frac{(N - 1)(1 + r_{23})}{2\frac{N - 1}{N - 3}|R| + \bar{r}^2(1 - r_{23})^3}}$

where $\bar{r} = (r_{12} + r_{13})/2$ and $|R| = 1 - r_{12}^2 - r_{13}^2 - r_{23}^2 + 2(r_{12})(r_{13})(r_{23})$.

Say N = 101, r12 = .4, r13 = .6, r23 = .3:

$\bar{r} = (.4 + .6)/2 = .5$

$|R| = 1 - .4^2 - .6^2 - .3^2 + 2(.4)(.6)(.3) = .534$

$t(98) = (.4 - .6)\sqrt{\frac{(100)(1 + .3)}{2\frac{100}{98}(.534) + .5^2(1 - .3)^3}} = -2.1$

t(.05, 98) = 1.98, so the two dependent correlations differ significantly.
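A SAS data-step sketch of the Hotelling-Williams computation (values from the example above):

data hwtest;                                 /* Hotelling-Williams, dependent r's */
  n = 101; r12 = 0.4; r13 = 0.6; r23 = 0.3;
  rbar = (r12 + r13)/2;
  detR = 1 - r12**2 - r13**2 - r23**2 + 2*r12*r13*r23;    /* |R| = .534 */
  t = (r12 - r13)*sqrt( ((n - 1)*(1 + r23)) /
        ( 2*((n - 1)/(n - 3))*detR + rbar**2*(1 - r23)**3 ) );   /* about -2.1 */
  p = 2*(1 - probt(abs(t), n - 3));          /* compare to t with N-3 df */
run;

proc print data=hwtest noobs; run;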
$H_0: \rho_{12} = \rho_{34}$: see my notes.
Review
• What is the purpose of the Fisher r to z
transformation?
• Test the hypothesis that $\rho_1 = \rho_2$
  – given that r1 = .50, N1 = 103,
  – and r2 = .60, N2 = 128, and the samples are independent.
• Why do we care about the sampling
distribution of the correlation
coefficient?
Range Restriction/Enhancement
Range restriction (sampling only a narrow slice of X or Y) typically shrinks r toward zero; range enhancement (e.g., sampling only the extremes) typically inflates r relative to its value in the full range of scores.
Reliability
Reliability sets the ceiling for validity. Measurement error
attenuates correlations.
$\rho_{XY} = \rho_{T_X T_Y}\sqrt{\rho_{XX'}\,\rho_{YY'}}$

If the correlation between true scores is .7 and the reliabilities of X and Y are both .8, the observed correlation is .7·√(.8·.8) = .7(.8) = .56.
Disattenuated correlation
$\rho_{T_X T_Y} = \rho_{XY}\,/\,\sqrt{\rho_{XX'}\,\rho_{YY'}}$

If our observed correlation is .56 and the reliabilities of both X and Y are .8, our estimate of the correlation between true scores is .56/.8 = .70.
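A tiny SAS sketch of the attenuation and disattenuation formulas (reliabilities of .8 and a true-score correlation of .7, as in the examples above):

data reliab;
  rel_x = 0.8; rel_y = 0.8;
  r_true = 0.7;
  r_obs  = r_true*sqrt(rel_x*rel_y);   /* attenuated:    .56 */
  r_hat  = r_obs/sqrt(rel_x*rel_y);    /* disattenuated: .70 */
run;

proc print data=reliab noobs; run;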
Review
• What is range restriction? Range
enhancement? What do they do to r?
• What is the effect of reliability on r?
SAS Power Estimation
proc power;
  onecorr dist=fisherz
    corr = 0.35
    nullcorr = 0.2
    sides = 1
    ntotal = 100
    power = .;
run;

Computed Power:
  Actual alpha = .05
  Power = .486

proc power;
  onecorr
    corr = 0.35
    nullcorr = 0
    sides = 2
    ntotal = .
    power = .8;
run;

Computed N Total:
  Alpha = .05
  Actual Power = .801
  Ntotal = 61
Power for Correlations
Rho   N required (against null: rho = 0)
.10   782
.15   346
.20   193
.25   123
.30    84
.35    61
Sample sizes required for powerful conventional
significance tests for typical values of the correlation
coefficient in psychology. Power = .8, two tails,
alpha is .05.
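PROC POWER accepts a list of values for corr=, so something like the following sketch should reproduce the table in one run:

proc power;
  onecorr
    corr = 0.10 0.15 0.20 0.25 0.30 0.35   /* typical values of rho in psychology */
    nullcorr = 0
    sides = 2
    ntotal = .
    power = 0.8;
run;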