Lecture 14. HGR Maximal Correlation (After-class)
Notes: The HGR correlation is a correlation metric in statistics. Compared with commonly used correlation measures, it has the advantage of capturing non-linear statistical dependence. The HGR maximal correlation can be computed by the ACE (Alternating Conditional Expectations) algorithm.
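As a concrete illustration, here is a minimal sketch of ACE specialized to finite alphabets, assuming the joint PMF is given as a 2-D array with full-support marginals (the function name `ace_maximal_correlation` and the fixed iteration count are illustrative choices, not from the notes):

```python
import numpy as np

def ace_maximal_correlation(P, n_iters=200):
    """Estimate the HGR maximal correlation of a finite joint PMF P[x, y]
    by alternating conditional expectations (power-iteration style).
    Assumes both marginals of P have full support."""
    Px, Py = P.sum(axis=1), P.sum(axis=0)                   # marginals P_X, P_Y
    f = np.random.default_rng(0).standard_normal(len(Px))   # arbitrary initial f
    for _ in range(n_iters):
        g = (P * f[:, None]).sum(axis=0) / Py   # g(y) <- E[f(X) | Y = y]
        g -= Py @ g                             # center: E[g(Y)] = 0
        g /= np.sqrt(Py @ g**2)                 # scale:  E[g^2(Y)] = 1
        f = (P * g[None, :]).sum(axis=1) / Px   # f(x) <- E[g(Y) | X = x]
        f -= Px @ f                             # center: E[f(X)] = 0
        f /= np.sqrt(Px @ f**2)                 # scale:  E[f^2(X)] = 1
    return (P * np.outer(f, g)).sum(), f, g     # rho ~ E[f(X) g(Y)], features

# e.g. for the binary example worked out later in these notes:
# ace_maximal_correlation(np.array([[1/3, 1/6], [1/6, 1/3]]))[0]  ->  ~1/3
```

Each pass applies the conditional-expectation operator and re-standardizes, so the iteration behaves like power iteration on that operator.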
HGR: Hirschfeld-Gebelein-Rényi
Setup
Given two discrete random variables X, Y, we want to measure the correlation between X and Y. How can we do this?
Example (Pearson Correlation Coefficient):

$$\gamma(X, Y) \triangleq \frac{\mathbb{E}[(X - \mathbb{E}[X])(Y - \mathbb{E}[Y])]}{\sqrt{\operatorname{var}(X)\operatorname{var}(Y)}}$$
However, γ(X, Y) = 0 ⇏ X, Y are independent: zero Pearson correlation does not rule out non-linear dependence.
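For instance (an illustrative numerical check, not part of the notes), take X uniform on {−1, 0, 1} and Y = X²: Y is a deterministic function of X, yet the Pearson coefficient vanishes.

```python
import numpy as np

# X uniform on {-1, 0, 1}, Y = X^2: fully dependent, yet Pearson-uncorrelated,
# since cov(X, Y) = E[X^3] - E[X] E[X^2] = 0 by symmetry.
x = np.array([-1.0, 0.0, 1.0])
y = x ** 2
print(np.corrcoef(x, y)[0, 1])  # 0.0
```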
A good correlation measure ρ(X, Y) should satisfy:
1. symmetry: ρ(X, Y) = ρ(Y, X)
2. 0 ≤ ρ(X, Y) ≤ 1
3. ρ(X, Y) = 0 if and only if X, Y are independent
4. ρ(X, Y) = 1 if Y = ξ(X) or X = η(Y), for some deterministic functions ξ, η
5. for one-to-one functions ξ, η: ρ(ξ(X), η(Y)) = ρ(X, Y)
Example (Pearson Correlation Coefficient):

$$\gamma(X, Y) \triangleq \frac{\mathbb{E}[(X - \mathbb{E}[X])(Y - \mathbb{E}[Y])]}{\sqrt{\operatorname{var}(X)\operatorname{var}(Y)}}, \qquad -1 \le \gamma(X, Y) \le 1$$

|γ(X, Y)| satisfies (1), (2), but not (3), (4), (5).

|γ(X, Y)| = 1 iff X = aY + b or Y = cX + d, i.e., only for linear relationships.
Example (Mutual Information):
$$I(X; Y) \triangleq \sum_{x,y} P_{XY}(x, y) \log \frac{P_{XY}(x, y)}{P_X(x)\, P_Y(y)}$$
I(X; Y) satisfies (1), (3), (5), but not (2), (4).
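A quick check of why (2) fails (an illustrative computation, not from the notes): if X = Y is uniform on a 4-letter alphabet, then I(X; Y) = H(X) = 2 bits > 1. The same example shows why (4) fails: Y = X is a deterministic function of X, yet I(X; Y) ≠ 1 in general.

```python
import numpy as np

# X = Y uniform on {0, 1, 2, 3}: I(X; Y) = H(X) = 2 bits, so 0 <= I <= 1 fails
P = np.eye(4) / 4                          # joint PMF P(x, y)
Px, Py = P.sum(axis=1), P.sum(axis=0)
mask = P > 0                               # only sum over the support
print((P[mask] * np.log2((P / np.outer(Px, Py))[mask])).sum())  # 2.0
```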
Can we find some ρ(X, Y) satisfying (1)-(5)?
Definition: The HGR maximal correlation ρ(X, Y) is defined as

$$\rho(X, Y) \triangleq \max_{\substack{f : \mathcal{X} \to \mathbb{R},\ g : \mathcal{Y} \to \mathbb{R} \\ \mathbb{E}[f(X)] = \mathbb{E}[g(Y)] = 0 \\ \mathbb{E}[f^2(X)] = \mathbb{E}[g^2(Y)] = 1}} \mathbb{E}[f(X)\, g(Y)]$$

Equivalently, $\rho(X, Y) = \max_{f, g} \gamma(f(X), g(Y))$.
Given data X, Y, we want to find features of X and of Y that are maximally correlated; the optimizing f, g are called the HGR maximal correlation functions.
ρ(X, Y) satisfies (1)-(5).

Check (5): for one-to-one functions ξ, η,

$$\rho(\xi(X), \eta(Y)) = \max_{f, g} \gamma(f(\xi(X)), g(\eta(Y))) = \max_{f', g'} \gamma(f'(X), g'(Y)) = \rho(X, Y),$$

where the middle step substitutes f′ = f ∘ ξ and g′ = g ∘ η; since ξ, η are one-to-one, f′, g′ range over all functions exactly as f, g do.
Check (4): If Y = ξ(X), let K : Y → R be a one-to-one function such that E[K(Y)] = 0 and E[K²(Y)] = 1, and take f(X) = K(ξ(X)), g(Y) = K(Y). Then

$$\mathbb{E}[f(X)\, g(Y)] = \mathbb{E}[K(\xi(X))\, K(Y)] = \mathbb{E}[K^2(Y)] = 1 \ \Rightarrow\ \rho(X, Y) = 1.$$

(The case X = η(Y) is symmetric.)
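Revisiting the earlier X uniform on {−1, 0, 1}, Y = X² example (an illustrative check, not from the notes): Pearson gave γ = 0, but the construction above achieves E[f(X) g(Y)] = 1, so ρ(X, X²) = 1.

```python
import numpy as np

# X uniform on {-1, 0, 1}, Y = xi(X) = X^2, so P_Y = [1/3, 2/3] on {0, 1}
x = np.array([-1.0, 0.0, 1.0])
y = (x ** 2).astype(int)
Py = np.array([1/3, 2/3])
K = np.array([0.0, 1.0])        # any non-constant (hence one-to-one) K on {0, 1}
K = K - Py @ K                  # center: E[K(Y)] = 0
K = K / np.sqrt(Py @ K**2)      # scale:  E[K^2(Y)] = 1
f, g = K[y], K                  # f = K o xi (a function of x), g = K
print((f * g[y]).mean())        # E[f(X) g(Y)] = E[K^2(Y)] = 1.0
```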
Definition: The canonical dependence matrix $\tilde{B} \in \mathbb{R}^{|\mathcal{Y}| \times |\mathcal{X}|}$ is defined as

$$\tilde{B}(y, x) \triangleq \frac{P_{XY}(x, y) - P_X(x)\, P_Y(y)}{\sqrt{P_X(x)\, P_Y(y)}}, \qquad \forall x \in \mathcal{X},\ y \in \mathcal{Y}.$$
Example: X, Y are binary, with

$$P_{XY}(x, y) = \begin{cases} 1/3 & \text{if } (x, y) = (0, 0) \\ 1/6 & \text{if } (x, y) = (0, 1) \\ 1/6 & \text{if } (x, y) = (1, 0) \\ 1/3 & \text{if } (x, y) = (1, 1) \end{cases}$$

so that $P_X(0) = P_X(1) = P_Y(0) = P_Y(1) = \frac{1}{2}$.
$$\tilde{B}(0, 0) = \tilde{B}(1, 1) = \frac{\frac{1}{3} - \frac{1}{2} \cdot \frac{1}{2}}{\sqrt{\frac{1}{2} \cdot \frac{1}{2}}} = \frac{1}{6}, \qquad \tilde{B}(0, 1) = \tilde{B}(1, 0) = \frac{\frac{1}{6} - \frac{1}{2} \cdot \frac{1}{2}}{\sqrt{\frac{1}{2} \cdot \frac{1}{2}}} = -\frac{1}{6}$$
$$\tilde{B} = \begin{bmatrix} \frac{1}{6} & -\frac{1}{6} \\ -\frac{1}{6} & \frac{1}{6} \end{bmatrix}$$
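These entries can be reproduced mechanically (a sketch, not from the notes; the indexing B[y, x] follows the definition, with the joint stored as P[x, y]):

```python
import numpy as np

P = np.array([[1/3, 1/6],
              [1/6, 1/3]])             # joint PMF, indexed P[x, y]
Px, Py = P.sum(axis=1), P.sum(axis=0)
# B[y, x] = (P(x, y) - Px(x) Py(y)) / sqrt(Px(x) Py(y))
B = (P.T - np.outer(Py, Px)) / np.sqrt(np.outer(Py, Px))
print(B)  # [[ 1/6, -1/6], [-1/6, 1/6]], matching the matrix above
```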
Definition: The information vectors φ, ψ associated to functions f, g are defined as

$$\phi(x) \triangleq \sqrt{P_X(x)}\, f(x), \qquad \psi(y) \triangleq \sqrt{P_Y(y)}\, g(y).$$

Whenever functions f, g are given, we have the corresponding φ, ψ.

Properties (writing $\sqrt{P_X}$ for the vector $(\sqrt{P_X(x)})_{x \in \mathcal{X}}$, and similarly $\sqrt{P_Y}$):

Norm-square:

$$\|\phi\|^2 = \sum_x \phi^2(x) = \sum_x P_X(x)\, f^2(x) = \mathbb{E}[f^2(X)], \qquad \|\psi\|^2 = \mathbb{E}[g^2(Y)]$$

Inner product:

$$\langle \phi, \sqrt{P_X} \rangle = \sum_x \phi(x)\, \sqrt{P_X(x)} = \sum_x P_X(x)\, f(x) = \mathbb{E}[f(X)]$$

$$\langle \psi, \sqrt{P_Y} \rangle = \sum_y \psi(y)\, \sqrt{P_Y(y)} = \sum_y P_Y(y)\, g(y) = \mathbb{E}[g(Y)]$$
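A quick numerical sanity check of these identities (illustrative; the PMF and f here are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(1)
Px = np.array([0.2, 0.3, 0.5])     # an arbitrary PMF on X
f = rng.standard_normal(3)         # an arbitrary feature f: X -> R
phi = np.sqrt(Px) * f              # information vector phi(x) = sqrt(Px(x)) f(x)
assert np.isclose(phi @ phi, Px @ f**2)        # ||phi||^2       = E[f^2(X)]
assert np.isclose(phi @ np.sqrt(Px), Px @ f)   # <phi, sqrt(Px)> = E[f(X)]
```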
Theorem: The HGR maximal correlation ρ(X, Y) is the largest singular value of $\tilde{B}$.
Proof: Note that

$$\mathbb{E}[f(X)\, g(Y)] = \sum_{x,y} P_{XY}(x, y)\, f(x)\, g(y) = \sum_{x,y} \left( \frac{P_{XY}(x, y)}{\sqrt{P_X(x) P_Y(y)}} - \frac{P_X(x) P_Y(y)}{\sqrt{P_X(x) P_Y(y)}} \right) \Big( \sqrt{P_X(x)}\, f(x) \Big) \Big( \sqrt{P_Y(y)}\, g(y) \Big),$$

where the subtracted term contributes zero because

$$\sum_{x,y} \frac{P_X(x) P_Y(y)}{\sqrt{P_X(x) P_Y(y)}} \Big( \sqrt{P_X(x)}\, f(x) \Big) \Big( \sqrt{P_Y(y)}\, g(y) \Big) = \sum_{x,y} P_X(x) f(x)\, P_Y(y) g(y) = \mathbb{E}[f(X)]\, \mathbb{E}[g(Y)] = 0.$$

Therefore

$$\mathbb{E}[f(X)\, g(Y)] = \sum_{x,y} \underbrace{\frac{P_{XY}(x, y) - P_X(x) P_Y(y)}{\sqrt{P_X(x) P_Y(y)}}}_{\tilde{B}(y, x)} \underbrace{\Big( \sqrt{P_X(x)}\, f(x) \Big)}_{\phi(x)} \underbrace{\Big( \sqrt{P_Y(y)}\, g(y) \Big)}_{\psi(y)} = \sum_{x,y} \tilde{B}(y, x)\, \phi(x)\, \psi(y) = \psi^T \tilde{B} \phi.$$
$$\Rightarrow\ \rho(X, Y) = \max_{\phi, \psi}\ \psi^T \tilde{B} \phi$$

subject to: $\|\phi\|^2 = \|\psi\|^2 = 1$ ($\leftrightarrow \mathbb{E}[f^2(X)] = \mathbb{E}[g^2(Y)] = 1$) and $\langle \phi, \sqrt{P_X} \rangle = \langle \psi, \sqrt{P_Y} \rangle = 0$ ($\leftrightarrow \mathbb{E}[f(X)] = \mathbb{E}[g(Y)] = 0$).
Fact 1: $\arg\max_{\|\phi\|^2 = \|\psi\|^2 = 1} \psi^T \tilde{B} \phi$ is attained at the largest right/left singular vectors of $\tilde{B}$.
Fact 2: singular vectors are orthogonal to each other.
Check:

$$[\tilde{B} \cdot \sqrt{P_X}](y) = \sum_x \tilde{B}(y, x)\, \sqrt{P_X(x)} = \sum_x \frac{P_{XY}(x, y) - P_X(x) P_Y(y)}{\sqrt{P_Y(y)}} = \frac{P_Y(y) - P_Y(y)}{\sqrt{P_Y(y)}} = 0$$

⇒ $\sqrt{P_X}$ is a right singular vector of $\tilde{B}$ with singular value 0, since $\tilde{B} \cdot \sqrt{P_X} = 0$. Similarly, $\sqrt{P_Y}$ is a left singular vector of $\tilde{B}$ with singular value 0.
⇒ ρ(X, Y) = $\sigma_1$ (the largest singular value of $\tilde{B}$), with "=" attained iff φ and ψ are the largest right/left singular vectors of $\tilde{B}$. (By Fact 2, these are orthogonal to the null singular vectors $\sqrt{P_X}$, $\sqrt{P_Y}$, so they satisfy the constraints.)

⇒ Let $\phi_1$ and $\psi_1$ be the largest right/left singular vectors of $\tilde{B}$; then the optimal features are

$$f^*(x) = \frac{1}{\sqrt{P_X(x)}}\, \phi_1(x), \qquad g^*(y) = \frac{1}{\sqrt{P_Y(y)}}\, \psi_1(y).$$
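Putting the theorem to work on the binary example above (a sketch, not from the notes; `numpy.linalg.svd` returns singular values in decreasing order, and the recovered f*, g* are unique only up to a common sign):

```python
import numpy as np

P = np.array([[1/3, 1/6],
              [1/6, 1/3]])                   # joint PMF, indexed P[x, y]
Px, Py = P.sum(axis=1), P.sum(axis=0)
B = (P.T - np.outer(Py, Px)) / np.sqrt(np.outer(Py, Px))   # B[y, x]

U, s, Vt = np.linalg.svd(B)
print(s[0])                     # rho(X, Y) = sigma_1 = 1/3
print(B @ np.sqrt(Px))          # ~[0, 0]: sqrt(Px) is a null right singular vector
f_star = Vt[0] / np.sqrt(Px)    # f*(x) = phi_1(x) / sqrt(Px(x))
g_star = U[:, 0] / np.sqrt(Py)  # g*(y) = psi_1(y) / sqrt(Py(y))
print(f_star, g_star)           # here: [1, -1] and [1, -1], up to a common sign
print((P * np.outer(f_star, g_star)).sum())  # E[f*(X) g*(Y)] = 1/3
```

The second print confirms numerically that $\sqrt{P_X}$ lies in the null space of $\tilde{B}$, as checked above.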
Check (3): ρ(X, Y) = 0 iff $\sigma_1 = 0$ iff $\tilde{B} = 0$ iff $P_{XY}(x, y) = P_X(x) P_Y(y)$ for all x, y iff X, Y are independent.