Principal Component Analysis
(PCA)
www.csce.uark.edu/~xintaowu/5073/PCA1.ppt
Data Reduction
summarization of data with many (p)
variables by a smaller set of (k) derived
(synthetic, composite) variables.
[Diagram: a data matrix A with p variables reduced to a matrix X of k derived variables]
Data Reduction
Residual variation is information in A
that is not retained in X
balancing act between
clarity of representation, ease of
understanding
oversimplification: loss of important or
relevant information.
Principal Component Analysis
(PCA)
probably the most widely used and well-known of the standard multivariate
methods
invented by Pearson (1901) and Hotelling
(1933)
first applied in ecology by Goodall (1954)
under the name factor analysis (principal
factor analysis is a synonym of PCA).
Principal Component Analysis
(PCA)
takes a data matrix of n objects by p
variables, which may be correlated, and
summarizes it by uncorrelated axes
(principal components or principal axes)
that are linear combinations of the
original p variables
the first k components display as much as
possible of the variation among objects.
Geometric Rationale of PCA
objects are represented as a cloud of n
points in a multidimensional space with an
axis for each of the p variables
the centroid of the points is defined by
the mean of each variable
the variance of each variable is the
average squared deviation of its n values
around the mean of that variable.
$V_i = \frac{1}{n-1}\sum_{m=1}^{n}(X_{im} - \bar{X}_i)^2$
Geometric Rationale of PCA
degree to which the variables are linearly
correlated is represented by their
covariances.
$C_{ij} = \frac{1}{n-1}\sum_{m=1}^{n}(X_{im} - \bar{X}_i)(X_{jm} - \bar{X}_j)$
where $C_{ij}$ is the covariance of variables i and j, the sum runs over all n objects, $X_{im}$ is the value of variable i in object m, and $\bar{X}_i$ is the mean of variable i.
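A minimal NumPy sketch of these two formulas; the small data matrix is illustrative, not from the slides:

```python
import numpy as np

X = np.array([[2.0, 1.0],
              [4.0, 3.0],
              [6.0, 2.0],
              [8.0, 6.0]])      # n = 4 objects (rows), p = 2 variables (columns)
n = X.shape[0]
Xbar = X.mean(axis=0)           # mean of each variable

# V_i = (1/(n-1)) * sum_m (X_im - Xbar_i)^2
V = ((X - Xbar) ** 2).sum(axis=0) / (n - 1)

# C_ij = (1/(n-1)) * sum_m (X_im - Xbar_i)(X_jm - Xbar_j)
C12 = ((X[:, 0] - Xbar[0]) * (X[:, 1] - Xbar[1])).sum() / (n - 1)

print(V, C12)
print(np.cov(X, rowvar=False))  # diagonal = variances, off-diagonal = covariance
```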
Geometric Rationale of PCA
objective of PCA is to rigidly rotate the
axes of this p-dimensional space to new
positions (principal axes) that have the
following properties:
ordered such that principal axis 1 has the
highest variance, axis 2 has the next highest
variance, .... , and axis p has the lowest
variance
covariance among each pair of the principal
axes is zero (the principal axes are
uncorrelated).
2D Example of PCA
variables X1 and X2 have positive covariance & each has a
similar variance.
$\bar{X}_1 = 8.35$   $\bar{X}_2 = 4.91$
$V_1 = 6.67$   $V_2 = 6.24$
$C_{1,2} = 3.42$
Configuration is Centered
each variable is adjusted to a mean of
zero (by subtracting the mean from each value).
Principal Components are Computed
PC 1 has the highest possible variance (9.88)
PC 2 has a variance of 3.03
PC 1 and PC 2 have zero covariance.
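A small sketch of these steps, using data drawn at random to roughly match the slide's means, variances, and covariance (the actual example dataset is not given here):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.multivariate_normal(mean=[8.35, 4.91],
                            cov=[[6.67, 3.42],
                                 [3.42, 6.24]], size=500)

Xc = X - X.mean(axis=0)               # centre each variable on zero
S = np.cov(Xc, rowvar=False)          # variance-covariance matrix
eigvals, eigvecs = np.linalg.eigh(S)  # eigenanalysis (ascending order)

scores = Xc @ eigvecs[:, ::-1]        # PC 1 first
print(scores.var(axis=0, ddof=1))     # PC variances (roughly 9.88 and 3.03)
print(np.cov(scores, rowvar=False))   # off-diagonals ~ 0: PCs are uncorrelated
```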
The Dissimilarity Measure Used in PCA is
Euclidean Distance
PCA uses Euclidean Distance calculated
from the p variables as the measure of
dissimilarity among the n objects
PCA derives the best possible k-dimensional (k < p) representation of the
Euclidean distances among objects.
Generalization to p-dimensions
In practice nobody uses PCA with only 2
variables
The algebra for finding principal axes
readily generalizes to p variables
PC 1 is the direction of maximum variance
in the p-dimensional cloud of points
PC 2 is in the direction of the next
highest variance, subject to the
constraint that it has zero covariance
with PC 1.
Generalization to p-dimensions
PC 3 is in the direction of the next highest
variance, subject to the constraint that it
has zero covariance with both PC 1 and PC 2
and so on... up to PC p
each principal axis is a linear combination of the original p variables
$PC_i = a_{i1}Y_1 + a_{i2}Y_2 + \cdots + a_{ip}Y_p$
the $a_{ij}$'s are the coefficients for factor i, multiplied by the measured value for variable j
PC axes are a rigid rotation of the original variables
PC 1 is simultaneously the direction of maximum variance
and a least-squares line of best fit (squared distances
of points away from PC 1 are minimized).
Generalization to p-dimensions
if we take the first k principal components, they
define the k-dimensional hyperplane of best fit
to the point cloud
of the total variance of all p variables:
PCs 1 to k represent the maximum possible proportion of
that variance that can be displayed in k dimensions
i.e. the squared Euclidean distances among points
calculated from their coordinates on PCs 1 to k are the
best possible representation of their squared Euclidean
distances in the full p dimensions.
Covariance vs Correlation
using covariances among variables only
makes sense if they are measured in the
same units
even then, variables with high variances will
dominate the principal components
these problems are generally avoided by
standardizing each variable to unit variance
and zero mean.
$X'_{im} = \frac{X_{im} - \bar{X}_i}{SD_i}$
where $SD_i$ is the standard deviation of variable i and $\bar{X}_i$ is its mean.
Covariance vs Correlation
covariances between the standardized
variables are correlations
after standardization, each variable has a
variance of 1.000
correlations can be also calculated from the
variances and covariances:
$r_{ij} = \frac{C_{ij}}{\sqrt{V_i V_j}}$
where $r_{ij}$ is the correlation between variables i and j, $C_{ij}$ is their covariance, and $V_i$ and $V_j$ are their variances.
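A short sketch showing both routes to the correlations, with illustrative data: standardise the variables, or divide each covariance by $\sqrt{V_i V_j}$:

```python
import numpy as np

X = np.array([[2.0, 1.0],
              [4.0, 3.0],
              [6.0, 2.0],
              [8.0, 6.0]])                         # illustrative data

Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)   # standardised variables

C = np.cov(X, rowvar=False)                        # covariances and variances
R = C / np.sqrt(np.outer(np.diag(C), np.diag(C)))  # r_ij = C_ij / sqrt(V_i * V_j)

print(np.cov(Z, rowvar=False))                     # covariances of standardised variables
print(R)                                           # ... equal the correlations
print(np.corrcoef(X, rowvar=False))                # NumPy's built-in check
```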
The Algebra of PCA
first step is to calculate the cross-products
matrix of variances and covariances (or
correlations) among every pair of the p
variables
square, symmetric matrix
diagonals are the variances, off-diagonals
are the covariances.
Variance-covariance Matrix
        X1       X2
X1   6.6707   3.4170
X2   3.4170   6.2384

Correlation Matrix
        X1       X2
X1   1.0000   0.5297
X2   0.5297   1.0000
The Algebra of PCA
in matrix notation, this is computed as
$S = \frac{1}{n-1}X'X$
where X is the n x p data matrix, with each
variable centered (also standardized by SD if using
correlations).
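A NumPy sketch of this matrix form, with illustrative data (standardise the columns, rather than merely centring them, to obtain the correlation matrix instead):

```python
import numpy as np

X = np.array([[2.0, 1.0],
              [4.0, 3.0],
              [6.0, 2.0],
              [8.0, 6.0]])
n, p = X.shape

Xc = X - X.mean(axis=0)          # centre each variable
S = Xc.T @ Xc / (n - 1)          # p x p cross-products (variance-covariance) matrix

print(S)
print(np.trace(S))               # trace = total variance in the data
print(np.cov(X, rowvar=False))   # agrees with NumPy's covariance routine
```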
Manipulating Matrices
transposing: changes the columns to rows and the rows to columns

X =
10  0  4
 7  1  2

X' =
10  7
 0  1
 4  2

multiplying matrices:
must have the same number of columns in the premultiplicand matrix as the number of rows in the postmultiplicand matrix
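A tiny sketch of these two operations, using the matrix shown above:

```python
import numpy as np

X = np.array([[10, 0, 4],
              [ 7, 1, 2]])   # 2 x 3

Xt = X.T                     # 3 x 2 transpose: columns become rows
print(Xt)

print(X @ Xt)                # (2 x 3)(3 x 2) -> 2 x 2: dimensions are conformable
# Xt @ Xt would raise an error: a 3 x 2 matrix cannot premultiply a 3 x 2 matrix.
```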
The Algebra of PCA
sum of the diagonals of the variance-covariance matrix is called the trace
it represents the total variance in the data
it is the mean squared Euclidean distance between each object and the centroid in p-dimensional space.
Variance-covariance Matrix
        X1       X2
X1   6.6707   3.4170
X2   3.4170   6.2384
Trace = 12.9091

Correlation Matrix
        X1       X2
X1   1.0000   0.5297
X2   0.5297   1.0000
Trace = 2.0000
The Algebra of PCA
finding the principal axes involves
eigenanalysis of the cross-products matrix
(S)
the eigenvalues (latent roots) of S are the solutions ($\lambda$) to the characteristic equation
$|S - \lambda I| = 0$
The Algebra of PCA
the eigenvalues $\lambda_1, \lambda_2, \ldots, \lambda_p$ are the variances of the
coordinates on each principal component axis
the sum of all p eigenvalues equals the trace
of S (the sum of the variances of the
original variables).
        X1       X2
X1   6.6707   3.4170
X2   3.4170   6.2384
Trace = 12.9091

$\lambda_1 = 9.8783$
$\lambda_2 = 3.0308$
Note: $\lambda_1 + \lambda_2 = 12.9091$ = Trace
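A sketch of this eigenanalysis for the example matrix S, using NumPy's eigensolver rather than solving the characteristic equation by hand:

```python
import numpy as np

S = np.array([[6.6707, 3.4170],
              [3.4170, 6.2384]])

eigvals, eigvecs = np.linalg.eigh(S)   # eigh: eigensolver for symmetric matrices
eigvals = eigvals[::-1]                # largest first

print(eigvals)                         # approx. [9.8783, 3.0308]
print(eigvals.sum(), np.trace(S))      # both 12.9091: eigenvalues sum to the trace
```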
The Algebra of PCA
each eigenvector consists of p values which
represent the contribution of each
variable to the principal component axis
eigenvectors are uncorrelated (orthogonal)
their cross-products are zero.
Eigenvectors
         u1        u2
X1   0.7291   -0.6844
X2   0.6844    0.7291

0.7291*(-0.6844) + 0.6844*0.7291 = 0
The Algebra of PCA
coordinates of each object i on the kth
principal axis, known as the scores on PC k,
are computed as
$z_{ki} = u_{1k}x_{1i} + u_{2k}x_{2i} + \cdots + u_{pk}x_{pi}$
where Z is the n x k matrix of PC scores, X
is the n x p centered data matrix and U is
the p x k matrix of eigenvectors.
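A sketch continuing the example: the eigenvectors of S, a check that they are orthogonal, and scores computed as Z = XU (the centred data matrix here is illustrative):

```python
import numpy as np

S = np.array([[6.6707, 3.4170],
              [3.4170, 6.2384]])
eigvals, U = np.linalg.eigh(S)
U = U[:, ::-1]                  # column 1 = PC 1, column 2 = PC 2
print(U)                        # approx. [[0.7291, -0.6844], [0.6844, 0.7291]] (signs may flip)
print(U.T @ U)                  # identity matrix: the eigenvectors are orthogonal

Xc = np.array([[ 1.0,  2.0],
               [-3.0,  0.5],
               [ 2.0, -2.5]])   # any centred n x p data matrix (illustrative)
Z = Xc @ U                      # n x k matrix of PC scores
print(Z)
```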
The Algebra of PCA
variance of the scores on each PC axis is
equal to the corresponding eigenvalue for
that axis
the eigenvalue represents the variance
displayed (explained or extracted) by
the kth axis
the sum of the first k eigenvalues is the
variance explained by the k-dimensional
ordination.
$\lambda_1 = 9.8783$
$\lambda_2 = 3.0308$
Trace = 12.9091

PC 1 displays ("explains") 9.8783/12.9091 = 76.5% of the total variance
The Algebra of PCA
The cross-products matrix computed among
the p principal axes has a simple form:
all off-diagonal values are zero (the principal
axes are uncorrelated)
the diagonal values are the eigenvalues.
        PC1      PC2
PC1   9.8783   0.0000
PC2   0.0000   3.0308

Variance-covariance Matrix of the PC axes
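A short numeric check of these two points, using the example eigenvalues:

```python
import numpy as np

eigvals = np.array([9.8783, 3.0308])
trace = eigvals.sum()                  # 12.9091

print(eigvals / trace)                 # [~0.765, ~0.235]: PC 1 explains ~76.5%

# For centred data whose variance-covariance matrix is S, the scores Z = Xc @ U
# have covariance diag(eigvals): zero off-diagonals, eigenvalues on the diagonal.
```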
A more challenging example
data from research on habitat definition in
the endangered Baw Baw frog
16 environmental and structural variables
measured at each of 124 sites
correlation matrix used because variables
have different units
Philoria frosti
Eigenvalues

Axis   Eigenvalue   % of Variance   Cumulative % of Variance
 1       5.855          36.60              36.60
 2       3.420          21.38              57.97
 3       1.122           7.01              64.98
 4       1.116           6.97              71.95
 5       0.982           6.14              78.09
 6       0.725           4.53              82.62
 7       0.563           3.52              86.14
 8       0.529           3.31              89.45
 9       0.476           2.98              92.42
10       0.375           2.35              94.77
Interpreting Eigenvectors
correlations between variables and the principal axes are known as loadings
each element of the eigenvectors represents the contribution of a given variable to a component
Variable     Axis 1    Axis 2    Axis 3
Altitude     0.3842    0.0659   -0.1177
pH          -0.1159    0.1696   -0.5578
Cond        -0.2729   -0.1200    0.3636
TempSurf     0.0538   -0.2800    0.2621
Relief      -0.0765    0.3855   -0.1462
maxERht      0.0248    0.4879    0.2426
avERht       0.0599    0.4568    0.2497
%ER          0.0789    0.4223    0.2278
%VEG         0.3305   -0.2087   -0.0276
%LIT        -0.3053    0.1226    0.1145
%LOG        -0.3144    0.0402   -0.1067
%W          -0.0886   -0.0654   -0.1171
H1Moss       0.1364   -0.1262    0.4761
DistSWH     -0.3787    0.0101    0.0042
DistSW      -0.3494   -0.1283    0.1166
How many axes are needed?
does the (k+1)th principal axis represent
more variance than would be expected
by chance?
several tests and rules have been
proposed
a common rule of thumb when PCA is
based on correlations is that axes with
eigenvalues > 1 are worth interpreting
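A minimal sketch of this eigenvalue > 1 rule of thumb, applied to the eigenvalues tabulated above:

```python
import numpy as np

eigvals = np.array([5.855, 3.420, 1.122, 1.116, 0.982,
                    0.725, 0.563, 0.529, 0.476, 0.375])

keep = eigvals > 1.0   # each standardised variable contributes a variance of 1
print(keep.sum())      # 4 axes pass the rule of thumb
```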
What are the assumptions of PCA?
assumes relationships among variables are
LINEAR
cloud of points in p-dimensional space has linear
dimensions that can be effectively summarized
by the principal axes
if the structure in the data is NONLINEAR
(the cloud of points twists and curves its
way through p-dimensional space), the
principal axes will not be an efficient and
informative summary of the data.
When should PCA be used?
In community ecology, PCA is useful for
summarizing variables whose relationships
are approximately linear or at least
monotonic
e.g. a PCA of many soil properties might be used to extract a few components
that summarize the main dimensions of soil variation
PCA is generally NOT useful for
ordinating community data
Why? Because relationships among
species are highly nonlinear.
The Horseshoe or Arch Effect
community trends along environmental
gradients appear as horseshoes in PCA
ordinations
none of the PC axes effectively
summarizes the trend in species
composition along the gradient
SUs at opposite extremes of the gradient
appear relatively close together.
[Figures: "Ambiguity of Absence"; PCA ordination of simulated community data (Axis 1 vs. Axis 2), beta diversity = 2R, covariance matrix]
The Horseshoe Effect
curvature of the gradient and the degree
of infolding of the extremes increase with
beta diversity
PCA ordinations are not useful summaries
of community data except when beta
diversity is very low
using correlation generally does better than
covariance
this is because standardization by species
improves the correlation between Euclidean
distance and environmental distance.
What if there's more than one underlying
ecological gradient?
The Horseshoe Effect
when there are two or more underlying gradients with high beta diversity, a
horseshoe is usually not detectable
the SUs fall on a curved hypersurface that
twists and turns through the p-dimensional
species space
interpretation problems are more severe
PCA should NOT be used with community
data (except maybe when beta diversity is very low).
Impact on Ordination History
by 1970 PCA was the ordination method of
choice for community data
simulation studies by Swan (1970) &
Austin & Noy-Meir (1971) demonstrated
the horseshoe effect and showed that the
linear assumption of PCA was not
compatible with the nonlinear structure of
community data
stimulated the quest for more appropriate
ordination methods.