Lecture 13
Principal Components Analysis and Factor
Analysis
Prof. Dr. Svetlozar Rachev
Institute for Statistics and Mathematical Economics
University of Karlsruhe
Financial Econometrics, Summer Semester 2007
Prof. Dr. Svetlozar Rachev, Institute for Statistics and Mathematical Economics, University of Karlsruhe. Lecture 13: Principal Components Analysis and Factor Analysis.
Copyright
These lecture notes cannot be copied and/or distributed without permission.
The material is based on the textbook:
Financial Econometrics: From Basics to Advanced Modeling Techniques
(Wiley Finance, Frank J. Fabozzi Series)
by Svetlozar T. Rachev, Stefan Mittnik, Frank Fabozzi, Sergio M. Focardi, Teo Jašić.
Outline
I Factor models.
I Principal components analysis.
I Factor analysis.
I PCA and factor analysis compared.
Factor Models
I Factor models are statistical models that try to explain
complex phenomena through a small number of basic causes
or factors.
I Factor models serve two main purposes:
1. They reduce the dimensionality of models to make
estimation possible;
2. They find the true causes that drive data.
I Factor models were introduced by Charles Spearman in 1904.
Factor Models
I The Spearman model explains intellectual abilities through one
common factor, the famous "general intelligence" g factor,
plus another factor s which is specific to each distinct ability.
I Louis Leon Thurstone developed the first true multifactor
model of intelligence, which identified the following seven
primary mental abilities: Verbal Comprehension, Word Fluency,
Number Facility, Spatial Visualization, Associative Memory,
Perceptual Speed, and Reasoning.
Factor Models
I In the early applications of factor models to psychometrics,
the statistical model was essentially a conditional multivariate
distribution. The objective was to explain psychometric tests
as probability distributions conditional on the value of one or
more factors. In this way, one can make predictions of, for
example, the future success of young individuals in different
activities.
I In economics, factor models are typically applied to time
series. The objective is to explain the behavior of a large
number of stochastic processes, typically price, returns, or rate
processes, in terms of a small number of factors. These
factors are themselves stochastic processes.
Factor Models
I In order to simplify both modeling and estimation, most factor
models employed in financial econometrics are static models.
This means that time series are assumed to be sequences of
temporally independent and identically distributed (IID)
random variables so that the series can be thought of as
independent samples drawn from one common distribution.
I In financial econometrics, factor models are needed not only
to explain data but also to make estimation feasible. Factor
models are able to explain all pairwise correlations in terms of
a much smaller number of correlations between factors.
Linear Factor Models Equations
Linear factor models are regression models of the following type:

Xi = αi + Σ_{j=1}^K βij fj + εi

where
Xi = a set of N random variables
fj = a set of K common factors
εi = the noise term associated with each variable Xi
βij = the factor loadings or factor sensitivities, which express
the influence of the j-th factor on the i-th variable.
Note: In this formulation, factor models are essentially static
models, but it is possible to add dynamics to both the variables
and the factors.
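As a concrete sketch, the linear factor model above can be simulated with NumPy; all dimensions and parameter values below are invented for the example and are not taken from the text:

```python
import numpy as np

rng = np.random.default_rng(0)
N, K, T = 6, 2, 1000  # illustrative sizes: 6 variables, 2 factors, 1000 observations

alpha = rng.normal(size=N)           # intercepts alpha_i
beta = rng.normal(size=(N, K))       # N x K factor loadings beta_ij
f = rng.normal(size=(T, K))          # factor realizations f_j
eps = 0.1 * rng.normal(size=(T, N))  # idiosyncratic noise eps_i

# X_i = alpha_i + sum_j beta_ij f_j + eps_i, for each observation
X = alpha + f @ beta.T + eps

print(X.shape)  # (1000, 6)
```

Each row of `X` is one draw of the N-vector of variables; the same loadings `beta` apply at every date, which is exactly the static-model assumption discussed above.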
Factor Models
I One of the key objectives of factor models is that the
covariances between the variables Xi be determined only by the
covariances between the factors.
I Suppose that the noise terms are mutually uncorrelated, so that

E(εi εj) = 0 for i ≠ j, and E(εi εj) = σi² for i = j,

and that the noise terms are uncorrelated with the factors,
that is, E(εi fj) = 0 for all i, j.
I Suppose also that both factors and noise terms have zero
mean, so that E(Xi) = αi.
Factor models that respect the above constraints are called strict
factor models.
Factor Models
Let's compute the covariances of a strict factor model:

E((Xi − αi)(Xj − αj)) = E[(Σ_{s=1}^K βis fs + εi)(Σ_{t=1}^K βjt ft + εj)]
= E[Σ_{s=1}^K Σ_{t=1}^K βis fs βjt ft] + E[(Σ_{s=1}^K βis fs) εj]
+ E[εi (Σ_{t=1}^K βjt ft)] + E(εi εj)
= Σ_{s,t} βis E(fs ft) βjt + E(εi εj)

where the two middle terms vanish because the noise terms are
uncorrelated with the factors.
Factor Models
We can express the above compactly in matrix form. Let's write a
factor model in matrix form as follows:

X = α + βf + ε

where
X = (X1, . . . , XN)′ = the N-vector of variables
α = (α1, . . . , αN)′ = the N-vector of means
ε = (ε1, . . . , εN)′ = the N-vector of idiosyncratic noise terms
f = (f1, . . . , fK)′ = the K-vector of factors
β = [βij] = the N × K matrix of factor loadings, with rows
i = 1, . . . , N and columns j = 1, . . . , K.
Factor Models
I Let's define the following:
Σ = the N × N variance-covariance matrix of the variables X
Ω = the K × K variance-covariance matrix of the factors f
Ψ = the N × N variance-covariance matrix of the error terms ε.
I If we assume that our model is a strict factor model, the
matrix Ψ will be a diagonal matrix with the noise variances on
the diagonal, that is,

Ψ = diag(Ψ1², . . . , ΨN²)

I We can express the variance-covariance matrix of the variables
in the following way:

Σ = βΩβ′ + Ψ
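A quick numerical check of the identity Σ = βΩβ′ + Ψ, using invented β, Ω, and Ψ (a sketch under the strict-factor-model assumptions, not data from the text):

```python
import numpy as np

rng = np.random.default_rng(1)
N, K = 5, 2  # illustrative dimensions

beta = rng.normal(size=(N, K))           # N x K factor loadings
A = rng.normal(size=(K, K))
Omega = A @ A.T + K * np.eye(K)          # a valid K x K factor covariance matrix
Psi = np.diag(rng.uniform(0.1, 0.5, N))  # diagonal idiosyncratic covariance

# covariance of X implied by the strict factor model
Sigma = beta @ Omega @ beta.T + Psi

assert np.allclose(Sigma, Sigma.T)            # Sigma is symmetric
assert np.all(np.linalg.eigvalsh(Sigma) > 0)  # and positive definite
```

Because Ψ is diagonal, all off-diagonal entries of Σ come entirely from the K × K factor covariance Ω, which is the dimensionality reduction the slide describes.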
Factor Models
I In applied work, factor models will often be approximate
factor models. They allow idiosyncratic terms to be weakly
correlated among themselves and with the factors.
I As many different factor models have been proposed for
explaining stock returns, an important question is whether a
factor model is fully determined by the observed time series.
I An estimation procedure cannot uniquely determine the
hidden factors and the factor loadings from the observable
variables X.
Factor Models
In fact, suppose that we multiply the factors by any nonsingular
matrix R. We obtain other factors

g = Rf

with covariance matrix

Ωg = RΩR′

and we can write a new factor model:

X = α + βf + ε = α + (βR⁻¹)(Rf) + ε = α + βg g + ε, where βg = βR⁻¹
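This indeterminacy can be verified numerically: with an arbitrary nonsingular R (the particular matrix below is chosen only for illustration), the systematic part of the model is unchanged when factors are rotated and loadings adjusted:

```python
import numpy as np

rng = np.random.default_rng(2)
N, K, T = 4, 2, 3
beta = rng.normal(size=(N, K))  # original loadings
f = rng.normal(size=(K, T))     # factor values at T dates
R = np.array([[2.0, 1.0],
              [0.0, 1.0]])      # any nonsingular K x K matrix

g = R @ f                        # new factors g = R f
beta_g = beta @ np.linalg.inv(R) # adjusted loadings beta_g = beta R^-1

# beta f and beta_g g produce identical fitted values
assert np.allclose(beta @ f, beta_g @ g)
```

Any such pair (βg, g) fits the data exactly as well as (β, f), which is why additional identification conditions (orthonormal factors, etc.) are needed.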
Factor Models
I In order to solve this indeterminacy, we can always choose the
matrix R so that the factors g are a set of orthonormal
variables, that is, uncorrelated variables (the orthogonality
condition) with unit variance (the normality condition).
I In order to make the model uniquely identifiable, we can
stipulate that factors must be a set of orthonormal variables
and that, in addition, the matrix of factor loadings is diagonal.
I Under this additional assumption, a strict factor model is
called a normal factor model. The model is still undetermined
under rotation, that is, multiplication by any orthogonal
matrix R such that RR′ = I.
Factor Models
Summary:
I A set of variables has a normal factor representation if it is
represented by the following factor model:

X = α + βf + ε

where the factors are orthonormal variables and the noise terms
are such that the covariance matrix can be represented as follows:

Σ = ββ′ + Ψ

where β is the diagonal matrix of factor loadings and Ψ is a
diagonal matrix.
I Approximate factor models are uniquely identifiable only in
the limit of an infinite number of series.
Factor Models: Types of Factors and Their Estimation
In financial econometrics, the factors used in factor models can
belong to three different categories:
I Macroeconomic factors
I Fundamental factors
I Statistical factors
Macroeconomic factors are macroeconomic variables that are
believed to determine asset returns (Example: GNP, the inflation
rate, the unemployment rate, or the steepness of the yield curve).
Fundamental factors are variables that derive from financial
analysis.
Statistical factors are factors that derive from a mathematical
process.
Factor Models: Types of Factors and Their Estimation
I Macroeconomic factors are exogenous factors that must be
estimated as variables exogenous to the factor model. They
influence the model variables but are not influenced by them.
I A factor model is estimated as a linear regression model. Such
a regression can always be fitted, whether or not there is
indeed a linear relationship between the factors and the
model variables.
I If there is not, the model will have no explanatory power: the
variance of each variable that is not explained by the common
factors appears as noise.
I Adding factors might improve the explanatory power of the
model but, in general, worsens the ability to estimate the
model because there are more parameters to estimate.
There is a trade-off between adding explanatory factors and
the ability to estimate them.
Factor Models: Types of Factors and Their Estimation
I Statistical factors are obtained through a logical process of
analysis of the given variables.
I Statistical factors are factors that are endogenous to the
system. They are typically determined with one of two
statistical processes; namely, principal component analysis or
factor analysis.
I Note that factors defined through statistical analysis are linear
combinations of the variables.
Principal Components Analysis
I Principal components analysis (PCA) was introduced in 1933
by Harold Hotelling as a way to determine factors with
statistical learning techniques when factors are not
exogenously given.
I Given a variance-covariance matrix, one can determine factors
using the technique of PCA.
I The concept of PCA is the following.
Consider a set of n stationary time series Xi .
Consider next a linear combination of these series, that is, a
portfolio of securities. Each portfolio P is identified by an
n-vector of weights ωP and is characterized by a variance σP2 .
Lastly, consider a normalized portfolio, which has the largest
possible variance. In this context, a normalized portfolio is a
portfolio such that the squares of the weights sum to one.
Principal Components Analysis
I If we assume that returns are IID sequences, jointly normally
distributed with variance-covariance matrix σ, a direct
calculation demonstrates that each portfolio's return will be
normally distributed with variance

σP² = ωP′ σ ωP

I The normalized portfolio of maximum variance can therefore
be determined in the following way: maximize ωP′ σ ωP
subject to the normalization condition

ωP′ ωP = 1

where the product is a scalar product.
I It can be demonstrated that the solution of this problem is the
eigenvector ω1 corresponding to the largest eigenvalue λ1 of
the variance-covariance matrix σ.
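This maximization can be sketched numerically: the eigenvector of the largest eigenvalue of the covariance matrix attains the maximum variance among all normalized portfolios. The simulated return data below are invented for the illustration:

```python
import numpy as np

rng = np.random.default_rng(3)
# simulated correlated returns: 500 observations of 4 series
returns = rng.normal(size=(500, 4)) @ rng.normal(size=(4, 4))
sigma = np.cov(returns, rowvar=False)  # 4 x 4 variance-covariance matrix

eigvals, eigvecs = np.linalg.eigh(sigma)  # eigenvalues in ascending order
w1 = eigvecs[:, -1]                       # eigenvector of the largest eigenvalue

# w1 is normalized, and no other normalized portfolio has larger variance
assert np.isclose(w1 @ w1, 1.0)
for _ in range(100):
    w = rng.normal(size=4)
    w /= np.linalg.norm(w)  # a random normalized portfolio
    assert w @ sigma @ w <= w1 @ sigma @ w1 + 1e-9
```

The inequality in the loop is the Rayleigh-quotient property: ω′σω ≤ λ1 for every ω with ω′ω = 1, with equality at ω = ω1.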
Principal Components Analysis
I Consider next the set of all normalized portfolios orthogonal
to ω1 , that is, portfolios completely uncorrelated with ω1 .
These portfolios are identified by the following relationship:
ω1′ ωP = ωP′ ω1 = 0
Among this set, the portfolio of maximum variance is given by
the eigenvector ω2 corresponding to the second largest
eigenvalue λ2 of the variance-covariance matrix σ.
I If there are n distinct eigenvalues, we can repeat this process
n times. In this way, we determine the n portfolios Pi of
maximum variance. The weights of these portfolios are the
orthonormal eigenvectors of the variance-covariance matrix σ.
I These portfolios of maximum variance are all mutually
uncorrelated.
Principal Components Analysis
I It can be demonstrated that we can recover all the original
return time series as linear combinations of these portfolios:

Xj = Σ_{i=1}^n αj,i Pi

I Thus far we have succeeded in replacing the original n
correlated time series Xj with n uncorrelated time series Pi,
with the additional insight that each Xj is a linear
combination of the Pi.
I Suppose now that only p of the portfolios Pi have a significant
variance, while the remaining n − p have very small variances.
We can then implement a dimensionality reduction by
choosing only those portfolios whose variance is significantly
different from zero. Let's call these portfolios factors F.
Principal Components Analysis
I It is clear that we can approximately represent each series Xj
as a linear combination of the factors plus a small
uncorrelated noise. In fact we can write

Xj = Σ_{i=1}^p αj,i Fi + Σ_{i=p+1}^n αj,i Pi = Σ_{i=1}^p αj,i Fi + εj

where the last term is a noise term.
I Therefore, to implement PCA one computes the eigenvalues
and the eigenvectors of the variance-covariance matrix and
chooses the eigenvalues significantly different from zero.
I The corresponding eigenvectors are the weights of the
portfolios that form the factors. The criteria of choice are
somewhat arbitrary.
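A sketch of this dimensionality reduction on simulated data (the data-generating process and the choice p = 2 are invented for the example): the share of total variance carried by the top-p eigenvalues measures how good the approximation is:

```python
import numpy as np

rng = np.random.default_rng(4)
# illustrative data: 8 series driven by 2 dominant common components plus noise
T, n, p = 1000, 8, 2
common = rng.normal(size=(T, p)) @ rng.normal(scale=3.0, size=(p, n))
X = common + rng.normal(size=(T, n))

sigma = np.cov(X, rowvar=False)
eigvals = np.sort(np.linalg.eigvalsh(sigma))[::-1]  # descending eigenvalues

# fraction of total variance explained by the first p principal components
explained = eigvals[:p].sum() / eigvals.sum()
assert explained > 0.5
print(f"top-{p} components explain {explained:.1%} of total variance")
```

In practice the cutoff p is exactly the "somewhat arbitrary" choice the slide mentions: one keeps the components whose eigenvalues stand clearly above the rest.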
Principal Components Analysis
I Suppose, however, that there is a strict factor structure, which
means that returns follow a strict factor model as defined
earlier in this chapter:
r = a + βf + ε
I The matrix β can be obtained by diagonalizing the
variance-covariance matrix.
I In general, the structure of factors will not be strict and one
will try to find an approximation by choosing only the largest
eigenvalues.
Principal Components Analysis
I PCA works either on the variance-covariance matrix or on the
correlation matrix. The technique is the same but results are
generally different.
I PCA applied to the variance-covariance matrix is sensitive to
the units of measurement, which determine variances and
covariances. If PCA is applied to prices and not to returns,
the currency in which prices are expressed matters; one
obtains different results in different currencies. In these cases,
it might be preferable to work with the correlation matrix.
I PCA is a generalized dimensionality reduction technique
applicable to any set of multidimensional observations. It
admits a simple geometrical interpretation which can be easily
visualized in the three-dimensional case.
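The sensitivity to units can be sketched as follows: with series of very different scales (the scales below are invented), the covariance-based PCA is dominated by the largest-scale series, while the correlation matrix treats all series symmetrically:

```python
import numpy as np

rng = np.random.default_rng(7)
# three independent series with standard deviations 1, 10, and 100
X = rng.normal(size=(500, 3)) * np.array([1.0, 10.0, 100.0])

cov_pcs = np.linalg.eigvalsh(np.cov(X, rowvar=False))        # ascending
corr_pcs = np.linalg.eigvalsh(np.corrcoef(X, rowvar=False))  # ascending

# on the covariance matrix, the large-scale series dominates the first component
assert cov_pcs[-1] / cov_pcs.sum() > 0.9
# on the correlation matrix, total "variance" is just the number of series
assert np.isclose(corr_pcs.sum(), 3.0)
```

Rescaling any series (e.g., quoting a price in a different currency) changes the covariance eigenstructure but leaves the correlation matrix unchanged, which is why the correlation matrix is preferable in such cases.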
Principal Components Analysis: Illustration
I Performing PCA is equivalent to determining the eigenvalues
and eigenvectors of the covariance matrix or of the correlation
matrix. The two matrices yield different results. We perform
both exercises.
I We estimate the covariance with the empirical covariance
matrix. Recall that the empirical covariance σij between
variables (Xi , Xj ) is defined as follows:
σ̂ij = (1/T) Σ_{t=1}^T (Xi(t) − X̄i)(Xj(t) − X̄j)

X̄i = (1/T) Σ_{t=1}^T Xi(t),  X̄j = (1/T) Σ_{t=1}^T Xj(t)
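The estimator above (note the divisor T, not T − 1) can be sketched on simulated data; the sizes are invented for the example:

```python
import numpy as np

rng = np.random.default_rng(5)
T, n = 200, 3
X = rng.normal(size=(T, n))  # T observations of n series

Xbar = X.mean(axis=0)  # sample means of each series
# sigma_hat_ij = (1/T) * sum_t (X_i(t) - Xbar_i) * (X_j(t) - Xbar_j)
sigma_hat = (X - Xbar).T @ (X - Xbar) / T

# this matches NumPy's biased covariance estimator (divisor T rather than T - 1)
assert np.allclose(sigma_hat, np.cov(X, rowvar=False, bias=True))
```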
Principal Components Analysis: Illustration
For the complete exhibit, see page 441 of the textbook.
Principal Components Analysis: Illustration
For the complete exhibit, see page 442 of the textbook.
Principal Components Analysis: Illustration
For the complete exhibit, see page 443 of the textbook.
Principal Components Analysis: Illustration
I If we form portfolios whose weights are the eigenvectors, we
can form 10 portfolios that are orthogonal (i.e., uncorrelated).
These orthogonal portfolios are called principal components.
I The variance of each principal component will be equal to the
corresponding eigenvalue. Thus the first principal component
will have the maximum possible variance and the last principal
component will have the smallest variance.
Principal Components Analysis: Illustration
I The 10 principal components thus obtained are linear
combinations of the original series X = (X1, . . . , XN)′;
that is, they are obtained by multiplying X by the matrix of
the eigenvectors.
I If the eigenvalues and the corresponding eigenvectors are all
distinct we can apply the inverse transformation and recover
the X as linear combinations of the principal components.
I PCA is interesting if, in using only a small number of principal
components, we nevertheless obtain a good approximation.
So we regress the original series X onto a small number of
principal components. By choosing as factors the components
with the largest variance, we can explain a large portion of the
total variance of X.
Principal Components Analysis: Illustration
For the complete exhibit, see page 448 of the textbook.
PCA and Factor Analysis with Stable Distributions
I PCA and factor analysis can be applied provided that all
variances and covariances exist. Applying them does not
require that distributions are normal, but only that they have
finite variances and covariances.
I Variances and covariances are not robust; they are sensitive to
outliers. Robust equivalents of variances and covariances exist.
I In order to make PCA and factor analysis insensitive to
outliers, one could use robust versions of variances and
covariances and apply PCA and factor analysis to these robust
estimates.
PCA and Factor Analysis with Stable Distributions
I In many cases, however, distributions might exhibit fat tails
and infinite variances. In this case, large values cannot be
trimmed but must be taken into proper consideration.
However, if variances and covariances are not finite, the
least-squares methods used to estimate factor loadings cannot
be applied; nor can the concepts behind PCA and factor
analysis.
I In fact, if distributions have infinite variances, it does not
make sense to determine the portfolio of maximum variance
as all portfolios will have infinite variance and it will be
impossible, in general, to determine an ordering based on the
size of variance.
Factor Analysis
I Here we consider an alternative technique for determining
factors: factor analysis (FA).
I Suppose we are given T independent samples of a random
vector X = (X1 , . . . , XN )0 , N time series of stock returns at T
moments, as in the case of PCA.
I Assuming that the data are described by a strict factor model
with K factors, the objective of factor analysis (FA) consists
of determining a model of the type
X = α + βf + ε
with covariance matrix
Σ = ββ′ + Ψ
Factor Analysis
I The estimation procedure is performed in two steps:
1. We estimate the covariance matrix and the factor
loadings.
2. We estimate factors using the covariance matrix and the
factor loadings.
I Assuming that the variables are jointly normally distributed
and temporally independently and identically distributed (IID),
we can estimate the covariance matrix with maximum
likelihood methods. Iterative methods such as the expectation
maximization (EM) algorithm are generally used for the
estimation of factor models.
Factor Analysis
I After estimating the matrices β and Ψ factors can be
estimated as linear regressions.
I In fact, assuming that the factors have zero mean, we can write
the factor model as

X − α = βf + ε

which shows that, at any given time, the factors can be estimated
as the regression coefficients of the regression of (X − α) onto β.
I Using the standard formulas of regression analysis we can now
write factors, at any given time, as follows:
f̂t = (β̂′ Ψ̂⁻¹ β̂)⁻¹ β̂′ Ψ̂⁻¹ (Xt − α̂)
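A sketch of this second step with invented β̂, Ψ̂, and α̂: in the noiseless case below, the cross-sectional GLS regression recovers the factors exactly:

```python
import numpy as np

rng = np.random.default_rng(6)
N, K = 8, 2
beta_hat = rng.normal(size=(N, K))            # estimated loadings (hypothetical)
psi_hat = np.diag(rng.uniform(0.05, 0.2, N))  # estimated idiosyncratic variances
alpha_hat = rng.normal(size=N)                # estimated means

f_true = rng.normal(size=K)              # factor values at some date t
x_t = alpha_hat + beta_hat @ f_true      # noiseless observation, for the check

psi_inv = np.linalg.inv(psi_hat)
# f_hat_t = (beta' Psi^-1 beta)^-1 beta' Psi^-1 (x_t - alpha)
f_hat = np.linalg.solve(beta_hat.T @ psi_inv @ beta_hat,
                        beta_hat.T @ psi_inv @ (x_t - alpha_hat))

# with no idiosyncratic noise, the GLS regression recovers the factors exactly
assert np.allclose(f_hat, f_true)
```

With noise present, f̂t is the GLS estimate, which weights each variable by the inverse of its idiosyncratic variance.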
Factor Analysis
I Maximum likelihood estimation assumes that the number of
factors is known. In practice, the correct number of factors is
determined heuristically: a model with q factors is accepted
when its estimates stabilize and cannot be rejected on the
basis of p-values.
I The factor loadings matrix can also be estimated with
ordinary least squares (OLS) methods.
I OLS estimates of the factor loadings are inconsistent when
the number of time points goes to infinity but the number of
series remains finite, unless we assume that the idiosyncratic
noise terms all have the same variance.
I The OLS estimators remain consistent when both the number
of processes and the number of time points go to infinity.
So, to perform factor analysis, we need to estimate only the factor
loadings and the idiosyncratic variances of noise terms.
An Illustration of Factor Analysis
(For the exhibits of this illustration, see the textbook.)
Applying PCA to Bond Portfolio Management
There are two applications in bond portfolio management where
PCA has been employed.
I The first application is explaining the movement or dynamics
in the yield curve and then applying the resulting principal
components to measure and manage yield curve risk.
I The second application of PCA is to identify risk factors
beyond changes in the term structure.
For example, given historical bond returns and factors that are
believed to affect bond returns, PCA can be used to obtain
principal components that are linear combinations of the variables
that explain the variation in returns.
Using PCA to Control Interest Rate Risk
I Using PCA, several studies have investigated the factors that
have affected the historical returns on Treasury portfolios.
I Robert Litterman and Jose Scheinkman found that three
factors explained historical bond returns for U.S. Treasury
zero-coupon securities:
1. The changes in the level of rates;
2. The changes in the slope of the yield curve;
3. The changes in the curvature of the yield curve.
I After identifying the factors, Litterman and Scheinkman used
regression analysis to assess the relative contribution of these
three factors in explaining the returns on zero-coupon
Treasury securities of different maturities.
Using PCA to Control Interest Rate Risk
I On average, the first principal component explained about
90% of the returns, the second principal component 8%, and
the third principal component 2%.
I Thus, only three principal components were needed to fully
explain the dynamics of the yield curve.
I There have been several studies that have examined the yield
curve movement using PCA and reported similar results.
Using PCA to Control Interest Rate Risk
Once yield curve risk is described in terms of principal components,
the factor loadings can be used to:
I Construct hedges that neutralize exposure to changes in the
direction of interest rates.
I Construct hedges that neutralize exposure to changes in
nonparallel shifts in the yield curve.
I Structure yield curve trades.
PCA of the dynamics of the yield curve has led to the use of
what is now referred to as principal component duration.
Moreover, PCA can be used to estimate the probability associated
with a given hypothetical interest rate shock so that a bond
portfolio manager can better analyze the interest rate risk of a
bond portfolio and traders can better understand the risk exposure
of a bond trading strategy.
Factor Analysis: Bond Risk Factors
I For a bond index that includes nongovernment securities,
there are risk factors other than term structure factors.
I Using PCA, Gauthier and Goodman have empirically identified
the risk factors that generate nominal excess returns for the
Salomon Smith Barney Broad Investment Grade Index (SSB
BIG Index) for the period January 1992 to March 2003.
I The results of their PCA for the first six principal components
for each bond sector of the SSB BIG Index are presented on
the next slide.
An Illustration of Factor Analysis
(Exhibit: the first six principal components for each bond sector
of the SSB BIG Index; see the textbook.)
Factor Analysis: Bond Risk Factors
I The first principal component explains 92.7% of the variation.
I The second principal component explains 3.1% of nominal
excess returns. Gauthier and Goodman identify this factor as
the credit specific factor because of the high negative factor
loadings on the credit index combined with a high positive
weighting on Treasuries.
I Gauthier and Goodman identify the third principal component
as an optionality factor. This can be supported by noting that
the factor loadings on the asset classes that have some
optionality are positive, while the factor loading on the
noncallable series is negative.
Factor Analysis: Bond Risk Factors
I This third principal component, which represents optionality,
is consistent with studies of the movements of the yield curve.
I Gauthier and Goodman show that there is a high positive
correlation between the optionality factor and the slope of the
yield curve, but a negative relationship with 5-year cap volatility.
This suggests
1. the steeper the yield curve slope, the better a callable series
should do;
2. the higher the volatility, the lower the return on the callable
series.
PCA and Factor Analysis Compared
I The two illustrations of PCA and FA refer to the same data
and will help clarify the differences between the two methods.
I Let's first observe that PCA does not imply any specific
restriction on the process. Given a nonsingular covariance
matrix, we can always perform PCA as an exact linear
transformation of the series. When we consider a smaller
number of principal components, we perform an
approximation which has to be empirically justified.
I Factor analysis, on the other hand, assumes that the data
have a strict factor structure in the sense that the covariance
matrix of the data can be represented as a function of the
covariances between factors plus idiosyncratic variances.
PCA and Factor Analysis Compared
I PCA tends to be a dimensionality reduction technique that
can be applied to any multivariate distribution and that yields
incremental results. This means that there is a trade-off
between the gain in estimation from dimensionality reduction
and the percentage of variance explained.
I Factor analysis, on the other hand, tends to reveal the exact
factor structure of the data. That is, FA tends to give an
explanation in terms of what factors explain what processes.
I Factor rotation can be useful both in the case of PCA and FA.
Final Remarks
I The required textbook is "Financial Econometrics: From
Basics to Advanced Modeling Techniques".
I Please read Chapter 13 for today’s lecture.
I All concepts explained are listed on page 464 of the textbook.