SVD and PCA
Real data usually have thousands, or even millions, of
dimensions
E.g., web documents, where the dimensionality is the
vocabulary of words
Facebook graph, where the dimensionality is the
number of users
Huge number of dimensions causes problems
Data becomes very sparse, and some algorithms become
meaningless (e.g., density-based clustering)
The complexity of several algorithms depends on the
dimensionality and they become infeasible.
Usually the data can be described with fewer
dimensions, without losing much of the meaning
of the data.
The data reside in a space of lower dimensionality
Essentially, we assume that some of the data is
noise, and that we can approximate the useful part
with a lower-dimensional space.
Dimensionality reduction does not just reduce the
amount of data, it often brings out the useful part of
the data
We have already seen a form of
dimensionality reduction
LSH and random projections reduce the
dimension while preserving the distances
SVD is “the Rolls-Royce and the Swiss Army
Knife of Numerical Linear Algebra.”*
*Dianne O’Leary, MMDS ’06
We are given n objects and d attributes describing the
objects. Each object has d numeric values describing
it.
We will represent the data as a n×d real matrix A.
We can now use tools from linear algebra to process the
data matrix
Our goal is to produce a new n×k matrix B such that
It preserves as much of the information in the original
matrix A as possible
It reveals something about the structure of the data in A
Example: document-term matrix
n documents × d terms (e.g., theorem, proof, etc.)
Aij = frequency of the j-th term in the i-th document
Find subsets of terms that bring documents together
Example: customer-movie matrix
n customers × d movies
Aij = rating of the j-th product by the i-th customer
Find subsets of movies that capture the behavior of
the customers
We assume that vectors are column vectors.
We use $u^T$ for the transpose of vector $u$ (a row vector)
Dot product: $u^T v$ ($1\times n$, $n\times 1 \to 1\times 1$)
The dot product is the projection of vector $v$ on $u$ (and vice versa)
$\begin{pmatrix}1 & 2 & 3\end{pmatrix}\begin{pmatrix}4\\1\\2\end{pmatrix} = 12$
$u^T v = \|u\|\,\|v\|\cos(u,v)$
If $\|u\| = 1$ (unit vector) then $u^T v$ is the projection length of $v$ on $u$
$\begin{pmatrix}1 & 2 & 3\end{pmatrix}\begin{pmatrix}4\\1\\-2\end{pmatrix} = 0$ : orthogonal vectors
Orthonormal vectors: two unit vectors that are orthogonal
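A quick check of these definitions in R (a small sketch using the vectors from the example above; the vector (4, 1, -2) in the orthogonality example is an assumed reconstruction):
u = c(1, 2, 3)
v = c(4, 1, 2)
sum(u * v)                      # dot product u'v = 12
v_unit = v / sqrt(sum(v * v))   # normalize v to a unit vector
sum(u * v_unit)                 # projection length of u on v
w = c(4, 1, -2)                 # assumed vector for the orthogonality example
sum(u * w)                      # = 0: u and w are orthogonal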
An n×m matrix A is a collection of n row vectors and m column
vectors
Matrix-vector multiplication
Right multiplication $Au$: projection of $u$ onto the row vectors of $A$, or
projection of the row vectors of $A$ onto $u$.
Left multiplication $u^T A$: projection of $u$ onto the column vectors of $A$,
or projection of the column vectors of $A$ onto $u$
Example:
$\begin{pmatrix}1 & 2 & 3\end{pmatrix}\begin{pmatrix}1 & 0\\0 & 1\\0 & 0\end{pmatrix} = \begin{pmatrix}1 & 2\end{pmatrix}$
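The same multiplication in R (a minimal sketch of the example above):
u = c(1, 2, 3)
A = matrix(c(1, 0,
             0, 1,
             0, 0), nrow = 3, byrow = TRUE)
u %*% A          # left multiplication u'A: projects u onto the columns of A, giving (1, 2)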
Row space of A: The set of vectors that can be written
as a linear combination of the rows of A
All vectors of the form $x^T A$
Column space of A: The set of vectors that can be
written as a linear combination of the columns of A
All vectors of the form $Ax$.
Rank of A: the number of linearly independent row (or
column) vectors
These vectors define a basis for the row (or column) space
of A
In a rank-1 matrix, all columns (or rows) are
multiples of the same column (or row) vector
$A = \begin{pmatrix}1 & 2 & 1\\2 & 4 & 2\\3 & 6 & 3\end{pmatrix}$
All rows are multiples of $(1, 2, 1)$
All columns are multiples of $(1, 2, 3)^T$
Outer (external) product: $u v^T$ ($n\times 1$, $1\times m \to n\times m$)
The resulting $n\times m$ matrix has rank 1: all rows (or columns)
are linearly dependent
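A sketch of the outer product in R, reproducing the rank-1 matrix above:
u = c(1, 2, 3)
v = c(1, 2, 1)
A = outer(u, v)   # outer product u v': every row is a multiple of v, every column a multiple of u
A
qr(A)$rank        # = 1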
(Right) Eigenvector of matrix A: a vector $v$
such that $Av = \lambda v$
$\lambda$: eigenvalue of eigenvector $v$
A square symmetric matrix A of rank r has r orthonormal
eigenvectors $u_1, u_2, \ldots, u_r$ with eigenvalues
$\lambda_1, \lambda_2, \ldots, \lambda_r$.
Eigenvectors define an orthonormal basis for
the column space of A
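A small illustration with R's eigen() on a toy symmetric matrix (the matrix is made up for the example):
A = matrix(c(2, 1,
             1, 2), nrow = 2, byrow = TRUE)   # a toy symmetric matrix
e = eigen(A)
e$values                 # eigenvalues: 3 and 1
e$vectors                # orthonormal eigenvectors (as columns)
A %*% e$vectors[, 1]     # equals e$values[1] * e$vectors[, 1]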
$A = U \Sigma V^T = \begin{pmatrix}u_1 & u_2 & \cdots & u_r\end{pmatrix}\begin{pmatrix}\sigma_1 & & 0\\ & \ddots & \\ 0 & & \sigma_r\end{pmatrix}\begin{pmatrix}v_1^T\\ \vdots\\ v_r^T\end{pmatrix}$
[n×m] = [n×r] [r×r] [r×m]
r: rank of matrix A
$\sigma_1, \sigma_2, \cdots, \sigma_r$: singular values of matrix A (also, the square roots of the
eigenvalues of $AA^T$ and $A^T A$)
$u_1, u_2, \ldots, u_r$: left singular vectors of A (also eigenvectors of $AA^T$)
$v_1, v_2, \ldots, v_r$: right singular vectors of A (also, eigenvectors of $A^T A$)
$A = \sigma_1 u_1 v_1^T + \sigma_2 u_2 v_2^T + \cdots + \sigma_r u_r v_r^T$
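The relation between the singular values and the eigenvalues of $A^T A$ can be checked directly in R (a sketch on a random matrix):
A = matrix(rnorm(20), nrow = 5)     # a random 5x4 matrix, just for illustration
s = svd(A)
s$d                                 # singular values of A
sqrt(eigen(t(A) %*% A)$values)      # square roots of the eigenvalues of A'A: the same values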
Special case: A is a symmetric positive definite
matrix
$A = \lambda_1 u_1 u_1^T + \lambda_2 u_2 u_2^T + \cdots + \lambda_r u_r u_r^T$
$\lambda_1 \ge \lambda_2 \ge \cdots \ge \lambda_r \ge 0$: Eigenvalues of A
$u_1, u_2, \ldots, u_r$: Eigenvectors of A
The left singular vectors are an orthonormal
basis for the column space of A.
The right singular vectors are an orthonormal
basis for the row space of A.
If A has rank r, then A can be written as the sum
of r rank-1 matrices
There are r “linear components” (trends) in A.
Linear trend: the tendency of the row vectors of A to align with vector $v$
Strength of the i-th linear trend: $\|A v_i\| = \sigma_i$
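The strength of a linear trend can be verified numerically (a sketch; the matrix is random):
A = matrix(rnorm(50), nrow = 10)    # a random 10x5 matrix, just for illustration
s = svd(A)
sqrt(sum((A %*% s$v[, 1])^2))       # ||A v_1||
s$d[1]                              # sigma_1: the same value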
Example: document-term matrix
[matrix A: the blue and red rows (columns) are linearly dependent]
There are two prototype documents (vectors of words): blue and red
To describe the data it is enough to describe the two prototypes and the
projection weights for each row
A is a rank-2 matrix
$A = w_1 d_1^T + w_2 d_2^T$, where $d_1, d_2$ are the two prototype documents and $w_1, w_2$ the weight vectors
Example: noisy document-term matrix
[matrix A: the same two blocks as before, with noise added]
There are two prototype documents and words, but
they are noisy
We now have more than two singular vectors, but the
strongest ones still correspond to the two types.
By keeping the two strongest singular vectors we obtain
most of the information in the data.
▪ This is a rank-2 approximation of the matrix A
$A_k = U_k \Sigma_k V_k^T$   [n×d] = [n×k] [k×k] [k×d]
Uk (Vk): column-orthonormal matrix containing the top k left (right)
singular vectors of A.
Σk: diagonal matrix containing the top k singular values of A
Ak is an approximation of A
In fact, Ak is the best rank-k approximation of A:
The rank-k approximation matrix $A_k$
produced by the top-k singular vectors of A
minimizes the Frobenius norm of the
difference with the matrix A
$A_k = \arg\min_{B:\,\mathrm{rank}(B)=k} \|A - B\|_F$
$\|A - B\|_F^2 = \sum_{i,j} (A_{ij} - B_{ij})^2$
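A sketch of the rank-k approximation in R; the Frobenius norm of the error equals the norm of the discarded singular values:
A = matrix(rnorm(200), nrow = 20)   # a random 20x10 matrix, just for illustration
s = svd(A)
k = 3
Ak = s$u[, 1:k] %*% diag(s$d[1:k]) %*% t(s$v[, 1:k])   # rank-k approximation A_k
sqrt(sum((A - Ak)^2))               # Frobenius norm ||A - A_k||_F
sqrt(sum(s$d[(k+1):length(s$d)]^2)) # equals sqrt(sigma_{k+1}^2 + ... + sigma_r^2)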
We can project the row (and column) vectors
of the matrix A into a k-dimensional space
and preserve most of the information
(Ideally) The k dimensions reveal latent
features/aspects/topics of the term
(document) space.
(Ideally) The $A_k$ approximation of matrix A
contains all the useful information, and what
is discarded is noise
Rows (columns) are linear combinations of k
latent factors
E.g., in our extreme document example there are
two factors
Some noise is added to this rank-k matrix
resulting in higher rank
SVD retrieves the latent factors (hopefully).
[Figure: $A = U \Sigma V^T$ with objects as rows and features as columns; the significant (top) singular components are separated from the noise components]
Data: Users rating movies
Sparse and often noisy
Assumption: There are k basic user profiles, and each user
is a linear combination of these profiles
E.g., action, comedy, drama, romance
Each user is a weighted combination of these profiles
The “true” matrix has rank k
What we observe is a noisy and incomplete version of this
matrix $\tilde{A}$
The rank-k approximation $\tilde{A}_k$ is provably close to $A_k$
Algorithm: compute $\tilde{A}_k$ and predict for user $u$ and movie
$m$ the value $\tilde{A}_k[u, m]$.
Model-based collaborative filtering
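A minimal sketch of this idea on a made-up 4×4 ratings matrix (the numbers and the choice k = 2 are only illustrative; handling the missing entries properly is a separate issue):
ratings = matrix(c(5, 4, 1, 1,
                   4, 5, 2, 1,
                   1, 1, 5, 4,
                   2, 1, 4, 5), nrow = 4, byrow = TRUE)   # users x movies (toy ratings)
s = svd(ratings)
k = 2
Rk = s$u[, 1:k] %*% diag(s$d[1:k]) %*% t(s$v[, 1:k])     # rank-k approximation
Rk[1, 3]                            # predicted (denoised) rating of user 1 for movie 3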
PCA is a special case of SVD, applied to the
centered data matrix.
Goal: reduce the dimensionality while preserving the
“information in the data”
Information in the data: variability in the data
We measure variability using the covariance matrix.
Sample covariance of variables X and Y:
$\sigma_{XY} = \sum_i (x_i - \mu_X)(y_i - \mu_Y)$ (up to a normalization factor)
Given matrix A, remove the mean of each column
from the column vectors to get the centered matrix C
The matrix $V = C^T C$ is the covariance matrix of the
row vectors of A.
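In R (a sketch using the built-in iris data, as in the PCA code at the end of this section):
A = as.matrix(iris[, 1:4])                  # n x d numeric data
C = scale(A, center = TRUE, scale = FALSE)  # remove the mean of each column
V = t(C) %*% C                              # covariance matrix of the rows of A (up to the 1/(n-1) factor)
all.equal(V / (nrow(A) - 1), cov(A), check.attributes = FALSE)   # TRUE: cov() divides by n-1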
We will project the rows of matrix A into a new
set of attributes (dimensions) such that:
The attributes have zero covariance to each other
(they are orthogonal)
Each attribute captures the most remaining variance
in the data, while orthogonal to the existing attributes
▪ The first attribute should capture the most variance in the
data
For matrix C, the variance of the rows of C when
projected onto vector $x$ is given by $\sigma^2 = \|Cx\|^2$
The first right singular vector of C maximizes $\sigma^2$
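A quick numerical check (a sketch with the centered iris data): the first right singular vector captures more variance than a random unit vector.
C = scale(as.matrix(iris[, 1:4]), center = TRUE, scale = FALSE)
s = svd(C)
sum((C %*% s$v[, 1])^2)              # variance along the first right singular vector = sigma_1^2
s$d[1]^2                             # the same value
x = rnorm(4); x = x / sqrt(sum(x^2)) # a random unit vector
sum((C %*% x)^2)                     # smaller (at most sigma_1^2)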
Input: 2-dimensional points. Output:
1st (right) singular vector: direction of maximal variance
2nd (right) singular vector: direction of maximal variance, after removing the
projection of the data along the first singular vector
σ1: measures how much of the data variance is explained by the first singular vector
σ2: measures how much of the data variance is explained by the second singular vector
[Scatter plot of the points with the 1st and 2nd (right) singular vectors drawn as directions]
The variance in the direction of the k-th principal component
is given by the square of the corresponding singular value, $\sigma_k^2$
Singular values can be used to estimate how many
components to keep
Rule of thumb: keep enough to explain 85% of the variation:
$\frac{\sum_{j=1}^{k} \sigma_j^2}{\sum_{j=1}^{n} \sigma_j^2} \approx 0.85$
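In R, the rule of thumb amounts to looking at the cumulative fraction of the squared singular values (a sketch with the centered iris data):
C = scale(as.matrix(iris[, 1:4]), center = TRUE, scale = FALSE)
s = svd(C)
frac = cumsum(s$d^2) / sum(s$d^2)   # fraction of the variance explained by the top-k components
frac
which(frac >= 0.85)[1]              # smallest k that explains at least 85% of the variance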
Example: students and drugs
A: students × drugs matrix, with the columns split into legal and illegal drugs
$A_{ij}$: usage of student i of drug j
$A = U \Sigma V^T$
First right singular vector: gives more or less the same weight to all drugs;
discriminates heavy from light users
Second right singular vector: positive values for legal drugs, negative for illegal ones
[Plot in the plane of two drugs (axes “Drug 1”, “Drug 2”) with the two right singular vectors drawn as directions]
The chosen vectors minimize the sum of squared differences
between the data vectors and their projections onto the low-dimensional space
[Scatter plot: the data points and their projections onto the 1st (right) singular vector]
Latent Semantic Indexing (LSI):
Apply PCA on the document-term matrix, and
index the k-dimensional vectors
When a query comes, project it onto the k-
dimensional space and compute cosine similarity
in this space
Principal components capture main topics, and
enrich the document representation
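A minimal LSI sketch on a made-up 4×4 document-term matrix (the counts and the choice k = 2 are only illustrative):
A = matrix(c(2, 1, 0, 0,
             1, 2, 0, 0,
             0, 0, 1, 2,
             0, 0, 2, 1), nrow = 4, byrow = TRUE)   # documents x terms (toy counts)
s = svd(A)
k = 2
docs_k = A %*% s$v[, 1:k]                 # documents projected to the k-dimensional space
q = c(1, 1, 0, 0)                         # a query over the 4 terms
q_k = as.vector(q %*% s$v[, 1:k])         # project the query into the same space
docs_k %*% q_k / (sqrt(rowSums(docs_k^2)) * sqrt(sum(q_k^2)))   # cosine similarity to each document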
# SVD
dat = seq(1,240,2)            # 120 values
X = matrix(dat,ncol=12)       # a 10x12 matrix
s = svd(X)                    # compact SVD: u is 10x10, d has 10 values, v is 12x10
A = diag(s$d)
s$u %*% A %*% t(s$v)          # reconstructs X = U A V'
# Full SVD: U is 10x10, V is 12x12
dat = seq(1,240,2)
X = matrix(dat,ncol=12)
s = svd(X, nu = nrow(X), nv = ncol(X))
A = diag(s$d)                 # 10x10 diagonal matrix of singular values
A = cbind(A, 0)               # add two zero columns so that A has the same dimensions as X (10x12)
A = cbind(A, 0)
s$u %*% A %*% t(s$v)          # reconstructs X = U A V'
install.packages("jpeg")
library(jpeg)

tux = readJPEG("tux.jpg")
# If the image is RGB, average the three channels to get a grey-scale matrix
if (length(dim(tux)) == 3) tux = (tux[,,1] + tux[,,2] + tux[,,3]) / 3
# Display helper using base R image() (replaces imagematrix(), which needs an extra package)
show_grey <- function(M) image(t(M)[, nrow(M):1], col = grey(0:255/255), axes = FALSE)
show_grey(tux)

reduce <- function(A, dim) {
  # Calculates the SVD of A
  sing <- svd(A)
  # Keep only the top-'dim' singular vectors and values
  u <- as.matrix(sing$u[, 1:dim])
  v <- as.matrix(sing$v[, 1:dim])
  d <- as.matrix(diag(sing$d)[1:dim, 1:dim])
  # Return the rank-'dim' approximation of A
  u %*% d %*% t(v)
}

tux_d = svd(tux)
length(tux_d$d)              # number of singular values

show_grey(reduce(tux, 1))
# ~90% reduction: keep only 35 of the singular values
show_grey(reduce(tux, 35))
# PCA
iris2 = iris[, 1:4]                   # numeric attributes of the built-in iris data (iris2 assumed from earlier examples)
pc = princomp(iris2)
summary(pc)                           # variance explained by each principal component
pc$scores                             # the data projected on the principal components
pc$loadings                           # the principal components (right singular vectors of the centered data)
plot(pc$scores[,2], pc$scores[,1])    # plot the data on the first two principal components