Dimensionality Reduction using PCA:

Unsupervised Machine Learning

Dr. Arvind Selwal


Department of Computer Science & IT
Central University of Jammu
J&K, India-181143
Email: [email protected], [email protected]
Dimensionality Reduction
• Reduce the data down to its basic components, chipping away the
unnecessary parts.
• Assume the minion data is represented in 3-D.
Dimensionality Reduction
• Clearly, EV3 is unnecessary. Drop it and represent the data in
terms of EV1 and EV2.
• Re-arrange the axes along the Eigenvectors, rather than the
original 3-D axes (see the sketch below).
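
A minimal sketch of this step in Python (assuming NumPy and scikit-learn are available; the 3-D minion coordinates below are made-up illustrative data, not the slide's figure):

import numpy as np
from sklearn.decomposition import PCA

# Hypothetical 3-D "minion" coordinates; the third axis carries almost no variation.
rng = np.random.default_rng(0)
xy = rng.normal(size=(100, 2)) * [5.0, 2.0]   # most of the spread lives in these two axes
z = 0.05 * rng.normal(size=(100, 1))          # nearly flat third dimension
data_3d = np.hstack([xy, z])

# Re-express the data along its eigenvector directions and keep only the top two (EV1, EV2).
pca = PCA(n_components=2)
data_2d = pca.fit_transform(data_3d)

print(data_2d.shape)                   # (100, 2)
print(pca.explained_variance_ratio_)   # the dropped direction carried almost no variance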
Intuition behind using PCA

• Let’s take an example of counting the minions that are scattered in a
2-D space. Suppose we want to project them onto a 1-D line and
count them.

Courtesy: https://www.youtube.com/channel/UCFJPdVHPZOYhSyxmX_C_Pew
Intuition behind using PCA

• How to choose the 1-D line? (see the sketch below)
– Vertical: the minions collide with each other when projected => ✖
– At an angle: there is still some possibility of collision.
– Horizontal: least possibility of collision, i.e., maximum variation. => ✔
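
A minimal sketch of this intuition in Python (assuming NumPy; the 2-D minion positions and the three candidate angles are made up for illustration). Projecting the points onto a unit vector for each direction and comparing the variance of the projections shows why the horizontal line wins:

import numpy as np

# Hypothetical 2-D minion positions, spread mostly along the horizontal axis.
rng = np.random.default_rng(1)
points = rng.normal(size=(200, 2)) * [6.0, 1.0]

# Candidate 1-D lines: vertical, at an angle, horizontal.
for name, angle_deg in [("vertical", 90), ("at an angle", 45), ("horizontal", 0)]:
    theta = np.radians(angle_deg)
    direction = np.array([np.cos(theta), np.sin(theta)])  # unit vector along the line
    projections = points @ direction                      # 1-D coordinates on the line
    print(f"{name:12s} variance of projections = {projections.var():.2f}")

# The horizontal direction gives the largest variance, i.e. the least "collision".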
Principal Component Analysis (PCA)

• Purpose of PCA
– To compress a lot of data into a smaller representation such that the
compressed data captures the essence of the original data.
– Dimensionality Reduction.
– e.g., from X-D down to 3-D or 2-D.

• How is Dimensionality Reduction useful?


– Data processing in higher dimensions involves high time & space complexity and
computing cost.
– There is a risk of over-fitting.

• Not all the features in the dataset are relevant to the problem.
Some features are more relevant than others. The processing may
be done on the more relevant features only, without significant
loss of information (see the sketch below).
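
One common way to act on this is to look at how much variance each principal component explains and keep only the leading ones. A minimal sketch with scikit-learn (the Iris dataset here is just a convenient stand-in, not data from the slides):

import numpy as np
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA

X = load_iris().data                   # 150 samples, 4 features

pca = PCA().fit(X)                     # keep all components so we can inspect them
ratios = pca.explained_variance_ratio_
print(np.round(ratios, 3))             # per-component share of the total variance
print(np.round(np.cumsum(ratios), 3))  # cumulative share

# Keep only enough components to retain (say) 95% of the variance.
pca_95 = PCA(n_components=0.95).fit(X)
print(pca_95.n_components_)            # fewer than the original 4 dimensions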
Principal Component Analysis (PCA)

• In the minion’s example:


– We reduced the dimensionality from 2 to 1.
– The horizontal line would be the Principal Component.

• How to determine the Principal Component, mathematically?


– Using the concepts: Covariance matrix, Eigen-vectors, etc.
– Discussed in the following slides.
Eigenvector and Eigenvalues
• Eigenvector is a direction. E.g., in the minion's example, the
eigenvectors were the directions of the lines - vertical,
horizontal or at an angle.
• Eigenvalue is a number telling how much variance there is
in the data in that direction. E.g., in the minion's example, the
eigenvalue is the number telling how spread out the minions
are on the line.
• Principal Component = the eigenvector with the highest
eigenvalue.
• Every Eigenvector has a corresponding Eigenvalue.
• The number of eigenvector/eigenvalue pairs equals the number
of dimensions of the data.
How to find Eigenvector and Eigenvalues
• Let A be an n×n matrix.
– x is an eigenvector of A if:
Ax = λx
– λ is called the eigenvalue associated with x.

• How to find the Eigenvalue λ?
– Equate the determinant |A − λI| to 0. Here, I is the Identity Matrix.
|A − λI| = 0 (Characteristic Equation)
– Eigenvalues are the roots of the Characteristic Equation.

• How to find the Eigenvectors?
– Substitute each value of λ into the equation (A − λI)x = 0 and solve
for x (a numeric sketch follows below).
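
A minimal numeric check of these steps with NumPy (the 2x2 matrix A below is a made-up example): np.linalg.eig returns the eigenvalues (the roots of |A − λI| = 0) together with the corresponding eigenvectors, and sorting by eigenvalue picks out the principal component.

import numpy as np

# A made-up symmetric 2x2 matrix (e.g. a small covariance matrix).
A = np.array([[4.0, 2.0],
              [2.0, 3.0]])

# Eigen-decomposition: the columns of vecs are the eigenvectors.
vals, vecs = np.linalg.eig(A)

# Check the defining equation A x = lambda x for each eigenpair.
for lam, x in zip(vals, vecs.T):
    print(np.allclose(A @ x, lam * x))                        # True

# Each eigenvalue is a root of the characteristic equation |A - lambda*I| = 0.
print(np.isclose(np.linalg.det(A - vals[0] * np.eye(2)), 0))  # True

# Principal component = eigenvector with the largest eigenvalue.
order = np.argsort(vals)[::-1]
print("eigenvalues (sorted):", vals[order])
print("principal component:", vecs[:, order[0]])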
Example on Covariance Matrix

Covariance Matrix

C = | cov(H, H)  cov(H, M) |   | var(H)  104.5   |   |  47.7   104.5 |
    | cov(M, H)  cov(M, M) | = | 104.5   var(M)  | = | 104.5   370   |
Variance and Covariance

Variance
‘How spread out a given dataset is.’

Courtesy: https://www.youtube.com/watch?v=g-Hb26agBFg
Variance and Covariance

Covariance
‘Total variation of two variables from their expected values.’

Covariance Matrix
C_{n×n} = (c_{i,j})
where:
c_{i,j} = cov(A_i, A_j)
A_1, ..., A_n = the given n attributes.

• Interpreting the sign of the covariance (illustrated in the sketch below):
– Positive ⇒ both variables increase together.
– Negative ⇒ as one variable increases, the other decreases.
– Zero ⇒ no linear relationship between the variables (they are
uncorrelated, though not necessarily independent).
Courtesy: Smith, Lindsay I. A tutorial on principal components analysis. 2002.
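
A tiny NumPy sketch of the three cases (the toy variables are made up): y_up rises with x, y_down falls as x rises, and y_none is unrelated to x, so the three covariances come out positive, negative, and near zero respectively.

import numpy as np

rng = np.random.default_rng(2)
x = np.linspace(0, 10, 500)

y_up   = 2 * x + rng.normal(scale=0.5, size=x.size)    # increases with x
y_down = -3 * x + rng.normal(scale=0.5, size=x.size)   # decreases as x increases
y_none = rng.normal(scale=5.0, size=x.size)            # unrelated to x

for name, y in [("positive", y_up), ("negative", y_down), ("near zero", y_none)]:
    print(f"{name:9s} covariance: {np.cov(x, y)[0, 1]:8.2f}")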
Dimensionality Reduction

Advantages:
• Reduces redundant features,
• Addresses the multi-collinearity issue,
• Helps compress the data and reduces the space requirements,
• Reduces the time required to perform the same computations.

Applications:
• Stock Market Analysis,
• Image and Text processing,
• Speech Recognition,
• Recommendation Engine, etc.
Thanks!
