4.5 Principal Component Analysis

Topic: Principal Component Analysis

Prof. Bhabesh Deka


Dept. of ECE
Tezpur University
Dimensionality Reduction

 A broad class of techniques
 Goal is to compress the original representation of the inputs
 Example: Approximate each input $\boldsymbol{x}_n \in \mathbb{R}^D$ as a linear combination of $K$ "basis" vectors $\boldsymbol{w}_1, \ldots, \boldsymbol{w}_K$, each also in $\mathbb{R}^D$:
$$\boldsymbol{x}_n \approx \sum_{k=1}^{K} z_{nk}\,\boldsymbol{w}_k = \mathbf{W}\boldsymbol{z}_n$$
 Can think of $\mathbf{W}$ as a linear mapping that transforms the low-dim $\boldsymbol{z}_n$ into the high-dim $\boldsymbol{x}_n$. Some dim-red techniques instead assume a nonlinear mapping $f$ such that $\boldsymbol{x}_n \approx f(\boldsymbol{z}_n)$; for example, $f$ can be modeled by a kernel or a deep neural net
 Note: These "basis" vectors need not necessarily be linearly independent. But for some dim-red techniques, e.g., classic principal component analysis (PCA), they are
 We have represented each $\boldsymbol{x}_n$ by a $K$-dim vector $\boldsymbol{z}_n$ (a new feature representation)
 To store $N$ such inputs, we need to keep $\mathbf{Z}$ and $\mathbf{W}$
 Originally we required $O(ND)$ storage; now we need $O(NK + KD)$ storage
 If $K \ll D$, this yields a substantial storage saving, hence good compression (a small sketch of this comparison follows below)
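Below is a minimal NumPy sketch of the compression argument above. The shapes, the random data, and the placeholder basis are made up purely for illustration; any dim-red method would supply its own $\mathbf{W}$ and $\mathbf{Z}$.

```python
import numpy as np

# Made-up sizes for illustration: N inputs of dimension D, compressed with K basis vectors
N, D, K = 10_000, 1_024, 32

rng = np.random.default_rng(0)
X = rng.standard_normal((N, D))                    # original data: N*D numbers

# A dim-red method produces a basis W (D x K) and codes Z (N x K).
# Here W is just an orthonormal placeholder basis, not a learned one.
W = np.linalg.qr(rng.standard_normal((D, K)))[0]
Z = X @ W                                          # codes z_n = W^T x_n

X_hat = Z @ W.T                                    # reconstructions x_n ~ W z_n

print("original storage  :", X.size)               # N*D
print("compressed storage:", Z.size + W.size)      # N*K + K*D, much smaller when K << D
```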
Dimensionality Reduction

 Dim-red for face images: approximate each face image as a combination of $K=4$ "basis" face images
$$\text{A face image } \boldsymbol{x}_n \approx z_{n1}\boldsymbol{w}_1 + z_{n2}\boldsymbol{w}_2 + z_{n3}\boldsymbol{w}_3 + z_{n4}\boldsymbol{w}_4$$
 Each "basis" image $\boldsymbol{w}_k$ is like a "template" that captures the common properties of the face images in the dataset
 In this example, $\boldsymbol{z}_n = [z_{n1}, z_{n2}, z_{n3}, z_{n4}]$ is a low-dim feature representation for $\boldsymbol{x}_n$, like 4 new features
 Essentially, each face image in the dataset is now represented by just 4 real numbers
 Different dim-red algorithms differ in terms of how the basis vectors are defined/learned
Principal Component Analysis (PCA)

 A classic linear dim-reduction method (Pearson, 1901; Hotelling, 1933)
 Can be seen as
 Learning directions (co-ordinate axes) that capture the maximum variance in the data. PCA is essentially doing a change of the axes in which we are representing the data
 (Figure: data plotted in the standard co-ordinate axes $(\boldsymbol{e}_1, \boldsymbol{e}_2)$ and in the new co-ordinate axes $(\boldsymbol{w}_1, \boldsymbol{w}_2)$; the variance along $\boldsymbol{w}_1$ is large and the variance along $\boldsymbol{w}_2$ is small)
 Each input will still have 2 co-ordinates in the new co-ordinate system, equal to the distances measured from the new origin
 To reduce the dimension, we can keep only the co-ordinates along those directions that have the largest variances (e.g., in this example, if we want to reduce to one dim, we can keep the co-ordinate of each point along $\boldsymbol{w}_1$ and throw away the one along $\boldsymbol{w}_2$). We won't lose much information
 Learning projection directions that result in the smallest reconstruction error
$$\underset{\mathbf{W},\mathbf{Z}}{\arg\min}\;\sum_{n=1}^{N}\|\boldsymbol{x}_n - \mathbf{W}\boldsymbol{z}_n\|^2 \;=\; \underset{\mathbf{W},\mathbf{Z}}{\arg\min}\;\|\mathbf{X} - \mathbf{Z}\mathbf{W}^\top\|_F^2$$
subject to orthonormality constraints: $\boldsymbol{w}_k^\top\boldsymbol{w}_k = 1$ for all $k$, and $\boldsymbol{w}_k^\top\boldsymbol{w}_{k'} = 0$ for $k \neq k'$
Principal Component Analysis: the algorithm

 Center the data (subtract the mean from each data point)
 Compute the covariance matrix $\mathbf{S}$ using the centered data matrix $\mathbf{X}$ as
$$\mathbf{S} = \frac{1}{N}\mathbf{X}^\top\mathbf{X} \qquad (\text{assuming } \mathbf{X} \text{ is arranged as } N \times D)$$
 Do an eigendecomposition of the covariance matrix (many methods exist)
 Take the top $K$ leading eigenvectors $\{\boldsymbol{w}_1, \ldots, \boldsymbol{w}_K\}$ with eigenvalues $\{\lambda_1, \ldots, \lambda_K\}$
 The $K$-dimensional projection/embedding of each input $\boldsymbol{x}_n$ is
$$\boldsymbol{z}_n \approx \mathbf{W}_K^\top\boldsymbol{x}_n$$
where $\mathbf{W}_K = [\boldsymbol{w}_1, \ldots, \boldsymbol{w}_K]$ is the "projection matrix" of size $D \times K$
 Note: Can decide how many eigenvectors to use based on how much variance we want to capture (recall that each $\lambda_k$ gives the variance in the direction $\boldsymbol{w}_k$, and their sum is the total variance). A code sketch of these steps follows below
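The following is a minimal NumPy sketch of the steps listed above (the function name `pca`, the synthetic data, and the choice of `np.linalg.eigh` are illustrative, not from the slides):

```python
import numpy as np

def pca(X, K):
    """Sketch of PCA via eigendecomposition of the covariance matrix."""
    mu = X.mean(axis=0)                      # center the data
    Xc = X - mu
    S = (Xc.T @ Xc) / X.shape[0]             # covariance S = (1/N) X^T X, with X arranged as N x D
    eigvals, eigvecs = np.linalg.eigh(S)     # eigendecomposition (ascending eigenvalues)
    order = np.argsort(eigvals)[::-1]        # sort directions by decreasing variance
    W_K = eigvecs[:, order[:K]]              # projection matrix of size D x K
    lambdas = eigvals[order[:K]]             # variances captured by the kept directions
    Z = Xc @ W_K                             # embeddings z_n = W_K^T (x_n - mu)
    return Z, W_K, lambdas, mu

# Usage on synthetic data
rng = np.random.default_rng(0)
X = rng.standard_normal((500, 10)) @ rng.standard_normal((10, 10))
Z, W_K, lambdas, mu = pca(X, K=2)
print(Z.shape, W_K.shape, lambdas)           # (500, 2) (10, 2) [top-2 variances]
```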

Understanding PCA: The variance perspective


Solving PCA by Finding Max. Variance Directions

 Consider projecting an input $\boldsymbol{x}_n$ along a direction $\boldsymbol{w}_1$ (a unit-norm vector)
 The projection/embedding of $\boldsymbol{x}_n$ (red points in the figure) will be $\boldsymbol{w}_1^\top\boldsymbol{x}_n$ (green points)
 (Figure: inputs $\boldsymbol{x}_n$ and their projections onto the direction $\boldsymbol{w}_1$)
 Mean of the projections of all inputs: $\frac{1}{N}\sum_{n=1}^{N}\boldsymbol{w}_1^\top\boldsymbol{x}_n = \boldsymbol{w}_1^\top\boldsymbol{\mu}$
 Variance of the projections:
$$\frac{1}{N}\sum_{n=1}^{N}\left(\boldsymbol{w}_1^\top\boldsymbol{x}_n - \boldsymbol{w}_1^\top\boldsymbol{\mu}\right)^2 = \boldsymbol{w}_1^\top\mathbf{S}\,\boldsymbol{w}_1$$
where $\mathbf{S}$ is the covariance matrix of the data. For already centered data, $\boldsymbol{\mu} = \mathbf{0}$ and $\mathbf{S} = \frac{1}{N}\mathbf{X}^\top\mathbf{X}$
 Want $\boldsymbol{w}_1$ such that the variance is maximized and $\|\boldsymbol{w}_1\| = 1$:
$$\underset{\boldsymbol{w}_1}{\arg\max}\;\boldsymbol{w}_1^\top\mathbf{S}\,\boldsymbol{w}_1 \quad \text{s.t.} \quad \boldsymbol{w}_1^\top\boldsymbol{w}_1 = 1$$
Need this constraint, otherwise the objective's maximum is unbounded (the magnitude of $\boldsymbol{w}_1$ could be blown up arbitrarily)
Max. Variance Direction

 Our objective function was $\arg\max_{\boldsymbol{w}_1}\,\boldsymbol{w}_1^\top\mathbf{S}\,\boldsymbol{w}_1$ s.t. $\boldsymbol{w}_1^\top\boldsymbol{w}_1 = 1$ (the variance along the direction $\boldsymbol{w}_1$)
 Can construct a Lagrangian for this problem:
$$\mathcal{L}(\boldsymbol{w}_1, \lambda_1) = \boldsymbol{w}_1^\top\mathbf{S}\,\boldsymbol{w}_1 + \lambda_1\left(1 - \boldsymbol{w}_1^\top\boldsymbol{w}_1\right)$$
 Taking the derivative w.r.t. $\boldsymbol{w}_1$ and setting it to zero gives $\mathbf{S}\boldsymbol{w}_1 = \lambda_1\boldsymbol{w}_1$
 Therefore $\boldsymbol{w}_1$ is an eigenvector of the covariance matrix $\mathbf{S}$ with eigenvalue $\lambda_1$ (note: in general, $\mathbf{S}$ will have $D$ eigenvectors)
 Claim: $\boldsymbol{w}_1$ is the eigenvector of $\mathbf{S}$ with the largest eigenvalue $\lambda_1$. Note that $\boldsymbol{w}_1^\top\mathbf{S}\,\boldsymbol{w}_1 = \lambda_1\boldsymbol{w}_1^\top\boldsymbol{w}_1 = \lambda_1$
 Thus the variance will be maximized if $\lambda_1$ is the largest eigenvalue (and $\boldsymbol{w}_1$ is the corresponding top eigenvector, also known as the first Principal Component)
 Other large-variance directions can also be found likewise (with each new direction required to be orthogonal to the previously found ones); PCA would keep the top $K$ such directions of largest variances
 Note: The total variance of the data is equal to the sum of the eigenvalues of $\mathbf{S}$, i.e., $\sum_{d=1}^{D}\lambda_d$ (a small numerical check of these facts follows below)
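A small numerical check of the claims above (the synthetic data and tolerance are illustrative): the top eigenvector of $\mathbf{S}$ attains a larger projected variance $\boldsymbol{w}^\top\mathbf{S}\boldsymbol{w}$ than random unit-norm directions, and the total variance equals the sum of the eigenvalues.

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.standard_normal((1000, 5)) @ rng.standard_normal((5, 5))   # correlated synthetic data
Xc = X - X.mean(axis=0)
S = Xc.T @ Xc / Xc.shape[0]                     # covariance matrix

eigvals, eigvecs = np.linalg.eigh(S)
w1 = eigvecs[:, -1]                             # eigenvector with the largest eigenvalue
print("variance along w1 :", w1 @ S @ w1)       # equals the largest eigenvalue
print("largest eigenvalue:", eigvals[-1])

for _ in range(5):                              # random unit directions never beat w1
    w = rng.standard_normal(5)
    w /= np.linalg.norm(w)
    assert w @ S @ w <= w1 @ S @ w1 + 1e-9

print("total variance == sum of eigenvalues:", np.isclose(np.trace(S), eigvals.sum()))
```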

Understanding PCA: The reconstruction perspective


Alternate Basis and Reconstruction

 Representing a data point $\boldsymbol{x}_n$ in the standard orthonormal basis $\{\boldsymbol{e}_1, \ldots, \boldsymbol{e}_D\}$:
$$\boldsymbol{x}_n = \sum_{d=1}^{D} x_{nd}\,\boldsymbol{e}_d$$
where $\boldsymbol{e}_d$ is a vector of all zeros except a single 1 at the $d$-th position. Also, $\boldsymbol{e}_d^\top\boldsymbol{e}_{d'} = 0$ for $d \neq d'$
 Let's represent the same data point in a new orthonormal basis $\{\boldsymbol{w}_1, \ldots, \boldsymbol{w}_D\}$:
$$\boldsymbol{x}_n = \sum_{d=1}^{D} z_{nd}\,\boldsymbol{w}_d$$
where $\boldsymbol{z}_n$ denotes the co-ordinates of $\boldsymbol{x}_n$ in the new basis, and $z_{nd} = \boldsymbol{x}_n^\top\boldsymbol{w}_d$ is the projection of $\boldsymbol{x}_n$ along the direction $\boldsymbol{w}_d$ since $\boldsymbol{w}_d^\top\boldsymbol{w}_d = 1$ (verify)
 Ignoring the directions along which the projection is small, we can approximate $\boldsymbol{x}_n$ as
$$\boldsymbol{x}_n \approx \hat{\boldsymbol{x}}_n = \sum_{d=1}^{K} z_{nd}\,\boldsymbol{w}_d = \sum_{d=1}^{K}\left(\boldsymbol{x}_n^\top\boldsymbol{w}_d\right)\boldsymbol{w}_d = \left(\sum_{d=1}^{K}\boldsymbol{w}_d\boldsymbol{w}_d^\top\right)\boldsymbol{x}_n$$
Note that $\|\boldsymbol{x}_n - \hat{\boldsymbol{x}}_n\|^2$ is the reconstruction error on $\boldsymbol{x}_n$; we would like to minimize it w.r.t. the $\boldsymbol{w}_d$'s
 Now $\boldsymbol{x}_n$ is represented by the $K$-dim rep. $\boldsymbol{z}_n \approx \mathbf{W}_K^\top\boldsymbol{x}_n$, where $\mathbf{W}_K = [\boldsymbol{w}_1, \ldots, \boldsymbol{w}_K]$ is the "projection matrix" of size $D \times K$. Also, $\hat{\boldsymbol{x}}_n = \mathbf{W}_K\boldsymbol{z}_n$ (verify; a short NumPy check follows below)
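A short NumPy check of the basis-change identities above (the basis here is an arbitrary orthonormal matrix obtained by QR, purely for illustration):

```python
import numpy as np

rng = np.random.default_rng(2)
D, K = 6, 2
W, _ = np.linalg.qr(rng.standard_normal((D, D)))   # an orthonormal basis, columns w_1, ..., w_D
x = rng.standard_normal(D)

z = W.T @ x                          # co-ordinates of x in the new basis, z_d = w_d^T x
assert np.allclose(W @ z, x)         # exact recovery when all D directions are kept

W_K = W[:, :K]                       # keep only K directions
x_hat = W_K @ (W_K.T @ x)            # approximation (sum_d w_d w_d^T) x
print("reconstruction error:", np.linalg.norm(x - x_hat) ** 2)
```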
Minimizing Reconstruction Error

 We plan to use only $K < D$ directions, so we would like them to be such that the total reconstruction error is minimized:
$$\mathcal{L}(\boldsymbol{w}_1, \boldsymbol{w}_2, \ldots, \boldsymbol{w}_K) = \sum_{n=1}^{N}\|\boldsymbol{x}_n - \hat{\boldsymbol{x}}_n\|^2 = C - N\sum_{d=1}^{K}\boldsymbol{w}_d^\top\mathbf{S}\,\boldsymbol{w}_d \quad \text{(verify)}$$
where $C$ is a constant that doesn't depend on the $\boldsymbol{w}_d$'s, and each $\boldsymbol{w}_d^\top\mathbf{S}\,\boldsymbol{w}_d$ is the variance along the direction $\boldsymbol{w}_d$
 Each optimal $\boldsymbol{w}_d$ can be found by solving $\arg\max_{\boldsymbol{w}_d}\,\boldsymbol{w}_d^\top\mathbf{S}\,\boldsymbol{w}_d$ s.t. $\boldsymbol{w}_d^\top\boldsymbol{w}_d = 1$
 Thus minimizing the reconstruction error is equivalent to maximizing variance
 The directions can be found by solving the eigendecomposition of $\mathbf{S}$
 Note: $\sum_{n=1}^{N}\|\boldsymbol{x}_n - \hat{\boldsymbol{x}}_n\|^2 = \|\mathbf{X} - \mathbf{Z}\mathbf{W}_K^\top\|_F^2$. Thus $\arg\min_{\mathbf{W},\mathbf{Z}}\|\mathbf{X} - \mathbf{Z}\mathbf{W}^\top\|_F^2$ s.t. orthonormality on the columns of $\mathbf{W}$ is the same as solving the eigendecomposition problem above (a numerical check follows below)
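As an illustrative check of this equivalence (synthetic data, not from the slides): for centered data, the total reconstruction error with the top-$K$ eigenvectors equals $N$ times the sum of the discarded eigenvalues, i.e., minimizing the error means keeping the largest-variance directions.

```python
import numpy as np

rng = np.random.default_rng(3)
N, D, K = 400, 8, 3
X = rng.standard_normal((N, D)) @ rng.standard_normal((D, D))
Xc = X - X.mean(axis=0)                          # centered data

S = Xc.T @ Xc / N
eigvals, eigvecs = np.linalg.eigh(S)             # ascending eigenvalues
W_K = eigvecs[:, -K:]                            # top-K directions

X_hat = Xc @ W_K @ W_K.T                         # reconstructions
err = np.sum((Xc - X_hat) ** 2)                  # total reconstruction error
print(err, N * eigvals[:-K].sum())               # the two numbers match
```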
Principal Component Analysis

 Center the data (subtract the mean from each data point)
 Compute the covariance matrix $\mathbf{S}$ using the centered data matrix $\mathbf{X}$ as
$$\mathbf{S} = \frac{1}{N}\mathbf{X}^\top\mathbf{X} \qquad (\text{assuming } \mathbf{X} \text{ is arranged as } N \times D)$$
 Do an eigendecomposition of the covariance matrix (many methods exist)
 Take the top $K$ leading eigenvectors $\{\boldsymbol{w}_1, \ldots, \boldsymbol{w}_K\}$ with eigenvalues $\{\lambda_1, \ldots, \lambda_K\}$
 The $K$-dimensional projection/embedding of each input $\boldsymbol{x}_n$ is
$$\boldsymbol{z}_n \approx \mathbf{W}_K^\top\boldsymbol{x}_n$$
where $\mathbf{W}_K = [\boldsymbol{w}_1, \ldots, \boldsymbol{w}_K]$ is the "projection matrix" of size $D \times K$
 Note: Can decide how many eigenvectors to use based on how much variance we want to capture (recall that each $\lambda_k$ gives the variance in the direction $\boldsymbol{w}_k$, and their sum is the total variance)
Singular Value Decomposition (SVD)

 Any matrix $\mathbf{X}$ of size $N \times D$ can be represented as the following decomposition
$$\mathbf{X} = \mathbf{U}\boldsymbol{\Lambda}\mathbf{V}^\top = \sum_{k=1}^{\min\{N,D\}}\lambda_k\,\boldsymbol{u}_k\boldsymbol{v}_k^\top$$
 $\mathbf{U} = [\boldsymbol{u}_1, \ldots, \boldsymbol{u}_N]$ is the $N \times N$ matrix of left singular vectors, each $\boldsymbol{u}_k \in \mathbb{R}^N$; $\mathbf{U}$ is also orthonormal
 $\mathbf{V} = [\boldsymbol{v}_1, \ldots, \boldsymbol{v}_D]$ is the $D \times D$ matrix of right singular vectors, each $\boldsymbol{v}_k \in \mathbb{R}^D$; $\mathbf{V}$ is also orthonormal
 $\boldsymbol{\Lambda}$ is $N \times D$ with only $\min\{N,D\}$ diagonal entries, the singular values. It is a diagonal matrix: if $N > D$, the last $N - D$ rows are all zeros; if $D > N$, the last $D - N$ columns are all zeros
 Note: If $\mathbf{X}$ is symmetric, then this is known as the eigenvalue decomposition (a small NumPy sketch of the SVD follows below)
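A small NumPy sketch of the decomposition above (the shapes and the random matrix are made up; `np.linalg.svd` is the standard NumPy routine):

```python
import numpy as np

rng = np.random.default_rng(4)
N, D = 7, 4
X = rng.standard_normal((N, D))

U, s, Vt = np.linalg.svd(X, full_matrices=True)   # U: N x N, s: min(N,D) singular values, Vt: D x D
Lam = np.zeros((N, D))                            # N x D "diagonal" matrix of singular values
Lam[:len(s), :len(s)] = np.diag(s)

assert np.allclose(U @ Lam @ Vt, X)               # X = U Lambda V^T
assert np.allclose(U.T @ U, np.eye(N))            # U is orthonormal
assert np.allclose(Vt @ Vt.T, np.eye(D))          # V is orthonormal
```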
Low-Rank Approximation via SVD

 If we just use the top $K$ singular values, we get a rank-$K$ SVD approximation
$$\mathbf{X} \approx \mathbf{X}_K = \sum_{k=1}^{K}\lambda_k\,\boldsymbol{u}_k\boldsymbol{v}_k^\top$$
 The above SVD approximation can be shown to minimize the reconstruction error $\|\mathbf{X} - \mathbf{X}_K\|_F^2$
 Fact: SVD gives the best rank-$K$ approximation of a matrix
 PCA is done by doing SVD on the covariance matrix (the left and right singular vectors are the same and become the eigenvectors, and the singular values become the eigenvalues); a short truncation sketch follows below
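An illustrative sketch of the rank-$K$ truncation on random data; the reconstruction error equals the sum of the squared discarded singular values.

```python
import numpy as np

rng = np.random.default_rng(5)
X = rng.standard_normal((50, 20))
K = 5

U, s, Vt = np.linalg.svd(X, full_matrices=False)
X_K = U[:, :K] @ np.diag(s[:K]) @ Vt[:K, :]         # best rank-K approximation in Frobenius norm

print(np.sum((X - X_K) ** 2), np.sum(s[K:] ** 2))   # the two numbers match
```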
Dim-Red as Matrix Factorization

 If we don't care about the orthonormality constraints, then dim-red can also be achieved by solving a matrix factorization problem on the data matrix: approximate the $N \times D$ matrix $\mathbf{X}$ by the product of an $N \times K$ matrix $\mathbf{Z}$ (containing the low-dim reps of the inputs) and a $K \times D$ matrix $\mathbf{W}$
$$\mathbf{X}_{N \times D} \approx \mathbf{Z}_{N \times K}\,\mathbf{W}_{K \times D}$$
$$\{\hat{\mathbf{Z}}, \hat{\mathbf{W}}\} = \underset{\mathbf{Z},\mathbf{W}}{\arg\min}\;\|\mathbf{X} - \mathbf{Z}\mathbf{W}\|^2$$
 If $K < \min\{N, D\}$, such a factorization gives a low-rank approximation of the data matrix $\mathbf{X}$
 Can solve such problems using ALT-OPT (alternating optimization); a sketch follows below
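A minimal ALT-OPT (alternating least squares) sketch for the unconstrained factorization above. The sizes, the synthetic low-rank data, and the fixed iteration count are made up for illustration; this is a sketch, not a tuned implementation.

```python
import numpy as np

rng = np.random.default_rng(6)
N, D, K = 200, 30, 5
X = rng.standard_normal((N, K)) @ rng.standard_normal((K, D))   # synthetic low-rank data

Z = rng.standard_normal((N, K))                  # random initialization
W = rng.standard_normal((K, D))
for _ in range(50):
    # Fix W, solve the least-squares problem for Z (min_Z ||X - Z W||^2)
    Z = np.linalg.lstsq(W.T, X.T, rcond=None)[0].T
    # Fix Z, solve the least-squares problem for W (min_W ||X - Z W||^2)
    W = np.linalg.lstsq(Z, X, rcond=None)[0]

print("reconstruction error:", np.linalg.norm(X - Z @ W) ** 2)
```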
