Three Derivations of Principal Component Analysis
j.p.lewis
Why are the PCA basis vectors the eigenvectors of the correlation matrix?
From Ballard & Brown, Computer Vision: The (random) data vector is x; its component along a proposed axis u is (x · u).
The variance of this component is E[(x · u − E(x · u))²] (the variance is the expectation of the square of the data with its mean removed).
Expanding, this is E[u^T (x − Ex)(x − Ex)^T u] = u^T C u, where C is the covariance or 'correlation' matrix. The u that gives the maximum value of u^T C u (subject to the constraint that u is a unit vector) is the eigenvector of C with the largest eigenvalue. The second and subsequent principal component axes are the other eigenvectors, sorted by decreasing eigenvalue.
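As a sanity check on this first derivation, here is a small numpy sketch (not part of the original note; the data are synthetic) comparing the variance u^T C u along the top eigenvector of C with the variance along random unit directions:

    # Sketch: the unit vector maximizing u^T C u is the top eigenvector of C.
    import numpy as np

    rng = np.random.default_rng(0)
    X = rng.standard_normal((1000, 5)) @ rng.standard_normal((5, 5))  # synthetic correlated data, rows = samples
    Xc = X - X.mean(axis=0)                      # remove the mean
    C = Xc.T @ Xc / len(Xc)                      # covariance / 'correlation' matrix

    evals, evecs = np.linalg.eigh(C)             # symmetric eigendecomposition, eigenvalues ascending
    u_top = evecs[:, -1]                         # eigenvector with the largest eigenvalue

    var_top = u_top @ C @ u_top                  # variance along the top eigenvector
    for _ in range(5):
        u = rng.standard_normal(5)
        u /= np.linalg.norm(u)                   # a random unit vector
        assert u @ C @ u <= var_top + 1e-12      # no unit direction beats the top eigenvector
    print(var_top, evals[-1])                    # the maximum variance is the largest eigenvalue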
Find PCA basis vectors u_k that minimize E||x − x̂||² for a partial expansion out to P components:

    x̂ = Σ_{k=1}^{P} (x · u_k) u_k

    x − x̂ = Σ_{k=P+1}^{N} (x · u_k) u_k

For orthonormal u_k the cross terms vanish, so E||x − x̂||² = Σ_{k=P+1}^{N} E[(x · u_k)²] = Σ_{k=P+1}^{N} u_k^T C u_k. This is minimized by assigning the discarded directions u_{P+1}, ..., u_N to the eigenvectors of C with the smallest eigenvalues; equivalently, the retained basis vectors are the P eigenvectors with the largest eigenvalues.
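A minimal numpy sketch of this second derivation (again synthetic data, not from the note): reconstruct x from its first P eigenvector components and check that the mean squared residual equals the sum of the discarded eigenvalues:

    import numpy as np

    rng = np.random.default_rng(1)
    N, P = 6, 3                                  # dimension N, keep P components
    X = rng.standard_normal((2000, N)) @ rng.standard_normal((N, N))
    Xc = X - X.mean(axis=0)
    C = Xc.T @ Xc / len(Xc)

    evals, U = np.linalg.eigh(C)
    order = np.argsort(evals)[::-1]              # sort eigenvectors by decreasing eigenvalue
    evals, U = evals[order], U[:, order]

    coeffs = Xc @ U[:, :P]                       # (x . u_k) for k = 1..P
    Xhat = coeffs @ U[:, :P].T                   # xhat = sum_k (x . u_k) u_k
    mse = np.mean(np.sum((Xc - Xhat) ** 2, axis=1))
    print(mse, evals[P:].sum())                  # E||x - xhat||^2 equals the sum of discarded eigenvalues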
The correlation matrix of some data: C = E[xx^T]. The correlation matrix of the data x transformed by some transform T:
C′ = E[(T x)(T x)^T] = E[T xx^T T^T] = T C T^T. The inner xx^T is the correlation matrix of the original data. Now suppose that the rows of T are chosen to be the eigenvectors of this correlation matrix: then, because C u_k = λ_k u_k and the eigenvectors are orthonormal, the resulting matrix C′ is diagonal, with the eigenvalues on the diagonal. Thus the transformed data are uncorrelated (their correlation matrix C′ has no off-diagonal terms). So the basis that diagonalizes the correlation matrix consists of the eigenvectors of the (original) correlation matrix.
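The diagonalization argument is also easy to check numerically; the sketch below (synthetic data, illustrative only) builds T from the eigenvectors of C and verifies that C′ = T C T^T has negligible off-diagonal entries:

    import numpy as np

    rng = np.random.default_rng(2)
    X = rng.standard_normal((5000, 4)) @ rng.standard_normal((4, 4))
    Xc = X - X.mean(axis=0)
    C = Xc.T @ Xc / len(Xc)

    _, evecs = np.linalg.eigh(C)
    T = evecs.T                                  # rows of T are the eigenvectors of C

    Cprime = T @ C @ T.T                         # correlation matrix of the transformed data T x
    off_diag = Cprime - np.diag(np.diag(Cprime))
    print(np.max(np.abs(off_diag)))              # ~0: C' is diagonal, the transformed data are uncorrelated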
Correlation matrices
If the x_k are sliding windows through a signal, i.e. x_0 contains samples 0..10, x_1 samples 1..11, etc., then averaging x_k x_k^T over k corresponds to estimating the autocovariance of the signal. If the x_k are images scanned into vectors, summing x_k x_k^T and dividing by N gives, in entry (i, j), the average correlation of pixel i with pixel j.
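A rough numpy illustration of the sliding-window case (the signal and the window length are invented for the example): average the outer products x_k x_k^T and compare the diagonals of the result with a direct autocovariance estimate:

    import numpy as np

    rng = np.random.default_rng(3)
    signal = np.convolve(rng.standard_normal(5000), np.ones(5) / 5, mode="same")  # a correlated test signal
    signal -= signal.mean()
    W = 11                                        # window length: samples k..k+10

    windows = np.lib.stride_tricks.sliding_window_view(signal, W)  # row k is x_k
    C = windows.T @ windows / len(windows)        # average of x_k x_k^T

    # each diagonal of C estimates the autocovariance at the corresponding lag
    for lag in range(3):
        direct = np.mean(signal[:-lag or None] * signal[lag:])
        print(lag, np.mean(np.diag(C, k=lag)), direct)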
The (i, j) entry of M^T M is the dot product of data vector i with data vector j. If a column of M contains the various measurements for a particular person, then (M^T M)_{i,j} gives the correlation, averaged across tests, of person i with person j, while (M M^T)_{i,j} gives the correlation, averaged across people, of test i versus test j.
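A small sketch of the M^T M versus M M^T distinction, using a hypothetical tests-by-people matrix (names and sizes are made up):

    import numpy as np

    rng = np.random.default_rng(4)
    n_tests, n_people = 8, 3
    M = rng.standard_normal((n_tests, n_people))   # column j holds the measurements for person j

    person_corr = M.T @ M / n_tests                # averaged across tests: person vs. person
    test_corr = M @ M.T / n_people                 # averaged across people: test vs. test

    assert np.isclose(person_corr[0, 1] * n_tests, M[:, 0] @ M[:, 1])  # entry (0,1) is a dot of two columns
    print(person_corr.shape, test_corr.shape)      # (3, 3) and (8, 8)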
SVD decomposes a possibly non-square matrix M as M = U S V^T, where U and V are square rotation-like (orthogonal) matrices and S is a diagonal matrix of singular values. The columns of U are the eigenvectors of M M^T, and the columns of V are the eigenvectors of M^T M.
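This is easy to verify numerically; the sketch below (a random non-square M) checks both eigenvector claims, with the squared singular values appearing as the eigenvalues:

    import numpy as np

    rng = np.random.default_rng(5)
    M = rng.standard_normal((6, 4))                   # a non-square matrix

    U, s, Vt = np.linalg.svd(M, full_matrices=False)  # M = U diag(s) V^T
    V = Vt.T

    print(np.allclose(M @ M.T @ U, U * s**2))         # (M M^T) u_i = s_i^2 u_i
    print(np.allclose(M.T @ M @ V, V * s**2))         # (M^T M) v_i = s_i^2 v_i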
Computation Trick
If we are computing PCA on images, M will be (e.g.) a million by N (N images), and M M^T will be a million by a million. Instead, first find the eigenvectors of the much smaller N × N matrix M^T M: M^T M x = λx. Then premultiply by M and regroup as (M M^T)(M x) = λ(M x); i.e., M x is an eigenvector of the desired system, now given as a linear combination of the original data vectors with weights that are the components of the eigenvector of the smaller system.
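A sketch of the trick (with a small D standing in for the million-pixel case, so the direct D × D eigendecomposition stays feasible for comparison; sizes and data are arbitrary):

    import numpy as np

    rng = np.random.default_rng(6)
    D, N = 500, 10                                  # D "pixels" per image, N images
    M = rng.standard_normal((D, N))                 # columns are the (already centered) image vectors

    lam, x = np.linalg.eigh(M.T @ M)                # small N x N eigenproblem: (M^T M) x = lam x
    U = M @ x                                       # premultiply by M: (M M^T)(M x) = lam (M x)
    U /= np.linalg.norm(U, axis=0)                  # normalize each column M x to unit length

    print(np.allclose(M @ (M.T @ U), U * lam))      # the columns of U are eigenvectors of M M^T
    lam_big = np.linalg.eigvalsh(M @ M.T)           # direct D x D computation, only feasible because D is small here
    print(np.allclose(lam, lam_big[-N:]))           # the nonzero eigenvalues agree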