SlideShare a Scribd company logo
PRINCIPAL COMPONENT ANALYSIS
Partha Sarathi Kar
IVSM 166777
1
CONTENTS
• WHAT IS PCA
• HOW IT WORKS
• HISTORY OF PCA
• PCA IMPLEMENTATION
• USES of PCA
• LIMITATION OF PCA
2/28
WHAT IS PCA
PCA takes a dataset with a
lots of dimension (i.e. Lots
of Cells) and flattens it to
2 or 3 dimensions so we
can look on it.
3/28
Principal component analysis (PCA) is a technique used to
emphasize variation and bring out strong patterns in a dataset.
It's often used to make data easy to explore and visualize.
HOW IT WORKS
4/28
Eating in the UK (a 17D example) Here's the plot of the data along the first
principal component. Already we can see
something is different about Northern
Ireland.
Northern Irish eat way more grams of fresh
potatoes and way fewer of fresh fruits, cheese,
fish and alcoholic drinks
HISTORY OF PCA
• PCA was invented in 1901
by Karl Pearson
• as an analogue of the principal
axis theorem in mechanics
5/28
src: https://commons.wikimedia.org/wiki/File:GaussianScatterPCA.svg
HISTORY OF PCA
Depending on the field of application, it is also
named:
6/28
• discrete Kosambi-Karhunen–Loève transform (KLT) in signal
processing,
• the Hotelling transform in multivariate quality control,
• proper orthogonal decomposition (POD) in mechanical engineering,
• singular value decomposition (SVD) of X (Golub and Van Loan, 1983),
• eigenvalue decomposition (EVD) of XTX in linear algebra,
• Eckart–Young theorem (Harman, 1960), or Schmidt–Mirsky theorem in
psychometrics,
• empirical orthogonal functions (EOF) in meteorological science,
• empirical eigenfunction decomposition (Sirovich, 1987) etc
PCA IMPLEMENTATION
PCA could have different implementations.
But most popular ones are
• eigenvalue decomposition (EVD) and
• singular value decomposition (SVD).
7/28
PCA IMPLEMENTATION
Eigenvalue decomposition
(EVD)
8/28
NxM > NxK (K<=M)
X = original data matrix
W and D new Matrix from
W contains all principal
component vectors, while D
contains all ranks of those
vectors (ordered from the largest
variance to the least one
XXTW=WD
PCA IMPLEMENTATION
Singular Value decomposition
(SVD)
9/28
NxM > NxK (K<=M)
X = original data matrix
X=UΣVT
XV=UΣ
PCA IMPLEMENTATION
10/28
PCA IMPLEMENTATION
11/28
EXAMPLE
PCA IMPLEMENTATION
12/28
EXAMPLE
PCA IMPLEMENTATION
13/28
EXAMPLE
PCA IMPLEMENTATION
14/28
EXAMPLE
Dot are spread out along a diagonal line and
maximum variation of data is between the two
end points of line
PCA IMPLEMENTATION
15/28
EXAMPLE
Dot are spread out along a
diagonal line and maximum
variation of data is between the
two end points of line
Dots are also spread out a little above
and below the first line and 2nd largest
amount of variation is at the endpoints
of the new line
PCA IMPLEMENTATION
16/28
EXAMPLE
PCA IMPLEMENTATION
17/28
EXAMPLE
These two new axes that describe the variation in the data
are “Principal Components”
PCA IMPLEMENTATION
18/28
EXAMPLE
PCA IMPLEMENTATION
19/28
EXAMPLE
PCA IMPLEMENTATION
20/28
EXAMPLE
PCA IMPLEMENTATION
21/28
EXAMPLE
PCA IMPLEMENTATION
22/28
EXAMPLE
PCA IMPLEMENTATION
23/28
EXAMPLE
PCA IMPLEMENTATION
24/28
EXAMPLE
PCA IMPLEMENTATION
25/28
EXAMPLE
USES OF PCA
26/28
PCA is mostly used as a tool for Compression and Simplifying
data for easier learning in exploratory data analysis and for
making predictive models.
1- Better Perspective and less Complexity
2 - Better visualization
3- Reduce size
4- Different perspective:
LIMITATION OF PCA
27/28
If the data does not
follow a
multidimensional
normal (Gaussian)
distribution, PCA
may not give the
best principal
components
REFERENCES
28/28
Figure: pixel representation
Information and Image Credit :
• http://www.mit.edu/~gari/teaching/6.555/LECTURE_NOTES/ch28_bss.pdf
• https://www.quora.com/What-are-some-of-the-limitations-of-principal-component-analysis
• http://mengnote.blogspot.com.ee/2013/05/an-intuitive-explanation-of-pca.html
• https://www.youtube.com/watch?v=_UVHneBUBW0&t=2s
• http://setosa.io/ev/principal-component-analysis/
29
THANKS

More Related Content

PPTX
Principal component analysis
PPTX
Pca(principal components analysis)
PPTX
PDF
PCA (Principal component analysis)
PDF
Principal Component Analysis
PPTX
Lect4 principal component analysis-I
PPTX
Principal Component Analysis (PCA) and LDA PPT Slides
ODP
Introduction to Principle Component Analysis
Principal component analysis
Pca(principal components analysis)
PCA (Principal component analysis)
Principal Component Analysis
Lect4 principal component analysis-I
Principal Component Analysis (PCA) and LDA PPT Slides
Introduction to Principle Component Analysis

What's hot (20)

PPTX
PPTX
Lect5 principal component analysis
PPTX
Principal Component Analysis PCA
PPTX
Introduction to principal component analysis (pca)
PPSX
PPTX
Introduction to XGboost
PPT
PPT
Pricipal Component Analysis Using R
PPTX
Unsupervised learning clustering
PPTX
Introduction to Linear Discriminant Analysis
PPTX
Introduction to Machine Learning
PDF
Principal component analysis and lda
PDF
Handling Imbalanced Data: SMOTE vs. Random Undersampling
PDF
Decision tree
PDF
Introduction to XGBoost
PDF
Exploratory data analysis data visualization
PDF
Data Science - Part III - EDA & Model Selection
PPTX
Exploratory Data Analysis
PPTX
Exploratory data analysis
Lect5 principal component analysis
Principal Component Analysis PCA
Introduction to principal component analysis (pca)
Introduction to XGboost
Pricipal Component Analysis Using R
Unsupervised learning clustering
Introduction to Linear Discriminant Analysis
Introduction to Machine Learning
Principal component analysis and lda
Handling Imbalanced Data: SMOTE vs. Random Undersampling
Decision tree
Introduction to XGBoost
Exploratory data analysis data visualization
Data Science - Part III - EDA & Model Selection
Exploratory Data Analysis
Exploratory data analysis
Ad

Similar to Principal component analysis (20)

PPTX
Principal component analysis in machine L
PDF
overviewPCA
PDF
Mathematical Introduction to Principal Components Analysis
PDF
pca.pdf polymer nanoparticles and sensors
PPTX
Principal Component Analysis in Machine learning.pptx
PPTX
Dimensionality Reduction and feature extraction.pptx
PPTX
Feature selection using PCA.pptx
PPTX
pcappt-140121072949-phpapp01.pptx
PPTX
Principal component analysis.pptx
PPTX
Principal component analysis.pptx
PPTX
PCA and SVD in brief
PDF
Principal Component Analysis in Machine Learning.pdf
PDF
Pca slides- sanket shetye
PDF
Pca ankita dubey
PPTX
PCA-LDA-Lobo.pptxttvertyuytreiopkjhgftfv
PDF
5 DimensionalityReduction.pdf
PPTX
Principal Component Analysis (PCA).pptx
PPTX
Principal Component Analysis(PCA) lecture
PPTX
Easy_PCA_Presentation multivariate .pptx
Principal component analysis in machine L
overviewPCA
Mathematical Introduction to Principal Components Analysis
pca.pdf polymer nanoparticles and sensors
Principal Component Analysis in Machine learning.pptx
Dimensionality Reduction and feature extraction.pptx
Feature selection using PCA.pptx
pcappt-140121072949-phpapp01.pptx
Principal component analysis.pptx
Principal component analysis.pptx
PCA and SVD in brief
Principal Component Analysis in Machine Learning.pdf
Pca slides- sanket shetye
Pca ankita dubey
PCA-LDA-Lobo.pptxttvertyuytreiopkjhgftfv
5 DimensionalityReduction.pdf
Principal Component Analysis (PCA).pptx
Principal Component Analysis(PCA) lecture
Easy_PCA_Presentation multivariate .pptx
Ad

Recently uploaded (20)

PPTX
Welding lecture in detail for understanding
PPTX
Geodesy 1.pptx...............................................
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PPTX
Lesson 3_Tessellation.pptx finite Mathematics
PPTX
Strings in CPP - Strings in C++ are sequences of characters used to store and...
PPTX
web development for engineering and engineering
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PDF
Well-logging-methods_new................
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PDF
Structs to JSON How Go Powers REST APIs.pdf
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PDF
PPT on Performance Review to get promotions
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PPTX
CH1 Production IntroductoryConcepts.pptx
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PPTX
Internet of Things (IOT) - A guide to understanding
PPT
Project quality management in manufacturing
Welding lecture in detail for understanding
Geodesy 1.pptx...............................................
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
Lesson 3_Tessellation.pptx finite Mathematics
Strings in CPP - Strings in C++ are sequences of characters used to store and...
web development for engineering and engineering
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
Well-logging-methods_new................
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
Structs to JSON How Go Powers REST APIs.pdf
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PPT on Performance Review to get promotions
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
CH1 Production IntroductoryConcepts.pptx
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
Internet of Things (IOT) - A guide to understanding
Project quality management in manufacturing

Principal component analysis