SlideShare a Scribd company logo
2
Most read
9
Most read
22
Most read
Principal Component Analysis
(PCA)
PCA - Overview
• A backbone of modern data analysis.
• A black box that is widely used but poorly understood.
• It is a mathematical tool from applied linear algebra.
• It is a simple, non-parametric method of extracting relevant
information from confusing data sets.
• It provides a roadmap for how to reduce a complex data set to a
lower dimension.
• PCA used to reduce dimensions of data without much loss of
information.
• PCA is “an orthogonal linear transformation that transfers the data to a new
coordinate system such that the greatest variance by any projection of the data
comes to lie on the first coordinate (first principal component), the second
greatest variance lies on the second coordinate (second principal component),
and so on.”
• An exploratory technique used to reduce the dimensionality of the data set to
2D or 3D
• Can be used to:
• Reduce number of dimensions in data
• Find patterns in high-dimensional data
• Visualize data of high dimensionality
• Example applications:
• Face recognition, Image compression, Gene expression analysis
PCA - Overview
Background
• Linear Algebra
• Principal Component Analysis (PCA)
• Independent Component Analysis (ICA)
• Linear Discriminant Analysis (LDA)
• Examples
Variance
• Variance – measure of the deviation from the mean for points in one
dimension, e.g., heights
• A measure of the spread of the data in a data set with mean
• Variance is claimed to be the original statistical measure of spread of data.
Covariance
• Covariance – a measure of how much each of the dimensions varies from
the mean with respect to each other.
• Covariance is measured between 2 dimensions to see if there is a
relationship between the 2 dimensions, e.g., number of hours studied
and grade obtained.
• The covariance between one dimension and itself is the variance
Covariance
•Covariance
Covariance calculations are used to find relationships between
dimensions in high dimensonal data sets (usually greater that 3) where
visualization is difficult.
• Suppose we have n attributes, A1, ..., An.
• Covariance matrix:
• Example for three attributes (x,y,z):
Covariance matrix
),cov(where),( ,, jijiji
nn
AAccC 











),cov(),cov(),cov(
),cov(),cov(),cov(
),cov(),cov(),cov(
zzyzxz
zyyyxy
zxyxxx
C




















3705.104
5.1047.47
)var(5.104
5.104)var(
),cov(),cov(
),cov(),cov(
M
H
MMHM
MHHH
Covariance matrix
Linear algebra
Matrix and vector multiplication
Transformation Matrices
Transformation Matrices
Eigenvalue Problem
Eigenvalue Problem
A v Av  v
2 3 3 12 3
x 4x
2 1 2 8 2
       
        
       
• Going back to our example: as A . v = λ . v
• Therefore, (3,2) is an eigenvector of the square matrix A and 4 is an
eigenvalue of A.
• The question is:
Given matrix A, how can we calculate the eigenvector and eigenvalues
for A?
How to calculate Eigenvalues & eigenvectors
Calculating Eigenvectors & Eigenvalues
Calculating Eigenvectors & Eigenvalues
Going through the same procedure for the second eigenvalue:
Calculating Eigenvectors & Eigenvalues
Calculating Eigenvectors & Eigenvalues
Properties of Eigenvectors and Eigenvalues

More Related Content

PPTX
PPTX
Lect5 principal component analysis
ODP
Introduction to Principle Component Analysis
PPTX
Pca(principal components analysis)
PPTX
Principal Component Analysis (PCA) and LDA PPT Slides
PDF
Principal component analysis and lda
PPTX
Implement principal component analysis (PCA) in python from scratch
PPTX
Principal component analysis
Lect5 principal component analysis
Introduction to Principle Component Analysis
Pca(principal components analysis)
Principal Component Analysis (PCA) and LDA PPT Slides
Principal component analysis and lda
Implement principal component analysis (PCA) in python from scratch
Principal component analysis

What's hot (20)

PPTX
Principal component analysis
PDF
Principal Component Analysis
PPTX
Exploratory Data Analysis
PPSX
PDF
Data Science - Part III - EDA & Model Selection
PPTX
Random forest algorithm
PPTX
Random forest
PDF
Pca ppt
PDF
Decision trees in Machine Learning
PPTX
Introduction to Linear Discriminant Analysis
PPTX
Hierarchical clustering.pptx
PPT
2.4 rule based classification
PDF
Data Science - Part V - Decision Trees & Random Forests
PPT
Cluster analysis
PPTX
Exploratory data analysis with Python
PPT
Decision tree
PPTX
PPTX
Introduction to principal component analysis (pca)
PDF
Classification Based Machine Learning Algorithms
PPTX
Decision trees
Principal component analysis
Principal Component Analysis
Exploratory Data Analysis
Data Science - Part III - EDA & Model Selection
Random forest algorithm
Random forest
Pca ppt
Decision trees in Machine Learning
Introduction to Linear Discriminant Analysis
Hierarchical clustering.pptx
2.4 rule based classification
Data Science - Part V - Decision Trees & Random Forests
Cluster analysis
Exploratory data analysis with Python
Decision tree
Introduction to principal component analysis (pca)
Classification Based Machine Learning Algorithms
Decision trees
Ad

Similar to Lect4 principal component analysis-I (20)

PPTX
Dimension Reduction Introduction & PCA.pptx
PPTX
ML-Lec-18-NEW Dimensionality Reduction-PCA (1).pptx
PPTX
11 Principal Component Analysis Computer Graphics.pptx
PPT
Class9_PCA_final.ppt
PPTX
principle component analysis.pptx
PDF
Principal Components Analysis, Calculation and Visualization
PPTX
Principal Component Analysis PCA
PPTX
PCA Final.pptx
PPTX
Dimensionality Reduction and feature extraction.pptx
PPT
The following ppt is about principal component analysis
PPT
PDF
Covariance.pdf
PPT
Principal Component Analysis (PCA):How to conduct PCA
PPT
Lecture 12 Principal Component Analysis in Machine Learning.ppt
PPT
pca in machine learning pca in machine learning pca in machine learning pca i...
PPT
Lecture1_jps.ppt
PPT
Lecture1_jps (1).ppt
PPTX
Sess03 Dimension Reduction Methods.pptx
PDF
Dimension Reduction Introduction & PCA.pptx
ML-Lec-18-NEW Dimensionality Reduction-PCA (1).pptx
11 Principal Component Analysis Computer Graphics.pptx
Class9_PCA_final.ppt
principle component analysis.pptx
Principal Components Analysis, Calculation and Visualization
Principal Component Analysis PCA
PCA Final.pptx
Dimensionality Reduction and feature extraction.pptx
The following ppt is about principal component analysis
Covariance.pdf
Principal Component Analysis (PCA):How to conduct PCA
Lecture 12 Principal Component Analysis in Machine Learning.ppt
pca in machine learning pca in machine learning pca in machine learning pca i...
Lecture1_jps.ppt
Lecture1_jps (1).ppt
Sess03 Dimension Reduction Methods.pptx
Ad

More from hktripathy (17)

PPTX
Lect 3 background mathematics
PDF
Lect 2 getting to know your data
PDF
Lect 1 introduction
PDF
Lecture7.1 data sampling
PDF
Lecture5 virtualization
PDF
Lecture6 introduction to data streams
PDF
Lecture4 big data technology foundations
PDF
Lecture3 business intelligence
PDF
Lecture2 big data life cycle
PDF
Lecture1 introduction to big data
PPTX
Lect9 Decision tree
PPTX
Lect8 Classification & prediction
PPTX
Lect7 Association analysis to correlation analysis
PPTX
Lect6 Association rule & Apriori algorithm
PPTX
Lect 3 background mathematics for Data Mining
PPTX
Lect 2 getting to know your data
PPTX
Lect 1 introduction
Lect 3 background mathematics
Lect 2 getting to know your data
Lect 1 introduction
Lecture7.1 data sampling
Lecture5 virtualization
Lecture6 introduction to data streams
Lecture4 big data technology foundations
Lecture3 business intelligence
Lecture2 big data life cycle
Lecture1 introduction to big data
Lect9 Decision tree
Lect8 Classification & prediction
Lect7 Association analysis to correlation analysis
Lect6 Association rule & Apriori algorithm
Lect 3 background mathematics for Data Mining
Lect 2 getting to know your data
Lect 1 introduction

Recently uploaded (20)

PPTX
Cell Types and Its function , kingdom of life
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
VCE English Exam - Section C Student Revision Booklet
PDF
O7-L3 Supply Chain Operations - ICLT Program
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PDF
01-Introduction-to-Information-Management.pdf
PDF
Complications of Minimal Access Surgery at WLH
PPTX
PPH.pptx obstetrics and gynecology in nursing
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
RMMM.pdf make it easy to upload and study
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PDF
Pre independence Education in Inndia.pdf
PPTX
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
PDF
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
Cell Types and Its function , kingdom of life
Supply Chain Operations Speaking Notes -ICLT Program
VCE English Exam - Section C Student Revision Booklet
O7-L3 Supply Chain Operations - ICLT Program
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
STATICS OF THE RIGID BODIES Hibbelers.pdf
01-Introduction-to-Information-Management.pdf
Complications of Minimal Access Surgery at WLH
PPH.pptx obstetrics and gynecology in nursing
Microbial diseases, their pathogenesis and prophylaxis
RMMM.pdf make it easy to upload and study
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Module 4: Burden of Disease Tutorial Slides S2 2025
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
Pre independence Education in Inndia.pdf
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
Renaissance Architecture: A Journey from Faith to Humanism
Abdominal Access Techniques with Prof. Dr. R K Mishra

Lect4 principal component analysis-I

  • 2. PCA - Overview • A backbone of modern data analysis. • A black box that is widely used but poorly understood. • It is a mathematical tool from applied linear algebra. • It is a simple, non-parametric method of extracting relevant information from confusing data sets. • It provides a roadmap for how to reduce a complex data set to a lower dimension. • PCA used to reduce dimensions of data without much loss of information.
  • 3. • PCA is “an orthogonal linear transformation that transfers the data to a new coordinate system such that the greatest variance by any projection of the data comes to lie on the first coordinate (first principal component), the second greatest variance lies on the second coordinate (second principal component), and so on.” • An exploratory technique used to reduce the dimensionality of the data set to 2D or 3D • Can be used to: • Reduce number of dimensions in data • Find patterns in high-dimensional data • Visualize data of high dimensionality • Example applications: • Face recognition, Image compression, Gene expression analysis PCA - Overview
  • 4. Background • Linear Algebra • Principal Component Analysis (PCA) • Independent Component Analysis (ICA) • Linear Discriminant Analysis (LDA) • Examples
  • 5. Variance • Variance – measure of the deviation from the mean for points in one dimension, e.g., heights • A measure of the spread of the data in a data set with mean • Variance is claimed to be the original statistical measure of spread of data.
  • 6. Covariance • Covariance – a measure of how much each of the dimensions varies from the mean with respect to each other. • Covariance is measured between 2 dimensions to see if there is a relationship between the 2 dimensions, e.g., number of hours studied and grade obtained. • The covariance between one dimension and itself is the variance
  • 8. •Covariance Covariance calculations are used to find relationships between dimensions in high dimensonal data sets (usually greater that 3) where visualization is difficult.
  • 9. • Suppose we have n attributes, A1, ..., An. • Covariance matrix: • Example for three attributes (x,y,z): Covariance matrix ),cov(where),( ,, jijiji nn AAccC             ),cov(),cov(),cov( ),cov(),cov(),cov( ),cov(),cov(),cov( zzyzxz zyyyxy zxyxxx C
  • 12. Matrix and vector multiplication
  • 16. Eigenvalue Problem A v Av  v 2 3 3 12 3 x 4x 2 1 2 8 2                          • Going back to our example: as A . v = λ . v • Therefore, (3,2) is an eigenvector of the square matrix A and 4 is an eigenvalue of A. • The question is: Given matrix A, how can we calculate the eigenvector and eigenvalues for A?
  • 17. How to calculate Eigenvalues & eigenvectors
  • 20. Going through the same procedure for the second eigenvalue: Calculating Eigenvectors & Eigenvalues
  • 22. Properties of Eigenvectors and Eigenvalues