SlideShare a Scribd company logo
2
Most read
5
Most read
6
Most read
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors by Yan Ke and Rahul Sukthankar Presentation by Guy Tannenbaum
Introduction Local descriptors – computed efficiently, resistant to partial occlusion and changes in viewpoint. 2 independent aspects: Finding keypoints (in position and scale) Building a descriptor PCA-SIFT is a modification of SIFT, which changes how the keypoint descriptors are constructed.
Quick review of SIFT 2 parts of the algorithm: Finding keypoints Scale-space peak selection Keypoint localization Keypoint descriptor image gradients in local neighborhood of keypoint 4x4 array of histograms, each with 8 orientation bins (128 element vector)
PCA-SIFT: Basic idea Use PCA to efficiently represent the gradient patch around the keypoint.
PCA-SIFT: computing a projection matrix Select a representative set of pictures and detect all keypoints in these pictures For each keypoint: Extract an image patch around it with size 41 x 41 pixels Calculate horizontal and vertical gradients, resulting in a vector of size 39 x 39 x 2 = 3042 Put all these vectors into a k x 3042 matrix A where k is the number of keypoints detected Calculate the covariance matrix of A
PCA-SIFT: computing a projection matrix Compute the eigenvectors and eigenvalues of covA Select the first n eigenvectors; the projection matrix is a n x 3042 matrix composed of these eigenvectors n can either be a fixed value determined empirically or set dynamically based on the eigenvalues The projection matrix is only computed once and saved
Dimension reduction through PCA The image patches do not span the entire space of pixel values, and also not the smaller space of patches from natural images. They consist of the highly restricted set of patches that passed the first 3 stages of SIFT.
Constructing PCA-SIFT descriptor  Input: location of keypoint, scale, orientation. Extract a 41 x 41 patch around the keypoint at the given scale, rotated to its orientation Calculate 39 x 39 horizontal and vertical gradients, resulting in a vector of size 3042 Multiply this vector using the precomputed  n x 3042 projection matrix This results in a PCA-SIFT descriptor of size n
Results - Methodology Experimental Setup: Datasets contain images of some object, under different (synthetic or real) viewing conditions. Keypoints of all images in data set are found.  All pairs of keypoint descriptors from different images are examined, those with Euclidean distance smaller than a threshold are considered a match.
Results - Methodology Evaluation Metric: Recall vs. 1-percision graph Recall = #correct-positives / #total-positives 1-precision = #false-positives / #total-matches ROC graphs plot positive detection rate vs. false detection rate Positive detection rate = #correct-positives / #total-positives False detection rate = #false-positives / #total-negatives Recall vs. 1-percision graphs are better suited than ROC graphs to evaluate performance on detection tasks because the number of negatives in the data set is not well defined.
Results – Controlled transformation
Results 2 – Grafitti dataset Low recall rate at high precision is acceptable for real-world applications. Recall of 5% at 1-percision of 20% - about 1000 keypoints in image, of which 50 are reliable matches. Sufficient for applications like image retrival.
Results3 – running time
Eigenspace construction PCA-SIFT’s performance is not sensitive to the images used in the creation of the eigenspace.
Effect of PCA dimension Optimal performance at n=36 Hypothesis – First several components of the PCA subspace are sufficient for encoding variations caused by keypoint identity, while the later components represent details that are not useful, of potentially detrimental, such as distortion from projective wrap.
Summary PCA-SIFT is an alternate representation for local image descriptors of the SIFT algorithm. More distinctive, and more compact leading to improvements in accuracy and running time.
Credits Based on the article PCA-SIFT: A More Distinctive Representation for Local Image Descriptors by Yan Ke and Rahul Sukthankar. www.danet.dk/sensor_fusion/SIFT  features.ppt http://campar.in.tum.de/twiki/pub/Chair/TeachingOberSeminar/Slides_AndreasHaug.pdf

More Related Content

PDF
Deep learning for person re-identification
PDF
Lecture 4 Relationship between pixels
PPTX
Model Selection Techniques
PPTX
Edge Detection using Hough Transform
PDF
Unit 3 Arithmetic Coding
PPT
Chapter10 image segmentation
PPTX
Data Mining: Mining ,associations, and correlations
PDF
digital image processing, image processing
Deep learning for person re-identification
Lecture 4 Relationship between pixels
Model Selection Techniques
Edge Detection using Hough Transform
Unit 3 Arithmetic Coding
Chapter10 image segmentation
Data Mining: Mining ,associations, and correlations
digital image processing, image processing

What's hot (20)

PPTX
Color image processing
PPT
image enhancement
PPT
digital image processing
PDF
A Brief History of Object Detection / Tommi Kerola
PPTX
Face recognition using neural network
PPTX
Fuzzy Clustering(C-means, K-means)
PPTX
Gray level transformation
PPTX
Image segmentation in Digital Image Processing
PPTX
Chain code in dip
PDF
Machine Learning - Object Detection and Classification
PPT
Enhancement in spatial domain
PDF
Transfer Learning and Domain Adaptation (D2L3 2017 UPC Deep Learning for Comp...
PPTX
digital image processing
PPTX
Chapter 9 morphological image processing
PPT
Huffman Coding
PDF
Object Detection Using R-CNN Deep Learning Framework
PPTX
Fourier descriptors & moments
PPT
morphological image processing
PPT
Image Texture Analysis
PDF
Representational Continuity for Unsupervised Continual Learning
Color image processing
image enhancement
digital image processing
A Brief History of Object Detection / Tommi Kerola
Face recognition using neural network
Fuzzy Clustering(C-means, K-means)
Gray level transformation
Image segmentation in Digital Image Processing
Chain code in dip
Machine Learning - Object Detection and Classification
Enhancement in spatial domain
Transfer Learning and Domain Adaptation (D2L3 2017 UPC Deep Learning for Comp...
digital image processing
Chapter 9 morphological image processing
Huffman Coding
Object Detection Using R-CNN Deep Learning Framework
Fourier descriptors & moments
morphological image processing
Image Texture Analysis
Representational Continuity for Unsupervised Continual Learning
Ad

Viewers also liked (10)

PPTX
Scale Invariant Feature Transform
PPT
Sift Happens John Pearce June 27 Timed
PPTX
Smartphone apps
PPTX
Face Recognition
PDF
"Introduction to Feature Descriptors in Vision: From Haar to SIFT," A Present...
PPT
Michal Erel's SIFT presentation
PDF
Deep Learning for Computer Vision: Image Retrieval (UPC 2016)
PPSX
Face recognition technology - BEST PPT
PPT
Face recognition ppt
PPT
PPT - Powerful Presentation Techniques
Scale Invariant Feature Transform
Sift Happens John Pearce June 27 Timed
Smartphone apps
Face Recognition
"Introduction to Feature Descriptors in Vision: From Haar to SIFT," A Present...
Michal Erel's SIFT presentation
Deep Learning for Computer Vision: Image Retrieval (UPC 2016)
Face recognition technology - BEST PPT
Face recognition ppt
PPT - Powerful Presentation Techniques
Ad

Similar to PCA-SIFT: A More Distinctive Representation for Local Image Descriptors (20)

PDF
Video Stitching using Improved RANSAC and SIFT
PPTX
Dimensionality Reduction and feature extraction.pptx
PDF
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
PPT
Understandig PCA and LDA
PDF
Ay33292297
PDF
Ay33292297
PDF
Recognition of Handwritten Mathematical Equations
PPTX
All projects
PPTX
Implement principal component analysis (PCA) in python from scratch
PDF
Salient KeypointSelection for Object Representation
PDF
Kernel Descriptors for Visual Recognition
PDF
Lec07 aggregation-and-retrieval-system
PDF
Comparison of Various RCNN techniques for Classification of Object from Image
PDF
Literature Survey on Interest Points based Watermarking
PDF
EE660_Report_YaxinLiu_8448347171
PDF
4-RSSI-Spectral Domain Image Transforms_1.pdf
PDF
Introducing New Parameters to Compare the Accuracy and Reliability of Mean-Sh...
PPT
Feature Matching using SIFT algorithm
PDF
Large Scale Image Retrieval 2022.pdf
PDF
An automatic algorithm for object recognition and detection based on asift ke...
Video Stitching using Improved RANSAC and SIFT
Dimensionality Reduction and feature extraction.pptx
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
Understandig PCA and LDA
Ay33292297
Ay33292297
Recognition of Handwritten Mathematical Equations
All projects
Implement principal component analysis (PCA) in python from scratch
Salient KeypointSelection for Object Representation
Kernel Descriptors for Visual Recognition
Lec07 aggregation-and-retrieval-system
Comparison of Various RCNN techniques for Classification of Object from Image
Literature Survey on Interest Points based Watermarking
EE660_Report_YaxinLiu_8448347171
4-RSSI-Spectral Domain Image Transforms_1.pdf
Introducing New Parameters to Compare the Accuracy and Reliability of Mean-Sh...
Feature Matching using SIFT algorithm
Large Scale Image Retrieval 2022.pdf
An automatic algorithm for object recognition and detection based on asift ke...

More from wolf (12)

PPT
Eigenfaces and Fisherfaces
PPT
Shai Avidan's Support vector tracking and ensemble tracking
PPT
Constellation Models and Unsupervised Learning for Object Class Recognition
PPT
A bayesian framework for unsupervised one-shot learning of object categories
PPT
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Fe...
PPT
Recovering 3D human body configurations using shape contexts
PPT
Rafi Zachut's slides on class specific segmentation
PPT
Avihu Efrat's Viola and Jones face detection slides
PPT
Ala Stolpnik's Standard Model talk
PPT
Gil Shapira's Active Appearance Model slides
PPT
Moshe Guttmann's slides on eigenface
PPT
Object recognition seminar S2006E01
Eigenfaces and Fisherfaces
Shai Avidan's Support vector tracking and ensemble tracking
Constellation Models and Unsupervised Learning for Object Class Recognition
A bayesian framework for unsupervised one-shot learning of object categories
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Fe...
Recovering 3D human body configurations using shape contexts
Rafi Zachut's slides on class specific segmentation
Avihu Efrat's Viola and Jones face detection slides
Ala Stolpnik's Standard Model talk
Gil Shapira's Active Appearance Model slides
Moshe Guttmann's slides on eigenface
Object recognition seminar S2006E01

Recently uploaded (20)

PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
NewMind AI Monthly Chronicles - July 2025
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPT
Teaching material agriculture food technology
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
KodekX | Application Modernization Development
PDF
Encapsulation theory and applications.pdf
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Empathic Computing: Creating Shared Understanding
PPTX
Big Data Technologies - Introduction.pptx
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
Building Integrated photovoltaic BIPV_UPV.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
NewMind AI Monthly Chronicles - July 2025
“AI and Expert System Decision Support & Business Intelligence Systems”
Spectral efficient network and resource selection model in 5G networks
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
NewMind AI Weekly Chronicles - August'25 Week I
Teaching material agriculture food technology
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
20250228 LYD VKU AI Blended-Learning.pptx
Understanding_Digital_Forensics_Presentation.pptx
KodekX | Application Modernization Development
Encapsulation theory and applications.pdf
Diabetes mellitus diagnosis method based random forest with bat algorithm
Empathic Computing: Creating Shared Understanding
Big Data Technologies - Introduction.pptx
Digital-Transformation-Roadmap-for-Companies.pptx

PCA-SIFT: A More Distinctive Representation for Local Image Descriptors

  • 1. PCA-SIFT: A More Distinctive Representation for Local Image Descriptors by Yan Ke and Rahul Sukthankar Presentation by Guy Tannenbaum
  • 2. Introduction Local descriptors – computed efficiently, resistant to partial occlusion and changes in viewpoint. 2 independent aspects: Finding keypoints (in position and scale) Building a descriptor PCA-SIFT is a modification of SIFT, which changes how the keypoint descriptors are constructed.
  • 3. Quick review of SIFT 2 parts of the algorithm: Finding keypoints Scale-space peak selection Keypoint localization Keypoint descriptor image gradients in local neighborhood of keypoint 4x4 array of histograms, each with 8 orientation bins (128 element vector)
  • 4. PCA-SIFT: Basic idea Use PCA to efficiently represent the gradient patch around the keypoint.
  • 5. PCA-SIFT: computing a projection matrix Select a representative set of pictures and detect all keypoints in these pictures For each keypoint: Extract an image patch around it with size 41 x 41 pixels Calculate horizontal and vertical gradients, resulting in a vector of size 39 x 39 x 2 = 3042 Put all these vectors into a k x 3042 matrix A where k is the number of keypoints detected Calculate the covariance matrix of A
  • 6. PCA-SIFT: computing a projection matrix Compute the eigenvectors and eigenvalues of covA Select the first n eigenvectors; the projection matrix is a n x 3042 matrix composed of these eigenvectors n can either be a fixed value determined empirically or set dynamically based on the eigenvalues The projection matrix is only computed once and saved
  • 7. Dimension reduction through PCA The image patches do not span the entire space of pixel values, and also not the smaller space of patches from natural images. They consist of the highly restricted set of patches that passed the first 3 stages of SIFT.
  • 8. Constructing PCA-SIFT descriptor Input: location of keypoint, scale, orientation. Extract a 41 x 41 patch around the keypoint at the given scale, rotated to its orientation Calculate 39 x 39 horizontal and vertical gradients, resulting in a vector of size 3042 Multiply this vector using the precomputed n x 3042 projection matrix This results in a PCA-SIFT descriptor of size n
  • 9. Results - Methodology Experimental Setup: Datasets contain images of some object, under different (synthetic or real) viewing conditions. Keypoints of all images in data set are found. All pairs of keypoint descriptors from different images are examined, those with Euclidean distance smaller than a threshold are considered a match.
  • 10. Results - Methodology Evaluation Metric: Recall vs. 1-percision graph Recall = #correct-positives / #total-positives 1-precision = #false-positives / #total-matches ROC graphs plot positive detection rate vs. false detection rate Positive detection rate = #correct-positives / #total-positives False detection rate = #false-positives / #total-negatives Recall vs. 1-percision graphs are better suited than ROC graphs to evaluate performance on detection tasks because the number of negatives in the data set is not well defined.
  • 11. Results – Controlled transformation
  • 12. Results 2 – Grafitti dataset Low recall rate at high precision is acceptable for real-world applications. Recall of 5% at 1-percision of 20% - about 1000 keypoints in image, of which 50 are reliable matches. Sufficient for applications like image retrival.
  • 14. Eigenspace construction PCA-SIFT’s performance is not sensitive to the images used in the creation of the eigenspace.
  • 15. Effect of PCA dimension Optimal performance at n=36 Hypothesis – First several components of the PCA subspace are sufficient for encoding variations caused by keypoint identity, while the later components represent details that are not useful, of potentially detrimental, such as distortion from projective wrap.
  • 16. Summary PCA-SIFT is an alternate representation for local image descriptors of the SIFT algorithm. More distinctive, and more compact leading to improvements in accuracy and running time.
  • 17. Credits Based on the article PCA-SIFT: A More Distinctive Representation for Local Image Descriptors by Yan Ke and Rahul Sukthankar. www.danet.dk/sensor_fusion/SIFT features.ppt http://campar.in.tum.de/twiki/pub/Chair/TeachingOberSeminar/Slides_AndreasHaug.pdf