SlideShare a Scribd company logo
Real-time 3D Object Pose Estimation and Tracking  for Natural Landmark Based Visual Servo Seung-Min Baek and Sukhan Lee Sungkyunkwan University Intelligent System Research Center Changhyun  Choi Georgia Tech College of Computing
Contents Introduction Motivation Related Works Proposed Approach System Overview Problem Definition Initial Pose Estimation Local Pose Estimation Experimental Results Summary & Conclusion Future Work IEEE/RSJ IROS 2008, Sept 25
Introduction In Visual Servo Control, Object Recognition  Pose Estimation  are key tasks. IEEE/RSJ IROS 2008, Sept 25
Introduction Many systems still use Artificial Landmark . Unnatural  in human environment IEEE/RSJ IROS 2008, Sept 25
Introduction We need  Natural Landmarks . Natural Landmarks are visual features objects inherently have. IEEE/RSJ IROS 2008, Sept 25
Introduction Modern recognition methods SIFT about 200~300 ms on a modern PC Structured light several seconds IEEE/RSJ IROS 2008, Sept 25
Motivation How to apply these state-of-the-art recognition methods to visual servo control? How to overcome the time lag? How to solve the real-time issue? IEEE/RSJ IROS 2008, Sept 25
Related Works Monocular  Model-based Use keyframe information as prior knowledge Use sparse bundle adjustment technique [ L. Vacchetti et al.,  PAMI 04 ] Input image should be close enough to  the prior knowledge! IEEE/RSJ IROS 2008, Sept 25
Related Works Active Contour Local curve fitting algorithm Initialize by SIFT keypoint matching [G. Panin and A. Knoll,  JMM 04 ] Potential danger in background having  same color with tracking object! IEEE/RSJ IROS 2008, Sept 25
Our Idea Use  prior knowledge  (object models) 2D images 3D points  obtained from structured light system Use  scale invariant feature matching  for  accurate  initialization Use  KLT (Kanade-Lucas-Tomasi) tracker  for  fast  local tracking IEEE/RSJ IROS 2008, Sept 25
System Overview Add text IEEE/RSJ IROS 2008, Sept 25
Two Modes Mono Mode Using  mono camera Better computational performance Stereo Mode Using  stereo camera More accurate pose result IEEE/RSJ IROS 2008, Sept 25
Problem Definition –  Mono Mode Given 2D-3D correspondences and a calibrated mono camera, find the pose of the object with respect to the camera. IEEE/RSJ IROS 2008, Sept 25
Problem Definition –  Stereo Mode Given 3D-3D correspondences and a calibrated stereo camera, find the pose of the object with respect to the camera. IEEE/RSJ IROS 2008, Sept 25
Initial Pose Estimation Add text IEEE/RSJ IROS 2008, Sept 25
Initial Pose Estimation Extract SIFT keypoints Matching with model knowledge Estimate initial pose Get a convex hull of a set of matched SIFT keypoints Generate KLT tracking points within the convexhull Calculate 3D coordinates of KLT points IEEE/RSJ IROS 2008, Sept 25
Initial Pose Estimation Mono Mode Use the  POSIT algorithm  ( 2D-3D ) Stereo Mode Use the  closed-form solution using unit quaternions  ( 3D-3D ) R,t R,t IEEE/RSJ IROS 2008, Sept 25
Initial Pose Estimation Extract SIFT keypoints Matching with model knowledge Estimate initial pose Get a convex hull of a set of matched SIFT keypoints Generate KLT tracking points within the convexhull Calculate 3D coordinates of KLT points IEEE/RSJ IROS 2008, Sept 25
Initial Pose Estimation 3D coordinates of each KLT points are required for  subsequent local pose estimation Stereo Mode Straightforward in a calibrated stereo rig Triangulate 3D points Mono Mode Use  approximation  with the knowledge of model Get 3D coordinates by using  three nearest neighboring SIFT points IEEE/RSJ IROS 2008, Sept 25
Initial Pose Estimation +  : SIFT points •   : KLT points IEEE/RSJ IROS 2008, Sept 25
Initial Pose Estimation Treat the surface as  locally flat IEEE/RSJ IROS 2008, Sept 25
Local Pose Estimation Add text IEEE/RSJ IROS 2008, Sept 25
Local Pose Estimation Estimate pose with  KLT tracking points  and  their 3D points Pose estimation algorithms are same Mono Mode Use the  POSIT algorithm  ( 2D-3D ) Stereo Mode Use the  closed-form solution using unit quaternions  ( 3D-3D ) R,t R,t IEEE/RSJ IROS 2008, Sept 25
Removing Outliers IEEE/RSJ IROS 2008, Sept 25
Outlier Handling KLT tracking points are easy to  drift Drifting points result in inaccurate pose Use  RANSAC  to remove outlier Re-initialize  when there are no sufficient # of inliers IEEE/RSJ IROS 2008, Sept 25
Tracking Results IEEE/RSJ IROS 2008, Sept 25
Experiment Mono Mode Stereo Mode IEEE/RSJ IROS 2008, Sept 25
Tracking Results -  translation IEEE/RSJ IROS 2008, Sept 25
Tracking Results -  rotation IEEE/RSJ IROS 2008, Sept 25
RMS Error RMS errors over the whole sequence of image Z IEEE/RSJ IROS 2008, Sept 25
Computational Time Computational times of  pose estimation IEEE/RSJ IROS 2008, Sept 25
Computational Time Computational times of  each module IEEE/RSJ IROS 2008, Sept 25
Summary & Conclusion A method for tracking 3D roto-translation of rigid objects  using  scale invariant feature based matching  KLT (Kanade-Lucas-Tomasi) tracker Mono mode guarantees higher frame rate performance stereo mode shows better pose results IEEE/RSJ IROS 2008, Sept 25
Future Work To decrease the computational burden Use  GPU-based implementation  of KLT tracker and SIFT GPU KLT SiftGPU Unifying the  contour based tracking IEEE/RSJ IROS 2008, Sept 25
Thank you Any Questions? Any Suggestions? Any Comments? IEEE/RSJ IROS 2008, Sept 25

More Related Content

PDF
OpenSees: Future Directions
PDF
OpenSees as an Engine for Web-based Applications
PDF
Modelling structures in fire using OpenSees - An integrated approach
PDF
Efficient analytical and hybrid simulations using OpenSees
PPTX
All projects
PPTX
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 3)
PPTX
Ikuro Sato's slide presented at ICONIP2017
PDF
Thesis
OpenSees: Future Directions
OpenSees as an Engine for Web-based Applications
Modelling structures in fire using OpenSees - An integrated approach
Efficient analytical and hybrid simulations using OpenSees
All projects
SIGGRAPH 2014 Course on Computational Cameras and Displays (part 3)
Ikuro Sato's slide presented at ICONIP2017
Thesis

What's hot (20)

PPTX
A Highly Parallel Semi-Dataflow FPGA Architecture for Large-Scale N-Body Simu...
PDF
S1140183 Presentation
PPTX
Implementation of PD controller in attitude of quadcopter
PDF
Interfacing of MATLAB with Arduino for Object Detection Algorithm Implementat...
PDF
RGB colour detection and tracking on MATLAB
PDF
Scratch to Supercomputers: Bottoms-up Build of Large-scale Computational Lens...
PPT
MSShin-Machine_Learning_Algorithm_in_Period_Estimation.ppt
PDF
Building and road detection from large aerial imagery
PPTX
AMPC Advanced Manufacturing - Theme on a Page - Smart Knives and Hooks
PDF
COUPLED FPGA/ASIC IMPLEMENTATION OF ELLIPTIC CURVE CRYPTO-PROCESSOR
PDF
IRJET- Design the Surveillance Algorithm and Motion Detection of Objects for ...
PDF
Multi-core GPU – Fast parallel SAR image generation
PDF
A genetic algorithm aimed at optimising seismic retrofitting of existing RC f...
PDF
PR-278: RAFT: Recurrent All-Pairs Field Transforms for Optical Flow
PDF
PDF
680report final
PDF
PPT
P1131210137
PDF
2018 IMSM: MIT Lincoln Laboratories Problem Presentation - Light Reflections...
PDF
Towards Exascale Simulations for Regional-Scale Earthquake Hazard and Risk
A Highly Parallel Semi-Dataflow FPGA Architecture for Large-Scale N-Body Simu...
S1140183 Presentation
Implementation of PD controller in attitude of quadcopter
Interfacing of MATLAB with Arduino for Object Detection Algorithm Implementat...
RGB colour detection and tracking on MATLAB
Scratch to Supercomputers: Bottoms-up Build of Large-scale Computational Lens...
MSShin-Machine_Learning_Algorithm_in_Period_Estimation.ppt
Building and road detection from large aerial imagery
AMPC Advanced Manufacturing - Theme on a Page - Smart Knives and Hooks
COUPLED FPGA/ASIC IMPLEMENTATION OF ELLIPTIC CURVE CRYPTO-PROCESSOR
IRJET- Design the Surveillance Algorithm and Motion Detection of Objects for ...
Multi-core GPU – Fast parallel SAR image generation
A genetic algorithm aimed at optimising seismic retrofitting of existing RC f...
PR-278: RAFT: Recurrent All-Pairs Field Transforms for Optical Flow
680report final
P1131210137
2018 IMSM: MIT Lincoln Laboratories Problem Presentation - Light Reflections...
Towards Exascale Simulations for Regional-Scale Earthquake Hazard and Risk
Ad

Similar to IEEE/RSJ IROS 2008 Real-time Tracker (20)

PDF
Real-time 3D Object Pose Estimation and Tracking for Natural Landmark Based V...
PDF
Robot Pose Estimation: A Vertical Stereo Pair Versus a Horizontal One
PDF
"Fundamentals of Monocular SLAM," a Presentation from Cadence
PPTX
20110220 computer vision_eruhimov_lecture01
PDF
sduGroupEvent
PDF
AR/SLAM for end-users
PDF
=iros16tutorial_2.pdf
PDF
Visual Odometry using Stereo Vision
PDF
Dense Image Matching - Challenges and Potentials (Keynote 3D-ARCH 2015)
PPT
Multi-view Stereo and Structure from Motion
PPT
3D Scanning technology of industrial .ppt
PPTX
2015-07-08 Paper 38 - ICVS Talk
PDF
Lecture 01 frank dellaert - 3 d reconstruction and mapping: a factor graph ...
PDF
A ROS IMPLEMENTATION OF THE MONO-SLAM ALGORITHM
PDF
160713
PDF
Deep Learning for Structure-from-Motion (SfM)
PDF
Journey to structure from motion
PDF
Lec14 multiview stereo
PDF
An Assessment of Image Matching Algorithms in Depth Estimation
PDF
Towards Robust and Safe Autonomous Drones
Real-time 3D Object Pose Estimation and Tracking for Natural Landmark Based V...
Robot Pose Estimation: A Vertical Stereo Pair Versus a Horizontal One
"Fundamentals of Monocular SLAM," a Presentation from Cadence
20110220 computer vision_eruhimov_lecture01
sduGroupEvent
AR/SLAM for end-users
=iros16tutorial_2.pdf
Visual Odometry using Stereo Vision
Dense Image Matching - Challenges and Potentials (Keynote 3D-ARCH 2015)
Multi-view Stereo and Structure from Motion
3D Scanning technology of industrial .ppt
2015-07-08 Paper 38 - ICVS Talk
Lecture 01 frank dellaert - 3 d reconstruction and mapping: a factor graph ...
A ROS IMPLEMENTATION OF THE MONO-SLAM ALGORITHM
160713
Deep Learning for Structure-from-Motion (SfM)
Journey to structure from motion
Lec14 multiview stereo
An Assessment of Image Matching Algorithms in Depth Estimation
Towards Robust and Safe Autonomous Drones
Ad

Recently uploaded (20)

PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PPTX
Chapter 5: Probability Theory and Statistics
PDF
Zenith AI: Advanced Artificial Intelligence
PPTX
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
PPTX
OMC Textile Division Presentation 2021.pptx
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PPTX
TLE Review Electricity (Electricity).pptx
PDF
Encapsulation theory and applications.pdf
PPTX
A Presentation on Artificial Intelligence
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PDF
Mushroom cultivation and it's methods.pdf
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
MIND Revenue Release Quarter 2 2025 Press Release
Digital-Transformation-Roadmap-for-Companies.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
Chapter 5: Probability Theory and Statistics
Zenith AI: Advanced Artificial Intelligence
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
OMC Textile Division Presentation 2021.pptx
Group 1 Presentation -Planning and Decision Making .pptx
TLE Review Electricity (Electricity).pptx
Encapsulation theory and applications.pdf
A Presentation on Artificial Intelligence
Unlocking AI with Model Context Protocol (MCP)
1 - Historical Antecedents, Social Consideration.pdf
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
Mushroom cultivation and it's methods.pdf
NewMind AI Weekly Chronicles - August'25-Week II
Hindi spoken digit analysis for native and non-native speakers
Enhancing emotion recognition model for a student engagement use case through...
MIND Revenue Release Quarter 2 2025 Press Release

IEEE/RSJ IROS 2008 Real-time Tracker

  • 1. Real-time 3D Object Pose Estimation and Tracking for Natural Landmark Based Visual Servo Seung-Min Baek and Sukhan Lee Sungkyunkwan University Intelligent System Research Center Changhyun Choi Georgia Tech College of Computing
  • 2. Contents Introduction Motivation Related Works Proposed Approach System Overview Problem Definition Initial Pose Estimation Local Pose Estimation Experimental Results Summary & Conclusion Future Work IEEE/RSJ IROS 2008, Sept 25
  • 3. Introduction In Visual Servo Control, Object Recognition Pose Estimation are key tasks. IEEE/RSJ IROS 2008, Sept 25
  • 4. Introduction Many systems still use Artificial Landmark . Unnatural in human environment IEEE/RSJ IROS 2008, Sept 25
  • 5. Introduction We need Natural Landmarks . Natural Landmarks are visual features objects inherently have. IEEE/RSJ IROS 2008, Sept 25
  • 6. Introduction Modern recognition methods SIFT about 200~300 ms on a modern PC Structured light several seconds IEEE/RSJ IROS 2008, Sept 25
  • 7. Motivation How to apply these state-of-the-art recognition methods to visual servo control? How to overcome the time lag? How to solve the real-time issue? IEEE/RSJ IROS 2008, Sept 25
  • 8. Related Works Monocular Model-based Use keyframe information as prior knowledge Use sparse bundle adjustment technique [ L. Vacchetti et al., PAMI 04 ] Input image should be close enough to the prior knowledge! IEEE/RSJ IROS 2008, Sept 25
  • 9. Related Works Active Contour Local curve fitting algorithm Initialize by SIFT keypoint matching [G. Panin and A. Knoll, JMM 04 ] Potential danger in background having same color with tracking object! IEEE/RSJ IROS 2008, Sept 25
  • 10. Our Idea Use prior knowledge (object models) 2D images 3D points obtained from structured light system Use scale invariant feature matching for accurate initialization Use KLT (Kanade-Lucas-Tomasi) tracker for fast local tracking IEEE/RSJ IROS 2008, Sept 25
  • 11. System Overview Add text IEEE/RSJ IROS 2008, Sept 25
  • 12. Two Modes Mono Mode Using mono camera Better computational performance Stereo Mode Using stereo camera More accurate pose result IEEE/RSJ IROS 2008, Sept 25
  • 13. Problem Definition – Mono Mode Given 2D-3D correspondences and a calibrated mono camera, find the pose of the object with respect to the camera. IEEE/RSJ IROS 2008, Sept 25
  • 14. Problem Definition – Stereo Mode Given 3D-3D correspondences and a calibrated stereo camera, find the pose of the object with respect to the camera. IEEE/RSJ IROS 2008, Sept 25
  • 15. Initial Pose Estimation Add text IEEE/RSJ IROS 2008, Sept 25
  • 16. Initial Pose Estimation Extract SIFT keypoints Matching with model knowledge Estimate initial pose Get a convex hull of a set of matched SIFT keypoints Generate KLT tracking points within the convexhull Calculate 3D coordinates of KLT points IEEE/RSJ IROS 2008, Sept 25
  • 17. Initial Pose Estimation Mono Mode Use the POSIT algorithm ( 2D-3D ) Stereo Mode Use the closed-form solution using unit quaternions ( 3D-3D ) R,t R,t IEEE/RSJ IROS 2008, Sept 25
  • 18. Initial Pose Estimation Extract SIFT keypoints Matching with model knowledge Estimate initial pose Get a convex hull of a set of matched SIFT keypoints Generate KLT tracking points within the convexhull Calculate 3D coordinates of KLT points IEEE/RSJ IROS 2008, Sept 25
  • 19. Initial Pose Estimation 3D coordinates of each KLT points are required for subsequent local pose estimation Stereo Mode Straightforward in a calibrated stereo rig Triangulate 3D points Mono Mode Use approximation with the knowledge of model Get 3D coordinates by using three nearest neighboring SIFT points IEEE/RSJ IROS 2008, Sept 25
  • 20. Initial Pose Estimation + : SIFT points • : KLT points IEEE/RSJ IROS 2008, Sept 25
  • 21. Initial Pose Estimation Treat the surface as locally flat IEEE/RSJ IROS 2008, Sept 25
  • 22. Local Pose Estimation Add text IEEE/RSJ IROS 2008, Sept 25
  • 23. Local Pose Estimation Estimate pose with KLT tracking points and their 3D points Pose estimation algorithms are same Mono Mode Use the POSIT algorithm ( 2D-3D ) Stereo Mode Use the closed-form solution using unit quaternions ( 3D-3D ) R,t R,t IEEE/RSJ IROS 2008, Sept 25
  • 24. Removing Outliers IEEE/RSJ IROS 2008, Sept 25
  • 25. Outlier Handling KLT tracking points are easy to drift Drifting points result in inaccurate pose Use RANSAC to remove outlier Re-initialize when there are no sufficient # of inliers IEEE/RSJ IROS 2008, Sept 25
  • 26. Tracking Results IEEE/RSJ IROS 2008, Sept 25
  • 27. Experiment Mono Mode Stereo Mode IEEE/RSJ IROS 2008, Sept 25
  • 28. Tracking Results - translation IEEE/RSJ IROS 2008, Sept 25
  • 29. Tracking Results - rotation IEEE/RSJ IROS 2008, Sept 25
  • 30. RMS Error RMS errors over the whole sequence of image Z IEEE/RSJ IROS 2008, Sept 25
  • 31. Computational Time Computational times of pose estimation IEEE/RSJ IROS 2008, Sept 25
  • 32. Computational Time Computational times of each module IEEE/RSJ IROS 2008, Sept 25
  • 33. Summary & Conclusion A method for tracking 3D roto-translation of rigid objects using scale invariant feature based matching KLT (Kanade-Lucas-Tomasi) tracker Mono mode guarantees higher frame rate performance stereo mode shows better pose results IEEE/RSJ IROS 2008, Sept 25
  • 34. Future Work To decrease the computational burden Use GPU-based implementation of KLT tracker and SIFT GPU KLT SiftGPU Unifying the contour based tracking IEEE/RSJ IROS 2008, Sept 25
  • 35. Thank you Any Questions? Any Suggestions? Any Comments? IEEE/RSJ IROS 2008, Sept 25