SlideShare a Scribd company logo
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Perception and Manipulation,
Closing the Gap
Darius Burschka
Machine Vision and Perception Group

Department of Computer Science
Technische Universität München
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Where	
  do	
  we	
  need	
  perception?
“known” clutter “unknown” clutter
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Role of perception in an unknown
environment
labeling motion parameters
• Acquisition of the scene
• Clustering of raw sensor
data
• Identification of
independent motion
components
• Estimation of action
• Understanding actions
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
What	
  is	
  in	
  the	
  scene?	
  
(sensor	
  data	
  abstraction)
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Feature Extraction
• For all pairs of surflets at

distance d insert the triple







plus a pointer to its model in a

hash-table.
• Do this for all models using the

same hash-table.
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
• For each model surflet pair

in the hash-table cell:



Compute the rigid

transform T that

best aligns



Online Recognition Phase
model hash-table
IJRR 2012 Special Issue, Papazov et al.
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
IJRR 2012 Special Issue, Papazov et al.
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
What	
  happens	
  if	
  an	
  object	
  is	
  similar	
  to	
  
one	
  in	
  the	
  database?
Indexing to the Atlas database needs
to be extended to object classes
-> deformable shape registration
needed
Atlas information Observed object
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Deformable Registration from 

generic models (special issue SGP'11 Papazov et al.)
Matching of a detailed shape
to a primitive prior
The manipulation “heat map” from
the generic model gets propagated
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Alignment of non-rigid
shapes for inspection and
localization
Similarity estimates for
categorization of objects
Visual identification of
kinematic chains
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
The proposed hierarchical MRFs consists of two layers:
segmentation and correspondence layer.
Segmentation layer is designed for the dynamic foreground
extraction.
Correspondence layer is designed to maintain the
correspondences of extracted foreground data across
consecutive frames
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
What do we try to extract from the
environment?
labeling motion parameters
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Hybrid Model of the Environment 

(JC Ramirez)
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
World model saves additional info, 

like texture, motion, etc
VISAPP 2013 J.Ramirez et al.)
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Fusion	
  of	
  sensor	
  readings
-­‐	
  Construction	
  of	
  3D	
  models
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Pose Estimation
➢Two motion prediction
concepts
• 2D feature propagation by
motion derivatives
• IMU-based feature
prediction
➢Combination of both:
• translation propagation by
feature velocity (2D)
• rotation propagation by
gyroscopes
no feature propagation
Strobl, Mair, Bodenmüller, Kielhofer, Sepp, Suppa, Burschka, Hirzinger
IROS, IEEE/RSJ, 2009, Best Paper Finalist
Mair, Strobl, Bodenmüller, Suppa, Burschka

KI, Springer Journal, 2010
MachineVisionandPerceptionGroup@TUMMVP
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Strobl, Mair, Bodenmüller, Kielhofer, Sepp, Suppa, Burschka, Hirzinger
IROS, IEEE/RSJ, 2009, Best Paper Finalist
Mair, Strobl, Bodenmüller, Suppa, Burschka

KI, Springer Journal, 2010
Feature	
  Propagation
➢Two motion prediction
concepts
• 2D feature propagation by
motion derivatives
• IMU-based feature
prediction
➢Combination of both:
• translation propagation by
feature velocity (2D)
• rotation propagation by
gyroscopes
linear feature propagation
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Feature	
  Propagation	
  (Data	
  fusion)
➢Two motion prediction
concepts
• 2D feature propagation
by motion derivatives
• IMU-based feature
prediction
➢Combination of both:
• translation propagation
by feature velocity (2D)
• rotation propagation by
gyroscopes
Strobl, Mair, Bodenmüller, Kielhofer, Sepp, Suppa, Burschka, Hirzinger
IROS, IEEE/RSJ, 2009, Best Paper Finalist
Mair, Strobl, Bodenmüller, Suppa, Burschka

KI, Springer Journal, 2010linear + gyros based
prop.
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Local	
  Feature	
  Tracking	
  Algorithms
•Image-gradient based à Extended KLT (ExtKLT)
• patch-based implementation
• feature propagation
• corner-binding
+ sub-pixel accuracy
• algorithm scales bad with number 

of features
•Tracking-By-Matching à AGAST tracker
• AGAST corner detector
• efficient descriptor
• high frame-rates (hundrets of 

features in a few milliseconds)
+ algorithm scales well with number

of features
• pixel-accuracy
8
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Real	
  Time	
  Pose	
  Tracking

DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Perception-based loop closure
Obstacle avoidance
Navigation	
  system	
  with	
  accuracy	
  
estimation
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Navigation example
Estimation of 3 rotational
angles
Estimation of a translation
vector
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Collaborative Reconstruction with 

Self-Localization
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Asynchronous stereo for 

dynamic scenes
NTP
VISAPP 2014
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Estimation	
  of	
  unobservable	
  properties
Mapping of
Knowledge
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Physical	
  and	
  Geometric	
  Properties	
  	
  
of	
  an	
  Object	
  (ICRA	
  2012	
  Petsch	
  et	
  al.)
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Functional	
  Properties	
  of	
  an	
  Object	
  
stored	
  in	
  Functionality	
  Map
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Each tool used in the
procedure has its own
container describing its
shape, handling properties
etc.
Knowledge	
  Representation
Functionality map for a specific
procedure describes the way
how the tool was used during
the procedure while moved
between points in the world(Petsch/Burschka IROS2011)
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Estimation of non-observable
Scene properties (robot plays)
• Estimation	
  of	
  the	
  Center	
  of	
  mass
• Estimation	
  of	
  Stiffness	
  
• Estimation	
  of	
  Mass	
  and	
  Friction	
  Force
• Estimation	
  of	
  Mass	
  Distribution
simulator
predict act perceive
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Sensory Substitution
Flood of sensors required on modern service
robots tocover varying measurement ranges
and operating spaces
• Video Cameras
• Laser-Range Finders
• Accelerometers
• Force Sensors
• Artifical Skin
• Chemical Sensors
…..
The sensor configuration does change also
with the operating conditions, e.g., baseline
dependency of a binocular stereo setup
➔Implementation of missing sensing
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
accelerometer
camera(s)
gyro
Bag of
Transformations
(physical laws)
Transformation
Graph
physical sensors Sensing modalities
Force
Acceleration
Force distribution
3D Structure
Substitution Framework
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Force Sensing without
Force Sensor?
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Surface Response for
different types of Objects
hard
soft
deformable
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Accelerometer always optimal for
acceleration measurement?
Graph instrumentation
DariusBurschka–PerceptionChallenges
http://www6.in.tum.de/burschka/
Challenges	
  for	
  Perception	
  
• Perception is essential to interact with an unknown and changing
environments
• Perception allows to compensate own uncertainties of the robot
by providing robust reference to the environment
• Level of abstraction of the raw data reduces the amount of
information needed to represent a scene at the cost of increased
processing – an optimum for a given system needs to be found
• System needs to be able to cope not only with static scenes but
also with dynamic changes (actions, failures) in the environment
• Observation of other agents (human other robots) can provide
essential data to fill in the unobservable parameters of the scene

More Related Content

PDF
Visual Mapping and Collision Avoidance Dynamic Environments in Dynamic Enviro...
PDF
Robust and Efficient Coupling of Perception to Actuation with Metric and Non-...
PDF
Challenges in Perception for Learning, Cognition and Control Approaches
PDF
European Robotics Forum 2014 Talk on Closing the Perception-Action Gap
PPTX
Task-Representation for Robust Coupling of Perception to Action in Dynamic En...
PDF
Understanding of Process-Workflow through Identification of Action Constraint...
PPTX
Monocular simultaneous localization and generalized object mapping with undel...
PDF
A benchmark dataset to evaluate sensor displacement in activity recognition
Visual Mapping and Collision Avoidance Dynamic Environments in Dynamic Enviro...
Robust and Efficient Coupling of Perception to Actuation with Metric and Non-...
Challenges in Perception for Learning, Cognition and Control Approaches
European Robotics Forum 2014 Talk on Closing the Perception-Action Gap
Task-Representation for Robust Coupling of Perception to Action in Dynamic En...
Understanding of Process-Workflow through Identification of Action Constraint...
Monocular simultaneous localization and generalized object mapping with undel...
A benchmark dataset to evaluate sensor displacement in activity recognition

Similar to Challenges for Perception in Navigation and Manipulation (20)

PDF
Keynote at Tracking Workshop during ISMAR 2014
PDF
Dynamic Vehicle Tracking and Detection For Self Driving Using FCM Algorithm
PDF
Sensory Substitution
PDF
A Deep Belief Network Approach to Learning Depth from Optical Flow
PDF
Semantic Perception for Telemanipulation at SPME Workshop at ICRA 2013
PDF
Advances In Image Processing And Understanding A Festschrift For Thomas S Hua...
PDF
Obstacle detection in images
PDF
“Event-Based Neuromorphic Perception and Computation: The Future of Sensing a...
PDF
Bosch Expert Days
PPTX
TechnicalBackgroundOverview
PPTX
PPTX
IntroComputerVision23.pptx
PDF
adly Shahat Mergany tag eldien_vission based.pdf
PDF
adly Shahat Mergany tag eldien_vission based.pdf
PDF
Thesis
PDF
Robotics Navigation
PDF
Artificial Intelligence for Vision: A walkthrough of recent breakthroughs
PPTX
1_Intro2ssssssssssssssssssssssssssssss2.pptx
PDF
Fusion of Multi-MAV Data
Keynote at Tracking Workshop during ISMAR 2014
Dynamic Vehicle Tracking and Detection For Self Driving Using FCM Algorithm
Sensory Substitution
A Deep Belief Network Approach to Learning Depth from Optical Flow
Semantic Perception for Telemanipulation at SPME Workshop at ICRA 2013
Advances In Image Processing And Understanding A Festschrift For Thomas S Hua...
Obstacle detection in images
“Event-Based Neuromorphic Perception and Computation: The Future of Sensing a...
Bosch Expert Days
TechnicalBackgroundOverview
IntroComputerVision23.pptx
adly Shahat Mergany tag eldien_vission based.pdf
adly Shahat Mergany tag eldien_vission based.pdf
Thesis
Robotics Navigation
Artificial Intelligence for Vision: A walkthrough of recent breakthroughs
1_Intro2ssssssssssssssssssssssssssssss2.pptx
Fusion of Multi-MAV Data
Ad

Recently uploaded (20)

PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
A Presentation on Artificial Intelligence
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Electronic commerce courselecture one. Pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
Spectroscopy.pptx food analysis technology
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
Programs and apps: productivity, graphics, security and other tools
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Assigned Numbers - 2025 - Bluetooth® Document
MIND Revenue Release Quarter 2 2025 Press Release
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
A Presentation on Artificial Intelligence
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Digital-Transformation-Roadmap-for-Companies.pptx
Electronic commerce courselecture one. Pdf
Advanced methodologies resolving dimensionality complications for autism neur...
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
NewMind AI Weekly Chronicles - August'25-Week II
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Spectroscopy.pptx food analysis technology
The AUB Centre for AI in Media Proposal.docx
Building Integrated photovoltaic BIPV_UPV.pdf
Programs and apps: productivity, graphics, security and other tools
Ad

Challenges for Perception in Navigation and Manipulation