SlideShare a Scribd company logo
Challenges in Perception for
Learning, Cognition and Control
Approaches
Darius	Burschka	
Machine	Vision	Perception	Group	(MVP)	
Technische	Universität	München,	Germany
Darius	Burschka	–	Machine	Vision	and	Perception	Group	(TUM)
Structure of a CNN for Robotics Applications
Raw	sensor	information	(depth	image	from	an	RGB-D	sensor)	is	taken	as	input	
to	connect	with	the	robot	control	command	directly		
Ideal	case:	learn	the	coupling	between	perception	and	control	commands	from	
demonstrations
Darius	Burschka	–	Machine	Vision	and	Perception	Group	(TUM)
Curse of Complexity in Direct Processing
In	theory,	one	could	use	all	the	extracted	
features	with	a	classifier	such	as	a	softmax	
classifier,	but	this	can	be	computationally	
challenging.	For	instance	images	of	size	96x96	
pixels	with	400	learned	features	over	8x8	
inputs.	Each	convolution	results	in	an	output	of	
size	(96−8+1)∗(96−8+1)=7921,	and	since	we	
have	400	features,	this	results	in	a	vector	
of	892∗400=	3,168,400	features	per	example.	
Learning	a	classifier	with	inputs	having	3+	
million	features	can	be	unwieldy,	and	can	also	
be	prone	to	over-fitting.
Darius	Burschka	–	Machine	Vision	and	Perception	Group	(TUM)
Deep-Learning for Perception in Robotics
What	do	all	these	tasks	have	in	common?
Source:	L.	Tai,	“Deep-Learning	in	Mobile	Robotics…”
Darius	Burschka	–	Machine	Vision	and	Perception	Group	(TUM)
Learning Approaches vs. Conventional Tools
Image	source:	Berkley
Grouping/Segmentation,	Labeling,	Identification	
because	differences	in	the	properties	of	pixels	to	
entire	image	are	considered	due	to	the	structure	of	
CNNs	
Problem:	not	clear,	what	the	processing	is	based	on!
?
Hand-designed	features	help	to	estimate	metric	
distances	between	them	using	calibration	parameters
Darius	Burschka	–	Machine	Vision	and	Perception	Group	(TUM)
Deep-Learning for Control in Robotics
Source:	L.	Tai,	“Deep-Learning	in	Mobile	Robotics…”
Darius	Burschka	–	Machine	Vision	and	Perception	Group	(TUM)
Can we avoid Metric Information in
Representation of Environments?
Darius	Burschka	–	Machine	Vision	and	Perception	Group	(TUM)
Fast Uncalibrated Monocular Estimation of 

Independent Motion Components
Representation of the Environment in Collision Space
Darius	Burschka	–	Machine	Vision	and	Perception	Group	(TUM)
Examples of non-metric Control (RoboMobil)
Darius	Burschka	–	Machine	Vision	and	Perception	Group	(TUM)
Examples of non-metric Control (RoboMobil)
Darius	Burschka	–	Machine	Vision	and	Perception	Group	(TUM)
Conclusions
• Image	labeling	and	image	retrieval	(indexing)	represent	a	large	
application	field	for	perception	and	image	processing	(Google,	Apple,	
etc.),	but	robotic	requires	often	a	metric	mapping	from	the	sensor	
onto	control	values	
• Work	on	alternatives	for	coupling	between	the	sensor	and	the	
actuator	should	be	one	of	the	perception	goals	
• Currently	used	data	representations	need	to	be	re-defined	for	learning	
applications.

More Related Content

PDF
European Robotics Forum 2014 Talk on Closing the Perception-Action Gap
PDF
Robust and Efficient Coupling of Perception to Actuation with Metric and Non-...
PDF
Visual Mapping and Collision Avoidance Dynamic Environments in Dynamic Enviro...
PDF
Deep Learning for Structure-from-Motion (SfM)
PDF
Kq3518291832
PDF
PDF
A ROBUST BACKGROUND REMOVAL ALGORTIHMS USING FUZZY C-MEANS CLUSTERING
PDF
Visual odometry _report
European Robotics Forum 2014 Talk on Closing the Perception-Action Gap
Robust and Efficient Coupling of Perception to Actuation with Metric and Non-...
Visual Mapping and Collision Avoidance Dynamic Environments in Dynamic Enviro...
Deep Learning for Structure-from-Motion (SfM)
Kq3518291832
A ROBUST BACKGROUND REMOVAL ALGORTIHMS USING FUZZY C-MEANS CLUSTERING
Visual odometry _report

What's hot (19)

PDF
IEEE 2013 Matlab titles
PDF
Random Valued Impulse Noise Elimination using Neural Filter
PDF
Tracking Chessboard Corners Using Projective Transformation for Augmented Rea...
PDF
Keynote at Tracking Workshop during ISMAR 2014
PDF
From Sense to Print: Towards Automatic 3D Printing from 3D Sensing Devices
PDF
X36141145
PDF
Vision based non-invasive tool for facial swelling assessment
PDF
NEURAL NETWORKS FOR HIGH PERFORMANCE TIME-DELAY ESTIMATION AND ACOUSTIC SOURC...
PDF
SLAM Zero to One
PDF
Real-time 3D Object Pose Estimation and Tracking for Natural Landmark Based V...
DOCX
Haze removal for a single remote sensing image based on deformed haze imaging...
PPTX
Introduction to Segmentation in Computer vision
PDF
A NOVEL BACKGROUND SUBTRACTION ALGORITHM FOR PERSON TRACKING BASED ON K-NN
PDF
Handwritten digits recognition report
PDF
Hoip10 articulo counting people in crowded environments_univ_berlin
PDF
Invalidating vulnerable broadcaster nodes using
PDF
Disparity map generation based on trapezoidal camera architecture for multi v...
PDF
Moving object detection using background subtraction algorithm using simulink
PPTX
Introduction to Convolutional Neural Networks
IEEE 2013 Matlab titles
Random Valued Impulse Noise Elimination using Neural Filter
Tracking Chessboard Corners Using Projective Transformation for Augmented Rea...
Keynote at Tracking Workshop during ISMAR 2014
From Sense to Print: Towards Automatic 3D Printing from 3D Sensing Devices
X36141145
Vision based non-invasive tool for facial swelling assessment
NEURAL NETWORKS FOR HIGH PERFORMANCE TIME-DELAY ESTIMATION AND ACOUSTIC SOURC...
SLAM Zero to One
Real-time 3D Object Pose Estimation and Tracking for Natural Landmark Based V...
Haze removal for a single remote sensing image based on deformed haze imaging...
Introduction to Segmentation in Computer vision
A NOVEL BACKGROUND SUBTRACTION ALGORITHM FOR PERSON TRACKING BASED ON K-NN
Handwritten digits recognition report
Hoip10 articulo counting people in crowded environments_univ_berlin
Invalidating vulnerable broadcaster nodes using
Disparity map generation based on trapezoidal camera architecture for multi v...
Moving object detection using background subtraction algorithm using simulink
Introduction to Convolutional Neural Networks
Ad

Viewers also liked (11)

PDF
Challenges for Perception in Navigation and Manipulation
PPTX
Perception
PPTX
Perception and Attention
PPTX
Amazing benefits of cumin.
PPT
Austin Journal of Bioorganic & Organic Chemistry
PPTX
Flipped Classroom project at UPV
PPT
Princeton Real Estate Inventory Grid
PDF
PDF
perbedaan traditional dressing VS modern dressing
PDF
TestingAR XI - Beyond the Basics - Planificación de Testing con Matriz ACC
PPSX
Presentacion Pinturas Cobalto
Challenges for Perception in Navigation and Manipulation
Perception
Perception and Attention
Amazing benefits of cumin.
Austin Journal of Bioorganic & Organic Chemistry
Flipped Classroom project at UPV
Princeton Real Estate Inventory Grid
perbedaan traditional dressing VS modern dressing
TestingAR XI - Beyond the Basics - Planificación de Testing con Matriz ACC
Presentacion Pinturas Cobalto
Ad

Similar to Challenges in Perception for Learning, Cognition and Control Approaches (20)

PPTX
Introduction to Computer Vision and its Applications
PDF
Deep Learning a whirlwind tour of key principles
PDF
Understanding Convolutional Neural Networks
PPTX
A simple presentation for deep learning.
PDF
"Demystifying Deep Neural Networks," a Presentation from BDTI
PDF
Deep Learning: a birds eye view
PDF
CNN Algorithm
PDF
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
PPTX
An introduction to Deep Learning
PPTX
Introduction to Deep learning
PDF
Introduction-to-Neural-Networks-and-Deep-Learning.pptx.pdf
PPT
Introduction_to_DEEP_LEARNING ppt 101ppt
PPT
Introduction_to_DEEP_LEARNING.sfsdafsadfsadfsdafsdppt
PPTX
Automatic Attendace using convolutional neural network Face Recognition
PPT
Introduction_to_DEEP_LEARNING.ppt
PDF
Introduction to Deep Learning: Concepts, Architectures, and Applications
PDF
Deep Learning for Unmanned Systems Anis Koubaa
PDF
Phx dl meetup
PPTX
Deep Learning: Towards General Artificial Intelligence
PDF
Tutorial on Deep Learning
Introduction to Computer Vision and its Applications
Deep Learning a whirlwind tour of key principles
Understanding Convolutional Neural Networks
A simple presentation for deep learning.
"Demystifying Deep Neural Networks," a Presentation from BDTI
Deep Learning: a birds eye view
CNN Algorithm
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
An introduction to Deep Learning
Introduction to Deep learning
Introduction-to-Neural-Networks-and-Deep-Learning.pptx.pdf
Introduction_to_DEEP_LEARNING ppt 101ppt
Introduction_to_DEEP_LEARNING.sfsdafsadfsadfsdafsdppt
Automatic Attendace using convolutional neural network Face Recognition
Introduction_to_DEEP_LEARNING.ppt
Introduction to Deep Learning: Concepts, Architectures, and Applications
Deep Learning for Unmanned Systems Anis Koubaa
Phx dl meetup
Deep Learning: Towards General Artificial Intelligence
Tutorial on Deep Learning

Recently uploaded (20)

PPTX
Research Process - Research Methods course
PPTX
Tablets And Capsule Preformulation Of Paracetamol
PPTX
nose tajweed for the arabic alphabets for the responsive
PPTX
NORMAN_RESEARCH_PRESENTATION.in education
PDF
natwest.pdf company description and business model
PDF
oil_refinery_presentation_v1 sllfmfls.pdf
PPTX
Presentation for DGJV QMS (PQP)_12.03.2025.pptx
PPTX
ART-APP-REPORT-FINctrwxsg f fuy L-na.pptx
PPTX
water for all cao bang - a charity project
PDF
Instagram's Product Secrets Unveiled with this PPT
PPTX
_ISO_Presentation_ISO 9001 and 45001.pptx
PPTX
PHIL.-ASTRONOMY-AND-NAVIGATION of ..pptx
PPT
The Effect of Human Resource Management Practice on Organizational Performanc...
PPTX
MERISTEMATIC TISSUES (MERISTEMS) PPT PUBLIC
PPTX
Impressionism_PostImpressionism_Presentation.pptx
PPTX
AcademyNaturalLanguageProcessing-EN-ILT-M02-Introduction.pptx
PPTX
lesson6-211001025531lesson plan ppt.pptx
PPTX
Project and change Managment: short video sequences for IBA
PPTX
2025-08-10 Joseph 02 (shared slides).pptx
PDF
Microsoft-365-Administrator-s-Guide_.pdf
Research Process - Research Methods course
Tablets And Capsule Preformulation Of Paracetamol
nose tajweed for the arabic alphabets for the responsive
NORMAN_RESEARCH_PRESENTATION.in education
natwest.pdf company description and business model
oil_refinery_presentation_v1 sllfmfls.pdf
Presentation for DGJV QMS (PQP)_12.03.2025.pptx
ART-APP-REPORT-FINctrwxsg f fuy L-na.pptx
water for all cao bang - a charity project
Instagram's Product Secrets Unveiled with this PPT
_ISO_Presentation_ISO 9001 and 45001.pptx
PHIL.-ASTRONOMY-AND-NAVIGATION of ..pptx
The Effect of Human Resource Management Practice on Organizational Performanc...
MERISTEMATIC TISSUES (MERISTEMS) PPT PUBLIC
Impressionism_PostImpressionism_Presentation.pptx
AcademyNaturalLanguageProcessing-EN-ILT-M02-Introduction.pptx
lesson6-211001025531lesson plan ppt.pptx
Project and change Managment: short video sequences for IBA
2025-08-10 Joseph 02 (shared slides).pptx
Microsoft-365-Administrator-s-Guide_.pdf

Challenges in Perception for Learning, Cognition and Control Approaches