Thesis dissertation: Humanoid Robot Control of Complex Postural Tasks based on Learning from Demonstration

Miguel González-Fierro | Humanoid Robot Control of Complex Postural Tasks based on Learning from Demonstration 1
Humanoid Robot Control of
Complex Postural
Tasks based on Learning from
Demonstration
Miguel González-Fierro Palacios
Advisers:
Prof. Carlos Balaguer
Universidad Carlos III
Dr. Thrishantha Nanayakkara
King’s College London

Behavior transfer & postural control

High level order

Introduction
Objectives
1. Postural planning and
control of a single skill
2. Postural planning and
control of consecutive
skills
3. Execution of high level
order

Introduction
Outline
HIGH LEVEL
ORDER
Environment analysis
Postural planning
Postural control
SINGLE
SKILL
BASIC
CONCEPTS
SEQUENTIAL
SKILLS
POSTURAL
CONTROL
2
3
4
5
6Sequential imitation
Sequential innovation
Behavior selection
Model selection
Fractional control
PID vs
Imitation learning
Skill innovation
Behavior representation
Humanoid models
Gait
Dance mimicking
 
PI D

2. BASIC REPRESENTATIONS FOR
POSTURAL CONTROL IN
HUMANOIDS

Basic Representations
Humanoid robot models

Zero Moment Point

3D Linear Inverted Pendulum Model

Cart Table Model

Gait Simulation

Humanoid Gait

Robot Mimicking

Dance Routine

Video

3. IMITATION LEARNING AND SKILL
INNOVATION IN HUMANOIDS
THROUGH REWARD TEMPLATES

Postural Planning of a Single Skill
Learning in Humans
How do they learn?

Imitation in Robots
1. Determine what
to imitate
2. Establish a
metrics for
imitation
3. Map between
dissimilar bodies
4. Compute the
control
commands

Imitation and Innovation of a Skill

Data Acquisition

ZMP and Torque Data

Reward Function

Markov Chains and Transition Matrix
Markov property:
Markov transition matrix:
Polynomial reward function
Gaussian reward function

Behavior Strategy Asumption

Imitation and Innovation Reward

Robot ZMP
ZMP imitation ZMP innovation

Robot Torque
Torque imitation Torque innovation

Experiments
Imitation Innovation

Video

4. LEARNING AND IMPROVING A
SEQUENCE OF GOAL DIRECTED
SKILLS

Postural Planning of Sequential Skills
Planning Sequential Skills

Overview of Sequential Imitation

Overview of Sequential Innovation

Policy Definition

Encoding Demonstrations
GMM GMRData

Behavior Definition
B1: Approaching the door B2: Grasping the knob
B3: Pulling the door B4: Releasing the knob

Behavior Selector

Contex-Based Reward Functions
Basis Reward Function:
Behavior based Reward Function:
Reward Profiles

Humanoid Opening a Door

Video

5. ROBUST CONTROL OF HUMANOID
MODELS THROUGH FRACTIONAL
CALCULUS

Postural Control
Humanoid Models Discussion
VS

Postural Control
Our Approach
Reduced model
+
Fractional Order Controller

Postural Control
Fractional Order Controller
Cope with mismatches in the model
Robust against disturbances
Approximation as a rational function
+1Kg
+1Kg
+1Kg

Postural Control
Model Identification

Postural Control
State Space Equations

Postural Control
Control System

Postural Control
PID vs Fractional PID
Performance comparison between nominal and overloaded system (+3Kg)

6. CONTROL OF HUMANOID ROBOTS
EXECUTING COMPLEX TASKS

High Level Architecture
Architecture Overview

Floor and Obstacles Estimation
Normals Estimation Plane Estimation Obstacle Clustering
Covariance matrix
Eigenvalues/Eigenvectors
Normals direction
RANSAC
Inliers: 70% (Plane)
Outliers: 30% (Obstacles)
Euclidean Clustering using
K-mean and KD-trees

Environment Analysis

Environment Model

Video

Global Path Planning

Video

Postural Skill Generator

Postural Motion Planning

Whole Body Postural Control

Correction of Disturbances
distF

7. CONCLUSIONS AND
FUTURE WORKS

Conclusions
Summary
HIGH LEVEL
ORDER
Environment analysis
Postural planning
Postural control
SINGLE
SKILL
BASIC
CONCEPTS
SEQUENTIAL
SKILLS
POSTURAL
CONTROL
2
3
4
5
6Sequential imitation
Sequential innovation
Behavior selection
Model selection
Fractional control
PID vs
Imitation learning
Skill innovation
Behavior representation
Humanoid models
Gait
Dance mimicking
 
PI D

Conclusions
Key Contributions
Postural Motion Planning and Control
Reward trajectory as behavior metrics
Reward Transition Matrix as behavior representation
Postural planning and control architecture
Learning from Demonstration
Single skill transference through multimodal reward profile
Skill transference through Reward Transition Matrix
Imitation extended to a set of sequential skills
Skill innovation
Maximization of positive difference of imitation reward
Application to a set of sequential skills
Modelling and Control
Reduced model of the humanoid controlled by fractional controller

Conclusions
Future Works
New approaches on goal emulation and intention understanding
Selection of reward profile through machine learning techniques
Behavior transference between robot-robot or human-human
Implementation of fractional controller in the real humanoid
Decision module to autonomously generate the postural path

Conclusions
List of Publications
Journal papers
- González-Fierro, M., Balaguer, C., Swann, N., and Nanayakkara, T. (2014). Full-Body Postural Control
of a Humanoid Robot with Both Imitation Learning and Skill Innovation. International Journal of
Humanoid Robotics, 11(2):1450012.
- Monje, C.A., Pierro, P., Ramos, T., González-Fierro, M., Balaguer, C. (2013). Modelling and Simulation
of the Humanoid Robot HOAP-3 in the OpenHRP3 Platform Cybernetics and Systems, 44(8):663-680.
- Bueno, J. G., González-Fierro, M., Moreno, L., and Balaguer, C. (2013). Facial Emotion Recognition
and Adaptative Postural Reaction by a Humanoid based on Neural Evolution International Journal of
Advanced Computer Science, 3(10):481-493.
- González-Fierro, M., Hernández, D., Nanayakkara, T., and Balaguer, C. (2014).Behavior Sequencing
Based on Demonstrations-a Case of a Humanoid Opening a Door while Walking. submitted to Advanced
Robotics.
- González-Fierro, M., Monje, C. A., and Balaguer, C. (2014). Fractional Control of a Humanoid Robot
Reduced Model with Model Disturbances. submitted to Cybernetics and Systems
Conference/Symposium papers
- González-Fierro, M., Balaguer, C., Swann, N., and Nanayakkara, T. (2013). A Humanoid Robot
Standing Up Through Learning from Demonstration Using a Multimodal Reward Function. In IEEERAS
International Conference on Humanoid Robots, 2013. Humanoids 2013.
- González-Fierro, M., Bueno, J., Balaguer, C., and Moreno, L. (2013). A Complete 3D Perception and
Path Planning Architecture for a Humanoid. Robocity2030 11th Workshop: Robots Sociales, pages 167-184.
- González-Fierro, M., Maldonado, M. A., Víctores, J. G., Morante, S., and Balaguer, C. (2013).
Object Tagging for Human-Robot Interaction by Recolorization using Gaussian Mixture Models. In
Proceedings of Robocity2030 12th Workshop: Robótica Cognitiva, pages 67-76.

Conclusions
- González-Fierro, M., Monje, C., González, V., and Balaguer, C. (2013). Evolutionary Fractional Order
Control of a Humanoid Robot Modeled as a Triple Inverted Pendulum. In Proceedings of Robocity2030
11th Workshop: Robots Sociales, pages 245-263.
- González-Fierro, M., Monje, C. A., and Balaguer, C. (2013). Robust Control of a Reduced Humanoid
Robot Model using Genetic Algorithms and Fractional Calculus. In Mathematical Methods in Engineering
International Conference MME2013, pages 183-194.
- Bueno, J., Martín, A., González-Fierro, M., Moreno, L., and Balaguer, C. (2013). Distinguishing
between Similar Objects based on Geometrical Features in 3D Perception. In Proceedings of Robocity2030
12th Workshop: Robótica Cognitiva, pages 77-92.
- Víctores, J. G., Morante, S., González-Fierro, M., and Balaguer, C. (2013). Augmented Reality and
Social Interaction platform through Multirobot Design. In Proceedings of Robocity2030 11th Workshop:
Robots Sociales, pages 131-143.
- González-Fierro, M., Hernández, D., Pierro, P., and Balaguer, C. (2012). Dynamic Modelling of
Humanoid Robots Using Spatial Algebra. In XXXIII Jornadas de Automática. CEA.
- Pierro, P., Hernández, D., Herrero, D., González-Fierro, M., and Balaguer, C. (2012). Perception
System for Working with Humanoid Robots in Unstructured Collaborative Scenarios. In Proceedings of
the 2012 International IEEE Intelligent Vehicles Symposium. Workshops V Perception in Robotics. IEEE.
- Bueno, J. G., González-Fierro, M., L.Moreno, and C.Balaguer (2012). Facial Gesture Recognition using
Active Appearance Models based on Neural Evolution. In 2012 International Conference on Human-Robot
Interaction (HRI 2012), pages 133-134. IEEE.
- Monje, C. A., Pierro, P., Ramos, T., González-Fierro, M., and Balaguer, C. (2011). Modelling and
Simulation of the Humanoid Robot HOAP-3 in the OpenHRP3 Platform. In Proceedings of Robot 2011.
Workshop Robots Humanoides.

Conclusions
- Bueno, J. G., González-Fierro, M., Moreno, L., and Balaguer, C. (2011). Facial Gesture Recognition
and Postural Interaction using Neural Evolution Algorithm and Active Appearance Models. In Proceedings
of Robocity2030 9th Workshop: Robots colaborativos e interacción humano-robot, pages 145-159.
- González-Fierro, M., Jardón, A., Martínez de la Casa, S., Stoelen, M. F., Víctores, J. G., Balaguer, C.
(2010). Educational Initiatives Related with the CEABOT Contest. Proceedings of SIMPAR, pp 649-658.
- Mateo, A. P., González-Fierro, M., Hernández, D., Pierro, P., and Balaguer, C. (2010). Robust Real
Time Stabilization: Estabilización de la Imagen con Aplicación en el Robot Humanoide HOAP-3. In
Proceedings of Robocity2030 7th Workshop: Visión en Robótica.
- Peña, A., Hernández, D., González-Fierro, M., Pierro, P., and Balaguer, C. (2010). Sistema de
Visión del Humanoide HOAP-3 para la Detección e Identificación de Objetos Mediante Librerías
OpenCV. In Proceedings of Robocity2030 7th Workshop: Visión en Robótica.
- Pierro, P., González-Fierro, M., and Balaguer, C. (2009). El Proyecto Europeo ROBOT@CWE:
Advanced Robotic Systems in Future Collaborative Working Environments. In Proceedings of Robot 2009.
II Workshop de Robotica (ROBOT 2009).
- Pierro, P., Hernández, D., González-Fierro, M., Blasi, L., Milani, A., and Balaguer, C. (2009). A
Human-Humanoid Interface for Collaborative Tasks. In Proceedings on the Second workshop for young
Researchers on Human-friendly robotics, Sestri Levante, Italy.
- Pierro, P., Hernández, D., González-Fierro, M., Blasi, L., Milani, A., and Balaguer, C. (2009).
Humanoid Teleoperation System for Space Environments. In Advanced Robotics, 2009. ICAR 2009.
International Conference on, pages 1-6. IEEE.
- González-Fierro, M., Pierro, P., Jardón, A., Herrero, D., and Balaguer, C. (2009). Realización de Tareas
Colaborativas entre Robots Humanoides. Experimentación con dos Robots Robonova. In Proceedings
of Robocity2030 5th Workshop: Cooperación en Robótica.

One more thing…

Humanoid Robot Control of
Complex Postural
Tasks based on Learning from
Demonstration
Miguel González-Fierro Palacios
Advisers:
Prof. Carlos Balaguer
Universidad Carlos III
Dr. Thrishantha Nanayakkara
King’s College London

Thesis dissertation: Humanoid Robot Control of Complex Postural Tasks based on Learning from Demonstration

More Related Content

Recently uploaded (20)

Featured (20)

Thesis dissertation: Humanoid Robot Control of Complex Postural Tasks based on Learning from Demonstration