Simulation is widely used in research and industry as a low-cost, high-efficiency alternative to testing on real hardware, and it has also been used to develop and test powerful learning algorithms. However, parameters learned in simulation often do not transfer directly to the real system, in part because heavy optimization in simulation tends to exploit the simulator's inevitable simplifications, creating a gap between simulation and reality that reduces the utility of learning in simulation. This paper introduces Grounded Simulation Learning (GSL), an iterative optimization framework for speeding up robot learning using an imperfect simulator. In GSL, a behavior is first developed on the robot, and then the following three steps are repeated: 1) the behavior is optimized in simulation; 2) the resulting behavior is tested on the real robot and compared to the results predicted by simulation; and 3) the simulator is modified, using a machine learning approach, to bring it closer in line with reality. The approach is fully implemented and validated on the task of learning to walk with an Aldebaran Nao humanoid robot. Starting from a set of stable, hand-coded walk parameters, four iterations of this three-step optimization loop increased the robot's walking speed by more than 25%.
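The three-step GSL loop can be illustrated with a toy sketch. Everything below is an illustrative assumption, not the paper's implementation: the "real robot" is a hidden quadratic speed function, the "simulator" is the same function shifted by an unknown bias, and the grounding step simply searches for the bias that best explains the observed real-world speed.

```python
# Toy sketch of the Grounded Simulation Learning (GSL) loop.
# All function names and dynamics here are illustrative assumptions.

def real_walk_speed(params):
    """'Real robot' walk speed (hidden ground truth): peaks at params == 2.0."""
    return 10.0 - (params - 2.0) ** 2

def make_simulator(bias):
    """Imperfect simulator: same shape as reality, but shifted by `bias`."""
    def sim_walk_speed(params):
        return 10.0 - (params - 2.0 + bias) ** 2
    return sim_walk_speed

def optimize_in_sim(sim, candidates):
    """Step 1: pick the candidate parameters the simulator rates best."""
    return max(candidates, key=sim)

def ground_simulator(params, observed_real, bias_candidates):
    """Step 3: pick the simulator bias whose prediction best matches reality."""
    return min(bias_candidates,
               key=lambda b: abs(make_simulator(b)(params) - observed_real))

def gsl(iterations=4):
    bias = 1.5                                       # initial simulator error
    params = 0.0                                     # stable hand-coded start
    candidates = [round(i * 0.1, 1) for i in range(50)]
    bias_candidates = [round(-2.0 + i * 0.1, 1) for i in range(41)]
    for _ in range(iterations):
        sim = make_simulator(bias)
        params = optimize_in_sim(sim, candidates)    # 1) optimize in simulation
        real = real_walk_speed(params)               # 2) test on the "robot"
        bias = ground_simulator(params, real,        # 3) ground the simulator
                                bias_candidates)
    return params, real_walk_speed(params)

params, speed = gsl()
```

In this toy setting, naive optimization in the initial (biased) simulator picks poor parameters, while alternating optimization with grounding converges on parameters that are good on the "real" system, mirroring the iterate-and-ground structure described above.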