Learning variable impedance control

  • Authors:
  • Jonas Buchli; Freek Stulp; Evangelos Theodorou; Stefan Schaal

  • Affiliations:
  • Jonas Buchli: Computational Learning and Motor Control Lab, University of Southern California, Los Angeles, USA; Department of Advanced Robotics, Italian Institute of Technology, Genova, Italy
  • Freek Stulp, Evangelos Theodorou, Stefan Schaal: Computational Learning and Motor Control Lab, University of Southern California, Los Angeles, USA

  • Venue:
  • International Journal of Robotics Research
  • Year:
  • 2011

Abstract

One of the hallmarks of the performance, versatility, and robustness of biological motor control is the ability to adapt the impedance of the overall biomechanical system to different task requirements and stochastic disturbances. A transfer of this principle to robotics is desirable, for instance to enable robots to work robustly and safely in everyday human environments. It is, however, not trivial to derive variable impedance controllers for practical high degree-of-freedom (DOF) robotic tasks. In this contribution, we accomplish such variable impedance control with the reinforcement learning (RL) algorithm PI2 (Policy Improvement with Path Integrals). PI2 is a model-free, sampling-based learning method derived from first principles of stochastic optimal control. The PI2 algorithm requires no tuning of algorithmic parameters besides the exploration noise. The designer can thus fully focus on the cost function design to specify the task. From the viewpoint of robotics, a particularly useful property of PI2 is that it can scale to problems of many DOFs, so that reinforcement learning on real robotic systems becomes feasible. We sketch the PI2 algorithm and its theoretical properties, and how it is applied to gain scheduling for variable impedance control. We evaluate our approach by presenting results on several simulated and real robots. We consider tasks involving accurate tracking through via points, and manipulation tasks requiring physical contact with the environment. In these tasks, the optimal strategy requires tuning both the reference trajectory and the impedance of the end-effector. The results show that we can use path-integral-based reinforcement learning not only for planning but also to derive variable-gain feedback controllers in realistic scenarios. Thus, the power of variable impedance control is made available to a wide variety of robotic systems and practical applications.
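The abstract references the core of PI2 without detailing it. In PI2, each noisy rollout τ_k with accumulated cost S(τ_k) receives a probability weight P_k ∝ exp(−S(τ_k)/λ), and the policy parameters are updated by the probability-weighted average of the exploration noise. The sketch below is a minimal, hypothetical illustration of this idea applied to gain scheduling on a 1-DOF unit-mass system; it uses a simplified episodic variant (total rollout cost rather than per-timestep cost-to-go, a raw per-timestep gain schedule rather than the paper's parameterized policies), and all task constants are assumptions, not values from the paper.

    import numpy as np

    # Minimal, hypothetical sketch of a PI2-style update for learning a
    # variable gain schedule on a 1-DOF unit-mass system. Simplified
    # episodic variant: total rollout cost instead of per-timestep
    # cost-to-go; all constants are illustrative assumptions.

    rng = np.random.default_rng(0)

    T, dt = 100, 0.01                           # horizon and integration step
    x_ref = np.sin(np.linspace(0.0, np.pi, T))  # assumed reference trajectory
    kp = np.full(T, 10.0)                       # initial proportional-gain schedule
    sigma = 2.0                                 # exploration noise (only tuned parameter)
    h = 10.0                                    # sensitivity of the soft-max weighting
    K = 20                                      # rollouts per update

    def rollout(kp_sched):
        # Variable impedance control law: u = kp(t)*(x_ref - x) - kd(t)*xd,
        # with kd(t) = 2*sqrt(kp(t)) for critical damping of the unit mass.
        x, xd, cost = 0.0, 0.0, 0.0
        for t in range(T):
            kd = 2.0 * np.sqrt(kp_sched[t])
            u = kp_sched[t] * (x_ref[t] - x) - kd * xd
            xd += u * dt                        # unit-mass dynamics: xdd = u
            x += xd * dt
            # Cost: track the reference while penalizing high gains.
            cost += (x_ref[t] - x) ** 2 + 1e-4 * kp_sched[t]
        return cost

    for update in range(50):
        eps = sigma * rng.standard_normal((K, T))   # exploration noise per rollout
        costs = np.array([rollout(np.clip(kp + eps[k], 0.1, None))
                          for k in range(K)])
        # PI2 core step: soft-max of negative (range-normalized) costs,
        # then a probability-weighted average of the exploration noise.
        s = (costs - costs.min()) / (costs.max() - costs.min() + 1e-12)
        w = np.exp(-h * s)
        w /= w.sum()
        kp = np.clip(kp + w @ eps, 0.1, None)       # keep gains positive

    print(f"cost after learning: {rollout(kp):.4f}")

Tying the damping gain to the stiffness (kd = 2·√kp, critical damping for a unit mass) is a common simplification in variable impedance control: it reduces the learning problem to a single gain schedule while keeping the closed loop well damped. The gain penalty in the cost drives the schedule toward low impedance wherever the task permits it, which is the qualitative behavior the abstract describes.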