Reinforcement learning in robotics: A survey

Authors:
Jens Kober;J. Andrew Bagnell;Jan Peters
Affiliations:
Bielefeld University, CoR-Lab Research Institute for Cognition and Robotics, Bielefeld, Germany, Honda Research Institute Europe, Offenbach/Main, Germany;Carnegie Mellon University, Robotics Institute, Pittsburgh, PA, USA;Max Planck Institute for Intelligent Systems, Department of Empirical Inference, Tübingen, Germany, Technische Universität Darmstadt, FB Informatik, FG Intelligent Autonomous Systems, Da ...
Venue:
International Journal of Robotics Research
Year:
2013

Citing 83
Cited 0

Model-based control of a robot manipulator

Model-based control of a robot manipulator
Integrated architecture for learning, planning, and reacting based on approximating dynamic programming

Proceedings of the seventh international conference (1990) on Machine learning
Automatic programming of behavior-based robots using reinforcement learning

Artificial Intelligence
Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning

Machine Learning
Prioritized Sweeping: Reinforcement Learning with Less Data and Less Time

Machine Learning
Incremental multi-step Q-learning

Machine Learning - Special issue on reinforcement learning
Purposive behavior acquisition for a real robot by vision-based reinforcement learning

Machine Learning - Special issue on robot learning
A Kendama learning robot based on bi-directional theory

Neural Networks - 1996 Special issue: four major hypotheses in neuroscience
Using expectation-maximization for reinforcement learning

Neural Computation
Locally Weighted Learning for Control

Artificial Intelligence Review - Special issue on lazy learning
Learning agents for uncertain environments (extended abstract)

COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
A framework for reinforcement learning on real robots

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Nonparametric model-based reinforcement learning

NIPS '97 Proceedings of the 1997 conference on Advances in neural information processing systems 10
Likelilood ratio gradient estimation: an overview

WSC '87 Proceedings of the 19th conference on Winter simulation
Practical methods for optimal control using nonlinear programming

Practical methods for optimal control using nonlinear programming
Dynamic Programming and Optimal Control

Dynamic Programming and Optimal Control
Markov Decision Processes: Discrete Stochastic Dynamic Programming

Markov Decision Processes: Discrete Stochastic Dynamic Programming
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Scalable Techniques from Nonparametric Statistics for Real Time Robot Learning

Applied Intelligence
Reinforcement Learning in the Multi-Robot Domain

Autonomous Robots
Near-Optimal Reinforcement Learning in Polynomial Time

Machine Learning
Noise and the Reality Gap: The Use of Simulation in Evolutionary Robotics

Proceedings of the Third European Conference on Advances in Artificial Life
Reinforcement Learning in Situated Agents: Theoretical and Practical Solutions

EWLR-8 Proceedings of the 8th European Workshop on Learning Robots: Advances in Robot Learning
Direct Policy Search using Paired Statistical Tests

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Approximately Optimal Approximate Reinforcement Learning

ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
Robot Learning From Demonstration

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping

ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Reinforcement Learning for Biped Locomotion

ICANN '02 Proceedings of the International Conference on Artificial Neural Networks
Learning to Drive a Bicycle Using Reinforcement Learning and Shaping

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Introduction to Stochastic Search and Optimization

Introduction to Stochastic Search and Optimization
Learning in embedded systems

Learning in embedded systems
Recent Advances in Hierarchical Reinforcement Learning

Discrete Event Dynamic Systems
R-max - a general polynomial time algorithm for near-optimal reinforcement learning

The Journal of Machine Learning Research
Lyapunov design for safe reinforcement learning

The Journal of Machine Learning Research
The Cross Entropy Method: A Unified Approach To Combinatorial Optimization, Monte-carlo Simulation (Information Science and Statistics)

The Cross Entropy Method: A Unified Approach To Combinatorial Optimization, Monte-carlo Simulation (Information Science and Statistics)
Apprenticeship learning via inverse reinforcement learning

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

The Journal of Machine Learning Research
Learning from observation using primitives

Learning from observation using primitives
Relating reinforcement learning performance to classification performance

ICML '05 Proceedings of the 22nd international conference on Machine learning
High speed obstacle avoidance using monocular vision and reinforcement learning

ICML '05 Proceedings of the 22nd international conference on Machine learning
Using inaccurate models in reinforcement learning

ICML '06 Proceedings of the 23rd international conference on Machine learning
Maximum margin planning

ICML '06 Proceedings of the 23rd international conference on Machine learning
Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning)

Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning)
Pattern Recognition and Machine Learning (Information Science and Statistics)

Pattern Recognition and Machine Learning (Information Science and Statistics)
On the role of tracking in stationary environments

Proceedings of the 24th international conference on Machine learning
Application of reinforcement learning in robot soccer

Engineering Applications of Artificial Intelligence
Learning CPG-based Biped Locomotion with a Policy Gradient Method: Application to a Humanoid Robot

International Journal of Robotics Research
Learning to Control in Operational Space

International Journal of Robotics Research
Trajectory Optimization using Reinforcement Learning for Map Exploration

International Journal of Robotics Research
Free gait generation with reinforcement learning for a six-legged robot

Robotics and Autonomous Systems
Natural Actor-Critic

Neurocomputing
Operational Space Control: A Theoretical and Empirical Comparison

International Journal of Robotics Research
2008 Special Issue: Reinforcement learning of motor skills with policy gradients

Neural Networks
Space-indexed dynamic programming: learning to follow trajectories

Proceedings of the 25th international conference on Machine learning
Perception and Developmental Learning of Affordances in Autonomous Robots

KI '07 Proceedings of the 30th annual German conference on Advances in Artificial Intelligence
A Reinforcement Learning Technique with an Adaptive Action Generator for a Multi-robot System

SAB '08 Proceedings of the 10th international conference on Simulation of Adaptive Behavior: From Animals to Animats
State-Dependent Exploration for Policy Gradient Methods

ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
Robot Navigation Based on Fuzzy RL Algorithm

ISNN '08 Proceedings of the 5th international symposium on Neural Networks: Advances in Neural Networks
A survey of robot learning from demonstration

Robotics and Autonomous Systems
Apprenticeship learning for helicopter control

Communications of the ACM - Barbara Liskov: ACM's A.M. Turing Award Winner
Search-based structured prediction

Machine Learning
Regularization and feature selection in least-squares temporal difference learning

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Reinforcement learning for robot soccer

Autonomous Robots
Policy Gradient Learning of Cooperative Interaction with a Robot Using User's Biological Signals

Advances in Neuro-Information Processing
Maximum entropy inverse reinforcement learning

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Reinforcement learning: a survey

Journal of Artificial Intelligence Research
Automatic gait optimization with Gaussian process regression

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Covariant policy search

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Active learning using mean shift optimization for robot grasping

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
From Motor Learning to Interaction Learning in Robots

From Motor Learning to Interaction Learning in Robots
LQR-trees: Feedback Motion Planning via Sums-of-Squares Verification

International Journal of Robotics Research
Combining active learning and reactive control for robot grasping

Robotics and Autonomous Systems
Reinforcement Learning and Dynamic Programming Using Function Approximators

Reinforcement Learning and Dynamic Programming Using Function Approximators
Learning from Demonstration for Autonomous Navigation in Complex Unstructured Terrain

International Journal of Robotics Research
Learning visual representations for perception-action systems

International Journal of Robotics Research
Learning variable impedance control

International Journal of Robotics Research
Policy search for motor primitives in robotics

Machine Learning
Robot learning from demonstration by constructing skill trees

International Journal of Robotics Research
Learning message-passing inference machines for structured prediction

CVPR '11 Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition
Learning reactive and planning rules in a motivationally autonomousanimat

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Learning to pour with a robot arm combining goal and shape learning for dynamic movement primitives

Robotics and Autonomous Systems
Learning Generalizable Control Programs

IEEE Transactions on Autonomous Mental Development
Learning to select and generalize striking movements in robot table tennis

International Journal of Robotics Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

Reinforcement learning offers to robotics a framework and set of tools for the design of sophisticated and hard-to-engineer behaviors. Conversely, the challenges of robotic problems provide both inspiration, impact, and validation for developments in reinforcement learning. The relationship between disciplines has sufficient promise to be likened to that between physics and mathematics. In this article, we attempt to strengthen the links between the two research communities by providing a survey of work in reinforcement learning for behavior generation in robots. We highlight both key challenges in robot reinforcement learning as well as notable successes. We discuss how contributions tamed the complexity of the domain and study the role of algorithms, representations, and prior knowledge in achieving these successes. As a result, a particular focus of our paper lies on the choice between model-based and model-free as well as between value-function-based and policy-search methods. By analyzing a simple problem in some detail we demonstrate how reinforcement learning approaches may be profitably applied, and we note throughout open questions and the tremendous potential for future research.