Robot weightlifting by direct policy search

Authors:
Michael T. Rosenstein;Andrew G. Barto
Affiliations:
Department of Computer Science, University of Massachusetts, Amherst, MA;Department of Computer Science, University of Massachusetts, Amherst, MA
Venue:
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Year:
2001

Citing 8
Cited 5

Adaptive control of mechanical manipulators

Adaptive control of mechanical manipulators
Technical Note: \cal Q-Learning

Machine Learning
The Parti-game Algorithm for Variable Resolution Reinforcement Learning in Multidimensional State-spaces

Machine Learning
Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning

Artificial Intelligence
Gradient descent for general reinforcement learning

Proceedings of the 1998 conference on Advances in neural information processing systems II
Stochastic dynamic programming with factored representations

Artificial Intelligence
Learning to Predict by the Methods of Temporal Differences

Machine Learning
Reinforcement Learning in POMDP's via Direct Gradient Ascent

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning

Optimistic pruning for multiple instance learning

Pattern Recognition Letters
Coordination motion-tasks using actual robot dynamics

Proceedings of the 2009 conference on Artificial Intelligence Research and Development: Proceedings of the 12th International Conference of the Catalan Association for Artificial Intelligence
Coordination motion-tasks using actual robot dynamics

Proceedings of the 2009 conference on Artificial Intelligence Research and Development: Proceedings of the 12th International Conference of the Catalan Association for Artificial Intelligence
Emerging behaviors by learning joint coordination in articulated mobile robots

IWANN'07 Proceedings of the 9th international work conference on Artificial neural networks
Variable risk control via stochastic optimization

International Journal of Robotics Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes a method for structuring a robot motor learning task. By designing a suitably parameterized policy, we show that a simple search algorithm, along with biologically motivated constraints, offers an effective means for motor skill acquisition. The framework makes use of the robot counterparts to several elements found in human motor learning: imitation, equilibrium-point control, motor programs, and synergies. We demonstrate that through learning, coordinated behavior emerges from initial, crude knowledge about a difficult robot weightlifting task.