Adaptive control of mechanical manipulators
Adaptive control of mechanical manipulators
Technical Note: \cal Q-Learning
Machine Learning
Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning
Artificial Intelligence
Gradient descent for general reinforcement learning
Proceedings of the 1998 conference on Advances in neural information processing systems II
Stochastic dynamic programming with factored representations
Artificial Intelligence
Learning to Predict by the Methods of Temporal Differences
Machine Learning
Reinforcement Learning in POMDP's via Direct Gradient Ascent
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Optimistic pruning for multiple instance learning
Pattern Recognition Letters
Coordination motion-tasks using actual robot dynamics
Proceedings of the 2009 conference on Artificial Intelligence Research and Development: Proceedings of the 12th International Conference of the Catalan Association for Artificial Intelligence
Coordination motion-tasks using actual robot dynamics
Proceedings of the 2009 conference on Artificial Intelligence Research and Development: Proceedings of the 12th International Conference of the Catalan Association for Artificial Intelligence
Emerging behaviors by learning joint coordination in articulated mobile robots
IWANN'07 Proceedings of the 9th international work conference on Artificial neural networks
Variable risk control via stochastic optimization
International Journal of Robotics Research
Hi-index | 0.00 |
This paper describes a method for structuring a robot motor learning task. By designing a suitably parameterized policy, we show that a simple search algorithm, along with biologically motivated constraints, offers an effective means for motor skill acquisition. The framework makes use of the robot counterparts to several elements found in human motor learning: imitation, equilibrium-point control, motor programs, and synergies. We demonstrate that through learning, coordinated behavior emerges from initial, crude knowledge about a difficult robot weightlifting task.