Technical Note: \cal Q-Learning
Machine Learning
Asynchronous Stochastic Approximation and Q-Learning
Machine Learning
Biped Locomotion
Brains, Behavior and Robotics
Introduction to Reinforcement Learning
Introduction to Reinforcement Learning
Neuro-Dynamic Programming
IROS '95 Proceedings of the International Conference on Intelligent Robots and Systems-Volume 1 - Volume 1
Coupled Van Der Pol oscillators utilised as central pattern generators for quadruped locomotion
CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
Fully interconnected, linear control for limit cycle walking
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Hi-index | 0.00 |
This paper presents a general control architecture for bipedal walking which is based on a divide-and-conquer approach. Based on the architecture, the sagittal-plane motion-control algorithm is formulated using a control approach known as Virtual Model Control. A reinforcment learning algorithm is used to learn the key parameter of the swing leg control task so that speed control can be achieved. The control algorithm is applied to two simulated bipedal robots. The simulation analyses demonstrate that the local speed control mechanism based on the stance ankle is effective in reducing the learning time. The algorithm is also demonstrated to be general in that it is applicable across bipedal robots that have different length and mass parameters.