Learning Time Allocation Using Neural Networks
CG '00 Revised Papers from the Second International Conference on Computers and Games
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
The Journal of Machine Learning Research
Reinforcement learning for quasi-passive dynamic walking of an unstable biped robot
Robotics and Autonomous Systems
Reinforcement Learning State Estimator
Neural Computation
Learning CPG-based Biped Locomotion with a Policy Gradient Method: Application to a Humanoid Robot
International Journal of Robotics Research
Reinforcement Learning in Fine Time Discretization
ICANNGA '07 Proceedings of the 8th international conference on Adaptive and Natural Computing Algorithms, Part I
Learning CPG sensory feedback with policy gradient for biped locomotion for a full-body humanoid
AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 3
Infinite-horizon policy-gradient estimation
Journal of Artificial Intelligence Research
Experiments with infinite-horizon, policy-gradient estimation
Journal of Artificial Intelligence Research
A cat-like robot real-time learning to run
ICANNGA'09 Proceedings of the 9th international conference on Adaptive and natural computing algorithms
Swarm reinforcement learning method based on an actor-critic method
SEAL'10 Proceedings of the 8th international conference on Simulated evolution and learning
ACIIDS'12 Proceedings of the 4th Asian conference on Intelligent Information and Database Systems - Volume Part I
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning