Introduction to Reinforcement Learning
Introduction to Reinforcement Learning
Neuro-Dynamic Programming
Reinforcement Learning in POMDP's via Direct Gradient Ascent
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
SIAM Journal on Control and Optimization
Neural Networks: A Comprehensive Foundation (3rd Edition)
Neural Networks: A Comprehensive Foundation (3rd Edition)
Adaptive navigation for autonomous robots
Robotics and Autonomous Systems
Hi-index | 0.00 |
In this paper, we present Policy Methods as an alternative to Value Methods to solve Reinforcement Learning problems. The paper proposes a Direct Policy Search algorithm that uses a Neural Network to represent the control policies. Details about the algorithm and the update rules are given. The main application of the proposed algorithm is to implement robot control systems, in which the generalization problem usually arises. In this paper, we point out the suitability of our algorithm in a RL benchmark, that was specially designed to test the generalization capability of RL algorithms. Results check out that policy methods obtain better results than value methods in these situations.