Doina Precup;Richard S. Sutton
-;-
An Efficient Gradient Forecasting Search Method Utilizing the Discrete Difference Equation Prediction Model
Applied Intelligence
Adaptive stepsizes for recursive estimation with applications in approximate dynamic programming
Machine Learning
A worst-case comparison between temporal difference and residual gradient with linear function approximation
Proceedings of the 25th international conference on Machine learning