Near-Optimal Reinforcement Learning in Polynomial Time
Machine Learning
On the convergence of stochastic iterative dynamic programming algorithms
Neural Computation
Learning to act using real-time dynamic programming
Artificial Intelligence
Explaining temporal differences to create useful concepts for evaluating states
AAAI'90: Proceedings of the Eighth National Conference on Artificial Intelligence, Volume 2