Communications of the ACM
The Riccati equation
Efficient reinforcement learning
COLT '94 Proceedings of the seventh annual conference on Computational learning theory
Optimal, predictive, and adaptive control
Optimal Control of Stochastic Systems
Dynamic Programming and Stochastic Control
Expected Mistake Bound Model for On-Line Reinforcement Learning
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Reinforcement learning: a survey
Journal of Artificial Intelligence Research
PAC Bounds for Multi-armed Bandit and Markov Decision Processes
COLT '02 Proceedings of the 15th Annual Conference on Computational Learning Theory
Exploring compact reinforcement-learning representations with linear regression
UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence