Dynamic programming: deterministic and stochastic models
Dynamic programming: deterministic and stochastic models
Technical Note: \cal Q-Learning
Machine Learning
Rigorous learning curve bounds from statistical mechanics
COLT '94 Proceedings of the seventh annual conference on Computational learning theory
Efficient reinforcement learning
COLT '94 Proceedings of the seventh annual conference on Computational learning theory
Learning to act using real-time dynamic programming
Artificial Intelligence - Special volume on computational research on interaction and agency, part 1
Learning to Predict by the Methods of Temporal Differences
Machine Learning
Learning curve bounds for a Markov decision process with undiscounted rewards
COLT '96 Proceedings of the ninth annual conference on Computational learning theory
Hi-index | 0.00 |