Technical Note: \cal Q-Learning
Machine Learning
Rigorous learning curve bounds from statistical mechanics
COLT '94 Proceedings of the seventh annual conference on Computational learning theory
Efficient reinforcement learning
COLT '94 Proceedings of the seventh annual conference on Computational learning theory
Reinforcement learning algorithms for average-payoff Markovian decision processes
AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Learning to act using real-time dynamic programming
Artificial Intelligence - Special volume on computational research on interaction and agency, part 1
Markov decision processes in large state spaces
COLT '95 Proceedings of the eighth annual conference on Computational learning theory
Dynamic Programming and Optimal Control, Two Volume Set
Dynamic Programming and Optimal Control, Two Volume Set
Learning to Predict by the Methods of Temporal Differences
Machine Learning
Analytical Mean Squared Error Curves for Temporal DifferenceLearning
Machine Learning
Near-Optimal Reinforcement Learning in Polynomial Time
Machine Learning
Hi-index | 0.00 |