Bias-Variance Error Bounds for Temporal Difference Updates

Authors:
Michael J. Kearns;Satinder P. Singh
Affiliations:
-;-
Venue:
COLT '00 Proceedings of the Thirteenth Annual Conference on Computational Learning Theory
Year:
2000

SarsaLandmark: an algorithm for learning in POMDPs with landmarks

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
From Q(λ) to average Q-learning: efficient implementation of an asymptotic approximation

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Error bounds in reinforcement learning policy evaluation

AI'05 Proceedings of the 18th Canadian Society conference on Advances in Artificial Intelligence
Recursive least-squares learning with eligibility traces

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning

Hi-index	0.00