Journal of the ACM (JACM)
Model-based average reward reinforcement learning
Artificial Intelligence
Machine Learning - Special issue on context sensitivity and concept drift
The Nonstochastic Multiarmed Bandit Problem
SIAM Journal on Computing
Near-Optimal Reinforcement Learning in Polynomial Time
Machine Learning
Keepaway Soccer: A Machine Learning Testbed
RoboCup 2001: Robot Soccer World Cup V
Behavior transfer for value-function-based reinforcement learning
Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
SMDP homomorphisms: an algebraic approach to abstraction in semi-Markov decision processes
IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Multi-armed bandit algorithms and empirical evaluation
ECML'05 Proceedings of the 16th European conference on Machine Learning
Graph-Based Analysis of Human Transfer Learning Using a Game Testbed
IEEE Transactions on Knowledge and Data Engineering
Autonomous transfer for reinforcement learning
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
On policy learning in restricted policy spaces
AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Robust distance metric learning with auxiliary knowledge
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Transfer Learning for Reinforcement Learning Domains: A Survey
The Journal of Machine Learning Research
Transfer learning through indirect encoding
Proceedings of the 12th annual conference on Genetic and evolutionary computation
Evolving Static Representations for Task Transfer
The Journal of Machine Learning Research
Transfer learning via multiple inter-task mappings
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Reinforcement learning transfer via sparse coding
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Strong mitigation: nesting search for good policies within search for good reward
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Reinforcement learning transfer using a sparse coded inter-task mapping
EUMAS'11 Proceedings of the 9th European conference on Multi-Agent Systems
Hi-index | 0.02 |
A long-lived agent continually faces new tasks in its environment. Such an agent may be able to use knowledge learned in solving earlier tasks to produce candidate policies for its current task. There may, however, be multiple reasonable policies suggested by prior experience, and the agent must choose between them potentially without any a priori knowledge about their applicability to its current situation. We present an "experts" algorithm for efficiently choosing amongst candidate policies in solving an unknown Markov decision process task. We conclude with the results of experiments on two domains in which we generate candidate policies from solutions to related tasks and use our experts algorithm to choose amongst them.