Transfer via inter-task mappings in policy search reinforcement learning

Authors:
Matthew E. Taylor;Shimon Whiteson;Peter Stone
Affiliations:
The University of Texas at Austin, Austin, Texas;The University of Texas at Austin, Austin, Texas;The University of Texas at Austin, Austin, Texas
Venue:
Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Year:
2007

Citing 13
Cited 22

Markov Decision Processes: Discrete Stochastic Dynamic Programming

Markov Decision Processes: Discrete Stochastic Dynamic Programming
International Workshop on Combinations of Genetic Algorithms and Neural Networks

International Workshop on Combinations of Genetic Algorithms and Neural Networks
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Evolving neural networks through augmenting topologies

Evolutionary Computation
Utility Functions in Autonomic Systems

ICAC '04 Proceedings of the First International Conference on Autonomic Computing
Behavior transfer for value-function-based reinforcement learning

Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Comparing evolutionary and temporal difference methods in a reinforcement learning domain

Proceedings of the 8th annual conference on Genetic and evolutionary computation
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Evolutionary Function Approximation for Reinforcement Learning

The Journal of Machine Learning Research
Using Homomorphisms to transfer options across continuous reinforcement learning domains

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Generalizing plans to new environments in relational MDPs

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Keepaway soccer: from machine learning testbed to benchmark

RoboCup 2005
Using advice to transfer knowledge acquired in one reinforcement learning task to another

ECML'05 Proceedings of the 16th European conference on Machine Learning

Autonomous transfer for reinforcement learning

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 1
A new perspective to the keepaway soccer: the takers

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Graph Laplacian based transfer learning in reinforcement learning

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Graph-Based Domain Mapping for Transfer Learning in General Games

ECML '07 Proceedings of the 18th European conference on Machine Learning
Transfer Learning in Reinforcement Learning Problems Through Partial Policy Recycling

ECML '07 Proceedings of the 18th European conference on Machine Learning
Transfer via soft homomorphisms

Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Experiments with Adaptive Transfer Rate in Reinforcement Learning

Knowledge Acquisition: Approaches, Algorithms and Applications
Mapping and revising Markov logic networks for transfer learning

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Autonomous inter-task transfer in reinforcement learning domains

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Transfer Learning for Reinforcement Learning Domains: A Survey

The Journal of Machine Learning Research
Skill combination for reinforcement learning

IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
Transfer learning through indirect encoding

Proceedings of the 12th annual conference on Genetic and evolutionary computation
Evolving Static Representations for Task Transfer

The Journal of Machine Learning Research
Reinforcement learning through global stochastic search in N-MDPs

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part II
Transferring evolved reservoir features in reinforcement learning tasks

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Transfer learning via multiple inter-task mappings

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Reinforcement learning transfer via sparse coding

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Budgeted knowledge transfer for state-wise heterogeneous RL agents

ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part I
Reinforcement learning transfer using a sparse coded inter-task mapping

EUMAS'11 Proceedings of the 9th European conference on Multi-Agent Systems
Transferring task models in Reinforcement Learning agents

Neurocomputing
A survey on team strategies in robot soccer: team strategies and role description

Artificial Intelligence Review
Learning potential functions and their representations for multi-task reinforcement learning

Autonomous Agents and Multi-Agent Systems

Quantified Score

Hi-index	0.02

Visualization

Abstract

The ambitious goal of transfer learning is to accelerate learning on a target task after training on a different, but related, source task. While many past transfer methods have focused on transferring value-functions, this paper presents a method for transferring policies across tasks with different state and action spaces. In particular, this paper utilizes transfer via inter-task mappings for policy search methods (TVITM-PS) to construct a transfer functional that translates a population of neural network policies trained via policy search from a source task to a target task. Empirical results in robot soccer Keepaway and Server Job Scheduling show that TVITM-PS can markedly reduce learning time when full inter-task mappings are available. The results also demonstrate that TVITMPS still succeeds when given only incomplete inter-task mappings. Furthermore, we present a novel method for learning such mappings when they are not available, and give results showing they perform comparably to hand-coded mappings.