Reinforcement learning transfer via sparse coding

Authors:
Haitham B. Ammar; Karl Tuyls; Matthew E. Taylor; Kurt Driessens; Gerhard Weiss
Affiliations:
Maastricht University, The Netherlands; Maastricht University, The Netherlands; Lafayette College; Maastricht University, The Netherlands; Maastricht University, The Netherlands
Venue:
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Year:
2012

Abstract

Although reinforcement learning (RL) has been successfully deployed in a variety of tasks, learning speed remains a fundamental problem for applying RL in complex environments. Transfer learning aims to ameliorate this shortcoming by speeding up learning through the adaptation of previously learned behaviors in similar tasks. Transfer techniques often use an inter-task mapping, which determines how a pair of tasks is related. Instead of relying on a hand-coded inter-task mapping, this paper proposes a novel transfer learning method that autonomously creates an inter-task mapping by combining sparse coding, sparse projection learning, and sparse Gaussian processes. We also propose two new transfer algorithms, TrLSPI and TrFQI, based on least-squares policy iteration and fitted Q-iteration, respectively. Experiments show successful transfer not only between similar tasks (inverted pendulum to cart pole) but also between two very different domains (mountain car to cart pole). This paper empirically shows that the learned inter-task mapping can be used to (1) improve the performance of a learned policy on a fixed number of environmental samples, (2) reduce the learning time needed by the algorithms to converge to a policy on a fixed number of samples, and (3) converge faster to a near-optimal policy given a large number of samples.
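
The abstract names sparse coding as one ingredient of the learned inter-task mapping but does not give its formulation. As a rough illustration only, the sketch below shows generic L1-regularized sparse coding against a fixed dictionary, solved with iterative soft-thresholding (ISTA). The solver choice, the function names (`soft_threshold`, `objective`, `sparse_code`), the toy dictionary, and the parameter values are all assumptions made for this example and are not taken from the paper.

```python
import math

def soft_threshold(v, t):
    """Proximal operator of t * |v|: shrink v toward zero by t."""
    return math.copysign(max(abs(v) - t, 0.0), v)

def objective(D, x, a, lam):
    """0.5 * ||x - D a||^2 + lam * ||a||_1, with D given as a list of rows."""
    resid = [xi - sum(dij * aj for dij, aj in zip(row, a))
             for row, xi in zip(D, x)]
    return 0.5 * sum(r * r for r in resid) + lam * sum(abs(aj) for aj in a)

def sparse_code(D, x, lam=0.1, n_iter=500):
    """Approximate the sparse code of x over dictionary D using ISTA."""
    n_atoms = len(D[0])
    # The squared Frobenius norm upper-bounds the Lipschitz constant of the
    # gradient, so 1/L is a safe (if conservative) step size.
    L = sum(dij * dij for row in D for dij in row)
    a = [0.0] * n_atoms
    for _ in range(n_iter):
        # Residual D a - x, then gradient D^T (D a - x) of the smooth term.
        resid = [sum(dij * aj for dij, aj in zip(row, a)) - xi
                 for row, xi in zip(D, x)]
        grad = [sum(D[i][j] * resid[i] for i in range(len(D)))
                for j in range(n_atoms)]
        # Gradient step followed by soft-thresholding (the L1 proximal step).
        a = [soft_threshold(aj - g / L, lam / L) for aj, g in zip(a, grad)]
    return a

# Hypothetical toy dictionary: three atoms (columns) in a 2-D signal space.
D = [[1.0, 0.0, 0.6],
     [0.0, 1.0, 0.8]]
x = [0.6, 0.8]  # the signal coincides with the third atom
codes = sparse_code(D, x)
```

With this step size the objective never increases from one iteration to the next, so `objective(D, x, codes, 0.1)` is no larger than the objective at the all-zero code. How such codes would feed the projection learning and Gaussian process stages is specified in the paper itself, not here.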