Transfer of Learning by Composing Solutions of Elemental Sequential Tasks

  • Author: Satinder Pal Singh
  • Affiliation: Department of Computer Science, University of Massachusetts, Amherst, MA 01003. SATINDER@cs.umass.edu
  • Venue: Machine Learning
  • Year: 1992

Abstract

Although building sophisticated learning agents that operate in complex environments will require learning to perform multiple tasks, most applications of reinforcement learning have focused on single tasks. In this paper I consider a class of sequential decision tasks (SDTs), called composite sequential decision tasks, formed by temporally concatenating a number of elemental sequential decision tasks. Elemental SDTs cannot be decomposed into simpler SDTs. I consider a learning agent that has to learn to solve a set of elemental and composite SDTs. I assume that the structure of the composite tasks is unknown to the learning agent. The straightforward application of reinforcement learning to multiple tasks requires learning the tasks separately, which can waste computational resources, both memory and time. I present a new learning algorithm and a modular architecture that learns the decomposition of composite SDTs, and achieves transfer of learning by sharing the solutions of elemental SDTs across multiple composite SDTs. The solution of a composite SDT is constructed by computationally inexpensive modifications of the solutions of its constituent elemental SDTs. I provide a proof of one aspect of the learning algorithm.
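To make the sharing idea concrete, here is a minimal sketch in Python of elemental Q-modules being reused across composite tasks. It is not the paper's architecture: the grid environment, the function names (learn_elemental, run_composite), and the assumption that the composite task's decomposition is given rather than learned are all illustrative simplifications added here; the paper learns the decomposition itself.

```python
# Hypothetical sketch: elemental Q-modules shared across composite tasks.
# The environment, task names, and the fixed decomposition are assumptions
# for illustration, not the paper's learning architecture.
import random
from collections import defaultdict

class GridWorld:
    """Tiny deterministic grid; an elemental task = reach a given goal cell."""
    SIZE = 5
    ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]

    def __init__(self, goal):
        self.goal = goal

    def step(self, state, a_idx):
        dx, dy = self.ACTIONS[a_idx]
        nx = min(max(state[0] + dx, 0), self.SIZE - 1)
        ny = min(max(state[1] + dy, 0), self.SIZE - 1)
        next_state = (nx, ny)
        done = next_state == self.goal
        reward = 1.0 if done else -0.05
        return next_state, reward, done

def learn_elemental(goal, episodes=2000, alpha=0.5, gamma=0.95, eps=0.1):
    """Standard tabular Q-learning for one elemental task (reach `goal`)."""
    env = GridWorld(goal)
    Q = defaultdict(lambda: [0.0] * len(GridWorld.ACTIONS))
    for _ in range(episodes):
        s = (random.randrange(env.SIZE), random.randrange(env.SIZE))
        done = s == goal
        while not done:
            a = (random.randrange(4) if random.random() < eps
                 else max(range(4), key=lambda i: Q[s][i]))
            s2, r, done = env.step(s, a)
            target = r + (0.0 if done else gamma * max(Q[s2]))
            Q[s][a] += alpha * (target - Q[s][a])
            s = s2
    return Q

def run_composite(decomposition, modules, start=(0, 0)):
    """Solve a composite task by executing shared elemental modules in order.
    (The paper *learns* this decomposition; here it is given for brevity.)"""
    s, trajectory = start, [start]
    for goal in decomposition:
        env, Q = GridWorld(goal), modules[goal]
        done = s == goal
        while not done:
            a = max(range(4), key=lambda i: Q[s][i])
            s, _, done = env.step(s, a)
            trajectory.append(s)
    return trajectory

if __name__ == "__main__":
    goals = [(4, 4), (0, 4)]                          # two elemental tasks
    modules = {g: learn_elemental(g) for g in goals}  # learned once, shared
    # Two composite tasks reuse the same elemental modules in different orders.
    print(run_composite([(4, 4), (0, 4)], modules))
    print(run_composite([(0, 4), (4, 4)], modules))
```

The point of the sketch is only the reuse pattern: each elemental module is trained once and then composed into multiple composite tasks, so no per-composite-task value function needs to be learned from scratch.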