Autonomous shaping: knowledge transfer in reinforcement learning

Authors:
George Konidaris;Andrew Barto
Affiliations:
University of Massachusetts at Amherst;University of Massachusetts at Amherst
Venue:
ICML '06 Proceedings of the 23rd international conference on Machine learning
Year:
2006

Citing 17
Cited 30

Real-time heuristic search

Artificial Intelligence
Prioritized Sweeping: Reinforcement Learning with Less Data and Less Time

Machine Learning
Learning to act using real-time dynamic programming

Artificial Intelligence - Special volume on computational research on interaction and agency, part 1
The effect of representation and knowledge on goal-directed exploration with reinforcement-learning algorithms

Machine Learning - Special issue on reinforcement learning
Exploring unknown environments with real-time search or reinforcement learning

Proceedings of the 1998 conference on Advances in neural information processing systems II
Reinforcement Learning

Reinforcement Learning
Robot Shaping: An Experiment in Behavior Engineering

Robot Shaping: An Experiment in Behavior Engineering
Reinforcement Learning in the Multi-Robot Domain

Autonomous Robots
PolicyBlocks: An Algorithm for Creating Useful Macro-Actions in Reinforcement Learning

ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping

ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Learning to Drive a Bicycle Using Reinforcement Learning and Shaping

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Efficient Exploration In Reinforcement Learning

Efficient Exploration In Reinforcement Learning
Reusing Old Policies to Accelerate Learning on New MDPs TITLE2:

Reusing Old Policies to Accelerate Learning on New MDPs TITLE2:
Learning and value function approximation in complex decision processes

Learning and value function approximation in complex decision processes
Proto-value functions: developmental reinforcement learning

ICML '05 Proceedings of the 22nd international conference on Machine learning
Value functions for RL-based behavior transfer: a comparative study

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Potential-based shaping and Q-value initialization are equivalent

Journal of Artificial Intelligence Research

Automatic shaping and decomposition of reward functions

Proceedings of the 24th international conference on Machine learning
On the role of tracking in stationary environments

Proceedings of the 24th international conference on Machine learning
Multi-task reinforcement learning: a hierarchical Bayesian approach

Proceedings of the 24th international conference on Machine learning
Towards reinforcement learning representation transfer

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Co-evolution of Shaping Rewards and Meta-Parameters in Reinforcement Learning

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Trajectory prediction: learning to map situations to robot trajectories

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Experiments with Adaptive Transfer Rate in Reinforcement Learning

Knowledge Acquisition: Approaches, Algorithms and Applications
Learning and multiagent reasoning for autonomous agents

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Building portable options: skill transfer in reinforcement learning

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Knowledge-based recurrent neural networks in Reinforcement Learning

ASC '07 Proceedings of The Eleventh IASTED International Conference on Artificial Intelligence and Soft Computing
Transfer Learning beyond Text Classification

ACML '09 Proceedings of the 1st Asian Conference on Machine Learning: Advances in Machine Learning
Transfer of knowledge for a climbing virtual human: a reinforcement learning approach

ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
2010 Special Issue: Online learning of shaping rewards in reinforcement learning

Neural Networks
Transfer Learning for Reinforcement Learning Domains: A Survey

The Journal of Machine Learning Research
The more you learn, the less you store: Memory-controlled incremental SVM for visual place recognition

Image and Vision Computing
Skill combination for reinforcement learning

IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
Multi-task evolutionary shaping without pre-specified representations

Proceedings of the 12th annual conference on Genetic and evolutionary computation
Structural knowledge transfer by spatial abstraction for reinforcement learning agents

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Dynamic reward shaping: training a robot by voice

IBERAMIA'10 Proceedings of the 12th Ibero-American conference on Advances in artificial intelligence
Abstraction and generalization in reinforcement learning: a summary and framework

ALA'09 Proceedings of the Second international conference on Adaptive and Learning Agents
Teaching a robot to perform task through imitation and on-line feedback

CIARP'11 Proceedings of the 16th Iberoamerican Congress conference on Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
Reinforcement learning transfer via common subspaces

ALA'11 Proceedings of the 11th international conference on Adaptive and Learning Agents
Multi-Task reinforcement learning: shaping and feature selection

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Transfer learning in multi-agent reinforcement learning domains

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Reinforcement learning transfer via sparse coding

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Autonomous shaping via coevolutionary selection of training experience

PPSN'12 Proceedings of the 12th international conference on Parallel Problem Solving from Nature - Volume Part II
Fast motion planning from experience: trajectory prediction for speeding up movement generation

Autonomous Robots
Transferring task models in Reinforcement Learning agents

Neurocomputing
Machine learning for interactive systems and robots: a brief introduction

Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems: Bridging the Gap Between Perception, Action and Communication
Learning potential functions and their representations for multi-task reinforcement learning

Autonomous Agents and Multi-Agent Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

We introduce the use of learned shaping rewards in reinforcement learning tasks, where an agent uses prior experience on a sequence of tasks to learn a portable predictor that estimates intermediate rewards, resulting in accelerated learning in later tasks that are related but distinct. Such agents can be trained on a sequence of relatively easy tasks in order to develop a more informative measure of reward that can be transferred to improve performance on more difficult tasks without requiring a hand coded shaping function. We use a rod positioning task to show that this significantly improves performance even after a very brief training period.