Integrating reinforcement learning with human demonstrations of varying ability

Authors:
Matthew E. Taylor;Halit Bener Suay;Sonia Chernova
Affiliations:
Lafayette College;Worcester Polytechnic Institute;Worcester Polytechnic Institute
Venue:
The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 2
Year:
2011

Citing 23
Cited 5

TD-Gammon, a self-teaching backgammon program, achieves master-level play

Neural Computation
Reinforcement learning with replacing eligibility traces

Machine Learning - Special issue on reinforcement learning
Creating advice-taking reinforcement learners

Machine Learning - Special issue on reinforcement learning
Markov Decision Processes: Discrete Stochastic Dynamic Programming

Markov Decision Processes: Discrete Stochastic Dynamic Programming
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Integrated learning for interactive synthetic characters

Proceedings of the 29th annual conference on Computer graphics and interactive techniques
Robot Learning From Demonstration

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Cross-domain transfer for reinforcement learning

Proceedings of the 24th international conference on Machine learning
Multi-task reinforcement learning: a hierarchical Bayesian approach

Proceedings of the 24th international conference on Machine learning
Transfer Learning via Inter-Task Mappings for Temporal Difference Learning

The Journal of Machine Learning Research
Autonomous Learning of Stable Quadruped Locomotion

RoboCup 2006: Robot Soccer World Cup X
A survey of robot learning from demonstration

Robotics and Autonomous Systems
Interactively shaping agents via human reinforcement: the TAMER framework

Proceedings of the fifth international conference on Knowledge capture
Potential-based shaping and Q-value initialization are equivalent

Journal of Artificial Intelligence Research
Accelerating reinforcement learning through implicit imitation

Journal of Artificial Intelligence Research
Training and tracking in robotics

IJCAI'85 Proceedings of the 9th international joint conference on Artificial intelligence - Volume 1
Transfer Learning for Reinforcement Learning Domains: A Survey

The Journal of Machine Learning Research
Relational macros for transfer in reinforcement learning

ILP'07 Proceedings of the 17th international conference on Inductive logic programming
Probabilistic Policy Reuse for inter-task transfer learning

Robotics and Autonomous Systems
Combining manual feedback with subsequent MDP reward signals for reinforcement learning

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
Keepaway soccer: from machine learning testbed to benchmark

RoboCup 2005
Using advice to transfer knowledge acquired in one reinforcement learning task to another

ECML'05 Proceedings of the 16th European conference on Machine Learning

Policy transfer in mobile robots using neuro-evolutionary navigation

Proceedings of the 14th annual conference companion on Genetic and evolutionary computation
Reinforcement learning from simultaneous human and MDP reward

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Strategy-Based learning through communication with humans

KES-AMSTA'12 Proceedings of the 6th KES international conference on Agent and Multi-Agent Systems: technologies and applications
Multi model transfer learning with RULES family

MLDM'13 Proceedings of the 9th international conference on Machine Learning and Data Mining in Pattern Recognition
A comparison between a communication-based and a data mining-based learning approach for agents

Intelligent Decision Technologies

Quantified Score

Hi-index	0.00

Visualization

Abstract

This work introduces Human-Agent Transfer (HAT), an algorithm that combines transfer learning, learning from demonstration and reinforcement learning to achieve rapid learning and high performance in complex domains. Using experiments in a simulated robot soccer domain, we show that human demonstrations transferred into a baseline policy for an agent and refined using reinforcement learning significantly improve both learning time and policy performance. Our evaluation compares three algorithmic approaches to incorporating demonstration rule summaries into transfer learning, and studies the impact of demonstration quality and quantity, as well as the effect of combining demonstrations from multiple teachers. Our results show that all three transfer methods lead to statistically significant improvement in performance over learning without demonstration. The best performance was achieved by combining the best demonstrations from two teachers.