Apprenticeship learning via inverse reinforcement learning

Authors:
Pieter Abbeel; Andrew Y. Ng
Affiliations:
Stanford University, Stanford, CA; Stanford University, Stanford, CA
Venue:
ICML '04 Proceedings of the twenty-first international conference on Machine learning
Year:
2004

Citing 8
Cited 95

ALVINN: an autonomous land vehicle in a neural network

Advances in neural information processing systems 1
The nature of statistical learning theory

The nature of statistical learning theory
Learning to Fly

ML '92 Proceedings of the Ninth International Workshop on Machine Learning
Robot Learning From Demonstration

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping

ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Algorithms for Inverse Reinforcement Learning

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Learning Movement Sequences from Demonstration

ICDL '02 Proceedings of the 2nd International Conference on Development and Learning
Apprenticeship learning via inverse reinforcement learning

ICML '04 Proceedings of the twenty-first international conference on Machine learning

Exploration and apprenticeship learning in reinforcement learning

ICML '05 Proceedings of the 22nd international conference on Machine learning
Recognition and reproduction of gestures using a probabilistic framework combining PCA, ICA and HMM

ICML '05 Proceedings of the 22nd international conference on Machine learning
Dynamic preferences in multi-criteria reinforcement learning

ICML '05 Proceedings of the 22nd international conference on Machine learning
Qualitative reinforcement learning

ICML '06 Proceedings of the 23rd international conference on Machine learning
Maximum margin planning

ICML '06 Proceedings of the 23rd international conference on Machine learning
Learnable behavioural model for autonomous virtual agents: low-level learning

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
Confidence-based policy learning from demonstration using Gaussian mixture models

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Multi-thresholded approach to demonstration selection for interactive robot learning

Proceedings of the 3rd ACM/IEEE international conference on Human robot interaction
Learning all optimal policies with multiple criteria

Proceedings of the 25th international conference on Machine learning
Learning for control from multiple demonstrations

Proceedings of the 25th international conference on Machine learning
Apprenticeship learning using linear programming

Proceedings of the 25th international conference on Machine learning
Autonomous agent learning using an actor-critic algorithm and behavior models

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
Navigate like a cabbie: probabilistic reasoning from observed context-aware behavior

UbiComp '08 Proceedings of the 10th international conference on Ubiquitous computing
Imitation Learning Using Graphical Models

ECML '07 Proceedings of the 18th European conference on Machine Learning
Transfer in variable-reward hierarchical reinforcement learning

Machine Learning
Creating and using matrix representations of social interaction

Proceedings of the 4th ACM/IEEE international conference on Human robot interaction
A survey of robot learning from demonstration

Robotics and Autonomous Systems
Linear Bellman combination for control of character animation

ACM SIGGRAPH 2009 papers
Apprenticeship learning for helicopter control

Communications of the ACM - Barbara Liskov: ACM's A.M. Turing Award Winner
Experiments with Adaptive Transfer Rate in Reinforcement Learning

Knowledge Acquisition: Approaches, Algorithms and Applications
Learning to search: Functional gradient techniques for imitation learning

Autonomous Robots
Active Learning for Reward Estimation in Inverse Reinforcement Learning

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Active imitation learning

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 1
Maximum entropy inverse reinforcement learning

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Interactive policy learning through confidence-based autonomy

Journal of Artificial Intelligence Research
Behavior bounding: an efficient method for high-level behavior comparison

Journal of Artificial Intelligence Research
Bayesian inverse reinforcement learning

IJCAI'07 Proceedings of the 20th international joint conference on Artificial intelligence
Inverse reinforcement learning in partially observable environments

IJCAI'09 Proceedings of the 21st international joint conference on Artificial intelligence
Training parsers by inverse reinforcement learning

Machine Learning
Memory-enhanced evolutionary robotics: the echo state network approach

CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation
Learning motor primitives for robotics

ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
Planning-based prediction for pedestrians

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Transparent active learning for robots

Proceedings of the 5th ACM/IEEE international conference on Human-robot interaction
Learning behavior styles with inverse reinforcement learning

ACM SIGGRAPH 2010 papers
Non-parametric Learning to Aid Path Planning over Slopes

International Journal of Robotics Research
Learning to follow navigational directions

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Analysis of Inverse Reinforcement Learning with Perturbed Demonstrations

Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
Performance measurement and its role in advancement for intelligent systems: discussion points

PerMIS '09 Proceedings of the 9th Workshop on Performance Metrics for Intelligent Systems
Learning from Demonstration for Autonomous Navigation in Complex Unstructured Terrain

International Journal of Robotics Research
2010 Special Issue: Applying machine learning to infant interaction: The development is in the details

Neural Networks
A Human-Robot Collaborative Reinforcement Learning Algorithm

Journal of Intelligent and Robotic Systems
Learning from demonstration using MDP induced metrics

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part II
Autonomous Helicopter Aerobatics through Apprenticeship Learning

International Journal of Robotics Research
Adaptation-based programming in Java

Proceedings of the 20th ACM SIGPLAN workshop on Partial evaluation and program manipulation
The Stanford LittleDog: A learning and rapid replanning approach to quadruped locomotion

International Journal of Robotics Research
Dynamic reward shaping: training a robot by voice

IBERAMIA'10 Proceedings of the 12th Ibero-American conference on Advances in artificial intelligence
Human and robot perception in large-scale learning from demonstration

Proceedings of the 6th international conference on Human-robot interaction
Robot self-initiative and personalization by learning through repeated interactions

Proceedings of the 6th international conference on Human-robot interaction
Human-assisted neuroevolution through shaping, advice and examples

Proceedings of the 13th annual conference on Genetic and evolutionary computation
Inverse Reinforcement Learning in Partially Observable Environments

The Journal of Machine Learning Research
Preference-based policy learning

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part I
Preference elicitation and inverse reinforcement learning

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part III
Comparing action-query strategies in semi-autonomous agents

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
Reinforcement learning and apprenticeship learning for robotic control

ALT'06 Proceedings of the 17th international conference on Algorithmic Learning Theory
Reinforcement learning utilizes proxemics: An avatar learns to manipulate the position of people in immersive virtual reality

ACM Transactions on Applied Perception (TAP)
Trajectories and keyframes for kinesthetic teaching: a human-robot interaction perspective

HRI '12 Proceedings of the seventh annual ACM/IEEE international conference on Human-Robot Interaction
Robot learning from demonstration by constructing skill trees

International Journal of Robotics Research
Probabilistic pointing target prediction via inverse optimal control

Proceedings of the 2012 ACM international conference on Intelligent User Interfaces
Teaching a robot to perform task through imitation and on-line feedback

CIARP'11 Proceedings of the 16th Iberoamerican Congress conference on Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
Improving biped walk stability with complementary corrective demonstration

Autonomous Robots
Automatic state abstraction from demonstration

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Two
Imitation learning in relational domains: a functional-gradient boosting approach

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Two
Robust Bayesian reinforcement learning through tight lower bounds

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Bayesian multitask inverse reinforcement learning

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Batch, off-policy and model-free apprenticeship learning

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Automatic task decomposition and state abstraction from demonstration

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
Learning and verifying safety constraints for planners in a knowledge-impoverished system

Computational Intelligence
Faster program adaptation through reward attribution inference

Proceedings of the 11th International Conference on Generative Programming and Component Engineering
Learning to interpret natural language instructions

SIAC '12 Proceedings of the Second Workshop on Semantic Interpretation in an Actionable Context
Besting the quiz master: crowdsourcing incremental classification games

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Activity forecasting

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part IV
APRIL: active preference learning-based reinforcement learning

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Bayesian nonparametric inverse reinforcement learning

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Structured apprenticeship learning

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Active learning of inverse models with intrinsically motivated goal exploration in robots

Robotics and Autonomous Systems
Human behavior understanding for robotics

HBU'12 Proceedings of the Third international conference on Human Behavior Understanding
Bayesian Learning of Noisy Markov Decision Processes

ACM Transactions on Modeling and Computer Simulation (TOMACS) - Special Issue on Monte Carlo Methods in Statistics
Stochastic optimal control methods for investigating the power of morphological computation

Artificial Life
Human-robot cross-training: computational formulation, modeling and evaluation of a human team training strategy

Proceedings of the 8th ACM/IEEE international conference on Human-robot interaction
Legibility and predictability of robot motion

Proceedings of the 8th ACM/IEEE international conference on Human-robot interaction
Generalizing hyper-heuristics via apprenticeship learning

EvoCOP'13 Proceedings of the 13th European conference on Evolutionary Computation in Combinatorial Optimization
Using informative behavior to increase engagement in the TAMER framework

Proceedings of the 2013 international conference on Autonomous agents and multi-agent systems
Inverse reinforcement learning for interactive systems

Proceedings of the 2nd Workshop on Machine Learning for Interactive Systems: Bridging the Gap Between Perception, Action and Communication
A policy-blending formalism for shared control

International Journal of Robotics Research
Probabilistic movement modeling for intention inference in human-robot interaction

International Journal of Robotics Research
Scenario Trees and Policy Selection for Multistage Stochastic Programming Using Machine Learning

INFORMS Journal on Computing
Motion planning and reactive control on learnt skill manifolds

International Journal of Robotics Research
Reinforcement learning in robotics: A survey

International Journal of Robotics Research
Bayesian nonparametric feature construction for inverse reinforcement learning

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Learning via human feedback in continuous state and action spaces

Applied Intelligence
Embodied imitation-enhanced reinforcement learning in multi-agent systems

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Socially guided intrinsic motivation for robot learning of motor skills

Autonomous Robots
Learning web-service task descriptions from traces

Web Intelligence and Agent Systems
A tour of machine learning: An AI perspective

AI Communications - ECAI 2012 Turing and Anniversary Track

Abstract

We consider learning in a Markov decision process where we are not explicitly given a reward function, but where instead we can observe an expert demonstrating the task that we want to learn to perform. This setting is useful in applications (such as the task of driving) where it may be difficult to write down an explicit reward function specifying exactly how different desiderata should be traded off. We think of the expert as trying to maximize a reward function that is expressible as a linear combination of known features, and give an algorithm for learning the task demonstrated by the expert. Our algorithm is based on using "inverse reinforcement learning" to try to recover the unknown reward function. We show that our algorithm terminates in a small number of iterations, and that even though we may never recover the expert's reward function, the policy output by the algorithm will attain performance close to that of the expert, where here performance is measured with respect to the expert's unknown reward function.
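
The abstract does not spell out the algorithm, so the following is only a minimal sketch of how an iterative loop of this kind might look, loosely modeled on a projection-style update. It shows the core idea: because the reward is assumed linear in known features, a policy whose discounted feature expectations are close to the expert's also earns close to the expert's reward. Everything concrete here is an assumption made for illustration and does not come from the record above: the five-state line world, the one-hot features, the constants, and the function names (`step`, `feature_expectations`, `optimal_policy`, `apprenticeship`).

```python
import math

N_STATES = 5          # states 0..4 on a line (toy assumption)
ACTIONS = (-1, +1)    # move left or right
GAMMA = 0.9           # discount factor (assumed)
START = 2             # fixed start state (assumed)
HORIZON = 200         # GAMMA**200 is negligible, so truncating rollouts is harmless


def step(s, a):
    """Deterministic transition: move, then clamp to the ends of the line."""
    return min(max(s + a, 0), N_STATES - 1)


def feature_expectations(policy):
    """Discounted visit counts of one-hot state features under `policy`."""
    mu = [0.0] * N_STATES
    s, discount = START, 1.0
    for _ in range(HORIZON):
        mu[s] += discount
        s = step(s, policy[s])
        discount *= GAMMA
    return mu


def optimal_policy(w, iters=200):
    """Value iteration for the reward R(s) = w[s], then a greedy policy."""
    v = [0.0] * N_STATES
    for _ in range(iters):
        v = [w[s] + GAMMA * max(v[step(s, a)] for a in ACTIONS)
             for s in range(N_STATES)]
    return [max(ACTIONS, key=lambda a: v[step(s, a)]) for s in range(N_STATES)]


def apprenticeship(mu_expert, eps=1e-6, max_iters=50):
    """Projection-style loop; returns the policies tried and the final gap."""
    policy = [-1] * N_STATES                  # arbitrary initial policy
    policies = [policy]
    mu_bar = feature_expectations(policy)
    gap = math.inf
    for _ in range(max_iters):
        # Candidate reward weights: direction from current point to the expert.
        w = [e - b for e, b in zip(mu_expert, mu_bar)]
        gap = math.sqrt(sum(x * x for x in w))
        if gap <= eps:
            break
        policy = optimal_policy(w)            # the "RL step" for reward w
        policies.append(policy)
        mu = feature_expectations(policy)
        # Project the expert's point onto the line through mu_bar and mu.
        d = [m - b for m, b in zip(mu, mu_bar)]
        denom = sum(x * x for x in d)
        if denom == 0.0:
            break                             # new policy adds nothing
        coef = sum(x * y for x, y in zip(d, w)) / denom
        mu_bar = [b + coef * x for b, x in zip(mu_bar, d)]
    return policies, gap


expert_policy = [+1] * N_STATES               # hypothetical expert: always go right
mu_expert = feature_expectations(expert_policy)
policies, gap = apprenticeship(mu_expert)
print("policies tried:", policies)
print("final feature-expectation gap:", gap)
```

In this toy setting the loop never sees the expert's reward, only the expert's feature expectations, which mirrors the abstract's point that the reward itself need not be recovered for the learned policy to perform comparably.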