Bayesian inverse reinforcement learning

Authors:
Deepak Ramachandran;Eyal Amir
Affiliations:
Computer Science Dept., University of Illinois at Urbana-Champaign, Urbana, IL;Computer Science Dept., University of Illinois at Urbana-Champaign, Urbana, IL
Venue:
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Year:
2007

Citing 10
Cited 23

An introduction to the Ising model

American Mathematical Monthly
Sampling and integration of near log-concave functions

STOC '91 Proceedings of the twenty-third annual ACM symposium on Theory of computing
Learning agents for uncertain environments (extended abstract)

COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Opponent modeling in poker

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Robot Learning From Demonstration

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Algorithms for Inverse Reinforcement Learning

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Apprenticeship learning via inverse reinforcement learning

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Inverse Problem Theory and Methods for Model Parameter Estimation

Inverse Problem Theory and Methods for Model Parameter Estimation
A Bayesian approach to imitation in reinforcement learning

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence

Learning for control from multiple demonstrations

Proceedings of the 25th international conference on Machine learning
Apprenticeship learning for helicopter control

Communications of the ACM - Barbara Liskov: ACM's A.M. Turing Award Winner
Active Learning for Reward Estimation in Inverse Reinforcement Learning

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
Value-based policy teaching with active indirect elicitation

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 1
Maximum entropy inverse reinforcement learning

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Behavior bounding: an efficient method for high-level behavior comparison

Journal of Artificial Intelligence Research
A Computational Model of Social-Learning Mechanisms

Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
Inverse reinforcement learning in partially observable environments

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Planning-based prediction for pedestrians

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Analysis of Inverse Reinforcement Learning with Perturbed Demonstrations

Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
2010 Special Issue: Applying machine learning to infant interaction: The development is in the details

Neural Networks
Learning from demonstration using MDP induced metrics

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part II
Autonomous Helicopter Aerobatics through Apprenticeship Learning

International Journal of Robotics Research
Human-assisted neuroevolution through shaping, advice and examples

Proceedings of the 13th annual conference on Genetic and evolutionary computation
Inverse Reinforcement Learning in Partially Observable Environments

The Journal of Machine Learning Research
Preference elicitation and inverse reinforcement learning

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part III
Comparing action-query strategies in semi-autonomous agents

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
Bayesian multitask inverse reinforcement learning

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Batch, off-policy and model-free apprenticeship learning

EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Bayesian nonparametric inverse reinforcement learning

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Structured apprenticeship learning

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
Apprenticeship learning with few examples

Neurocomputing
Bayesian nonparametric feature construction for inverse reinforcement learning

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

Inverse Reinforcement Learning (IRL) is the problem of learning the reward function underlying a Markov Decision Process given the dynamics of the system and the behaviour of an expert. IRL is motivated by situations where knowledge of the rewards is a goal by itself (as in preference elicitation) and by the task of apprenticeship learning (learning policies from an expert). In this paper we show how to combine prior knowledge and evidence from the expert's actions to derive a probability distribution over the space of reward functions. We present efficient algorithms that find solutions for the reward learning and apprenticeship learning tasks that generalize well over these distributions. Experimental results show strong improvement for our methods over previous heuristic-based approaches.