Apprenticeship learning via inverse reinforcement learning
ICML '04 Proceedings of the twenty-first international conference on Machine learning
We consider learning in a Markov decision process where we are not explicitly given a reward function, but where instead we can observe an expert demonstrating the task that we want to learn to perform. This setting is useful in applications (such as the task of driving) where it may be difficult to write down an explicit reward function specifying exactly how different desiderata should be traded off. We think of the expert as trying to maximize a reward function that is expressible as a linear combination of known features, and give an algorithm for learning the task demonstrated by the expert. Our algorithm is based on using "inverse reinforcement learning" to try to recover the unknown reward function. We show that our algorithm terminates in a small number of iterations, and that even though we may never recover the expert's reward function, the policy output by the algorithm will attain performance close to that of the expert, where here performance is measured with respect to the expert's unknown reward function.
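The linear-reward assumption can be illustrated with a minimal sketch in Python. It is not the algorithm from the paper, whose steps the abstract does not spell out. It only assumes that the reward has the form R(s) = w · phi(s) for a known feature map phi, and that returns are discounted by a factor gamma. The function names, the toy trajectories, and the weight vector w are all hypothetical. Under these assumptions, the return of a policy depends on its behavior only through its discounted feature expectations, so by the Cauchy-Schwarz inequality a learner whose feature expectations are close to the expert's has a return close to the expert's for any such w, including one that is never recovered.

```python
from math import sqrt
from typing import List, Sequence

# A trajectory is a list of feature vectors phi(s_t), one per time step.
Trajectory = List[Sequence[float]]


def feature_expectations(trajectories: List[Trajectory], gamma: float) -> List[float]:
    """Average over trajectories of sum_t gamma**t * phi(s_t)."""
    dim = len(trajectories[0][0])
    mu = [0.0] * dim
    for trajectory in trajectories:
        for t, phi in enumerate(trajectory):
            discount = gamma ** t
            for i in range(dim):
                mu[i] += discount * phi[i]
    return [m / len(trajectories) for m in mu]


def linear_return(w: Sequence[float], mu: Sequence[float]) -> float:
    """Expected discounted return when the reward is R(s) = w . phi(s)."""
    return sum(wi * mi for wi, mi in zip(w, mu))


def l2_norm(v: Sequence[float]) -> float:
    return sqrt(sum(x * x for x in v))


# Toy data (hypothetical): two features, two expert demonstrations, one learner rollout.
gamma = 0.5
expert_demos = [[(1.0, 0.0), (1.0, 0.0)], [(1.0, 0.0), (0.0, 1.0)]]
learner_rollouts = [[(0.0, 1.0), (1.0, 0.0)]]

mu_expert = feature_expectations(expert_demos, gamma)
mu_learner = feature_expectations(learner_rollouts, gamma)

# Any weight vector works here; the learner never needs to know it.
w = (0.6, -0.8)
gap = abs(linear_return(w, mu_expert) - linear_return(w, mu_learner))
bound = l2_norm(w) * l2_norm([a - b for a, b in zip(mu_expert, mu_learner)])

print("return gap:", gap)
print("Cauchy-Schwarz bound:", bound)
```

The printed gap never exceeds the bound, whichever `w` is chosen. This is why matching feature expectations is a natural target when the reward weights are unknown.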