Practical solution techniques for first-order MDPs

Authors:
Scott Sanner;Craig Boutilier
Affiliations:
Statistical Machine Learning Group, National ICT Australia, Canberra, ACT, 0200, Australia;Department of Computer Science, University of Toronto, Toronto, ON M5S 3H5, Canada
Venue:
Artificial Intelligence
Year:
2009

Citing 56
Cited 15

Generalized subsumption and its applications to induction and redundancy

Artificial Intelligence
ADL: exploring the middle ground between STRIPS and the situation calculus

Proceedings of the first international conference on Principles of knowledge representation and reasoning
The frame problem in situation the calculus: a simple solution (sometimes) and a completeness result for goal regression

Artificial intelligence and mathematical theory of computation
Learning by analogical reasoning in general problem-solving

Learning by analogical reasoning in general problem-solving
Feature-based methods for large scale dynamic programming

Machine Learning - Special issue on reinforcement learning
Abstraction and approximate decision-theoretic planning

Artificial Intelligence
The independent choice logic for modelling multiple agents under uncertainty

Artificial Intelligence - Special issue on economic principles of multi-agent systems
Algebraic decision diagrams and their applications

ICCAD '93 Proceedings of the 1993 IEEE/ACM international conference on Computer-aided design
Solving very large weakly coupled Markov decision processes

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Reinforcement learning with hierarchies of machines

NIPS '97 Proceedings of the 1997 conference on Advances in neural information processing systems 10
How to dynamically merge Markov decision processes

NIPS '97 Proceedings of the 1997 conference on Advances in neural information processing systems 10
Learning to Take Actions

Machine Learning
Bucket elimination: a unifying framework for reasoning

Artificial Intelligence
Learning action strategies for planning domains

Artificial Intelligence
Using temporal logics to express search control knowledge for planning

Artificial Intelligence
Relational reinforcement learning

Machine Learning - Special issue on inducive logic programming
Knowlege in action: logical foundations for specifying and implementing dynamical systems

Knowlege in action: logical foundations for specifying and implementing dynamical systems
Markov Decision Processes: Discrete Stochastic Dynamic Programming

Markov Decision Processes: Discrete Stochastic Dynamic Programming
Neuro-Dynamic Programming

Neuro-Dynamic Programming
Integrating Experimentation and Guidance in Relational Reinforcement Learning

ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping

ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Computing Factored Value Functions for Policies in Structured MDPs

IJCAI '99 Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence
Policy Iteration for Factored MDPs

UAI '00 Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence
Decision-Theoretic, High-Level Agent Programming in the Situation Calculus

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
Greedy linear value-approximation for factored Markov decision processes

Eighteenth national conference on Artificial intelligence
Piecewise linear value function approximation for factored MDPs

Eighteenth national conference on Artificial intelligence
Dynamic Programming

Dynamic Programming
Learning to Act using Real-Time Dynamic Programming

Learning to Act using Real-Time Dynamic Programming
The Linear Programming Approach to Approximate Dynamic Programming

Operations Research
Knowledge Representation and Reasoning

Knowledge Representation and Reasoning
Bellman goes relational

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Exploiting first-order regression in inductive policy selection

UAI '04 Proceedings of the 20th conference on Uncertainty in artificial intelligence
Solving factored MDPs with continuous and discrete variables

UAI '04 Proceedings of the 20th conference on Uncertainty in artificial intelligence
Graph kernels and Gaussian processes for relational reinforcement learning

Machine Learning
The design and implementation of VAMPIRE

AI Communications - CASC
MPE and partial inversion in lifted probabilistic variable elimination

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Samuel meets Amarel: automating value function approximation using global state space analysis

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Learning measures of progress for planning domains

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 3
Prioritized goal decomposition of Markov decision processes: toward a synthesis of classical and decision theoretic planning

IJCAI'97 Proceedings of the Fifteenth international joint conference on Artifical intelligence - Volume 2
The FF planning system: fast plan generation through heuristic search

Journal of Artificial Intelligence Research
The first probabilistic track of the international planning competition

Journal of Artificial Intelligence Research
Decision-theoretic planning with non-Markovian rewards

Journal of Artificial Intelligence Research
Approximate policy iteration with a policy language bias: solving relational Markov decision processes

Journal of Artificial Intelligence Research
First order decision diagrams for relational MDPs

Journal of Artificial Intelligence Research
Exploiting causal independence in Bayesian network inference

Journal of Artificial Intelligence Research
First order decision diagrams for relational MDPs

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
First-order probabilistic inference

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Generalizing plans to new environments in relational MDPs

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Extending DTGOLOG with options

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Symbolic dynamic programming for first-order MDPs

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 1
Lifted first-order probabilistic inference

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Fast planning through planning graph analysis

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
First-order decision-theoretic planning in structured relational environments

First-order decision-theoretic planning in structured relational environments
SPUDD: stochastic planning using decision diagrams

UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Inductive policy selection for first-order MDPs

UAI'02 Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence
Context-specific independence in Bayesian networks

UAI'96 Proceedings of the Twelfth international conference on Uncertainty in artificial intelligence

Generalized first order decision diagrams for first order Markov decision processes

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
On the Verification of Very Expressive Temporal Properties of Non-terminating Golog Programs

Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
Exploration in relational worlds

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part II
Automatic induction of bellman-error features for probabilistic planning

Journal of Artificial Intelligence Research
Planning with noisy probabilistic relational rules

Journal of Artificial Intelligence Research
Relational preference rules for control

Artificial Intelligence
Declarative programming for agent applications

Autonomous Agents and Multi-Agent Systems
Bridging the gap between reinforcement learning and knowledge representation: a logical off- and on-policy framework

ECSQARU'11 Proceedings of the 11th European conference on Symbolic and quantitative approaches to reasoning with uncertainty
Decision-theoretic planning with generalized first-order decision diagrams

Artificial Intelligence
A partition-based first-order probabilistic logic to represent interactive beliefs

SUM'11 Proceedings of the 5th international conference on Scalable uncertainty management
Probabilistic relational planning with first order decision diagrams

Journal of Artificial Intelligence Research
Stochastic enforced hill-climbing

Journal of Artificial Intelligence Research
Proximity-based non-uniform abstractions for approximate planning

Journal of Artificial Intelligence Research
Plan-based policies for efficient multiple battery load management

Journal of Artificial Intelligence Research
Exploration in relational domains for model-based reinforcement learning

The Journal of Machine Learning Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

Many traditional solution approaches to relationally specified decision-theoretic planning problems (e.g., those stated in the probabilistic planning domain description language, or PPDDL) ground the specification with respect to a specific instantiation of domain objects and apply a solution approach directly to the resulting ground Markov decision process (MDP). Unfortunately, the space and time complexity of these grounded solution approaches are polynomial in the number of domain objects and exponential in the predicate arity and the number of nested quantifiers in the relational problem specification. An alternative to grounding a relational planning problem is to tackle the problem directly at the relational level. In this article, we propose one such approach that translates an expressive subset of the PPDDL representation to a first-order MDP (FOMDP) specification and then derives a domain-independent policy without grounding at any intermediate step. However, such generality does not come without its own set of challenges-the purpose of this article is to explore practical solution techniques for solving FOMDPs. To demonstrate the applicability of our techniques, we present proof-of-concept results of our first-order approximate linear programming (FOALP) planner on problems from the probabilistic track of the ICAPS 2004 and 2006 International Planning Competitions.