SPUDD: stochastic planning using decision diagrams

Authors:
Jesse Hoey;Robert St-Aubin;Alan Hu;Craig Boutilier
Affiliations:
Department of Computer Science, University of British Columbia, Vancouver, BC, Canada;Department of Computer Science, University of British Columbia, Vancouver, BC, Canada;Department of Computer Science, University of British Columbia, Vancouver, BC, Canada;Department of Computer Science, University of British Columbia, Vancouver, BC, Canada
Venue:
UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Year:
1999

Citing 14
Cited 85

Graph-Based Algorithms for Boolean Function Manipulation

IEEE Transactions on Computers
Probabilistic reasoning in intelligent systems: networks of plausible inference

Probabilistic reasoning in intelligent systems: networks of plausible inference
A model for reasoning about persistence and causation

Computational Intelligence
Modeling a dynamic and uncertain world I: symbolic and probabilistic reasoning about change

Artificial Intelligence
Abstraction and approximate decision-theoretic planning

Artificial Intelligence
Algebraic decision diagrams and their applications

ICCAD '93 Proceedings of the 1993 IEEE/ACM international conference on Computer-aided design
Automatic OBDD-based generation of universal plans in non-deterministic domains

AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Markov Decision Processes: Discrete Stochastic Dynamic Programming

Markov Decision Processes: Discrete Stochastic Dynamic Programming
Exploiting structure in policy construction

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Model minimization in Markov decision processes

AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence
Exploiting the rule structure for decision making within the independent choice logic

UAI'95 Proceedings of the Eleventh conference on Uncertainty in artificial intelligence
Correlated action effects in decision theoretic regression

UAI'97 Proceedings of the Thirteenth conference on Uncertainty in artificial intelligence
Context-specific independence in Bayesian networks

UAI'96 Proceedings of the Twelfth international conference on Uncertainty in artificial intelligence
Topological parameters for time-space tradeoff

UAI'96 Proceedings of the Twelfth international conference on Uncertainty in artificial intelligence

Complexity of finite-horizon Markov decision process problems

Journal of the ACM (JACM)
Symbolic Heuristic Search Using Decision Diagrams

Proceedings of the 5th International Symposium on Abstraction, Reformulation and Approximation
Solving Factored MDPs with Large Action Space Using Algebraic Decision Diagrams

PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Piecewise linear value function approximation for factored MDPs

Eighteenth national conference on Artificial intelligence
Symbolic heuristic search for factored Markov decision processes

Eighteenth national conference on Artificial intelligence
Weak, strong, and strong cyclic planning via symbolic model checking

Artificial Intelligence - special issue on planning with uncertainty and incomplete information
Contingent planning under uncertainty via stochastic satisfiability

Artificial Intelligence - special issue on planning with uncertainty and incomplete information
Equivalence notions and model minimization in Markov decision processes

Artificial Intelligence - special issue on planning with uncertainty and incomplete information
Solving factored MDPs using non-homogeneous partitions

Artificial Intelligence - special issue on planning with uncertainty and incomplete information
When plans distinguish Bayes nets

International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems
A differential semantics for jointree algorithms

Artificial Intelligence
Compiling CSPs into Tree-Driven Automata for Interactive Solving

Constraints
Dynamic programming for structured continuous Markov decision problems

UAI '04 Proceedings of the 20th conference on Uncertainty in artificial intelligence
Exploiting first-order regression in inductive policy selection

UAI '04 Proceedings of the 20th conference on Uncertainty in artificial intelligence
Heuristic search value iteration for POMDPs

UAI '04 Proceedings of the 20th conference on Uncertainty in artificial intelligence
A causal approach to hierarchical decomposition of factored MDPs

ICML '05 Proceedings of the 22nd international conference on Machine learning
Strong planning under partial observability

Artificial Intelligence
Learning the structure of Factored Markov Decision Processes in reinforcement learning problems

ICML '06 Proceedings of the 23rd international conference on Machine learning
Causal Graph Based Decomposition of Factored MDPs

The Journal of Machine Learning Research
Value-Directed Human Behavior Analysis from Video Using Partially Observable Markov Decision Processes

IEEE Transactions on Pattern Analysis and Machine Intelligence
Exploiting factored representations for decentralized execution in multiagent teams

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
θ-Subsumption Based on Object Context

Inductive Logic Programming
Exploiting Additive Structure in Factored MDPs for Reinforcement Learning

Recent Advances in Reinforcement Learning
Planning for success: The interdisciplinary approach to building Bayesian models

International Journal of Approximate Reasoning
Anytime heuristic search for partial satisfaction planning

Artificial Intelligence
The factored policy-gradient planner

Artificial Intelligence
Practical solution techniques for first-order MDPs

Artificial Intelligence
Experimental Evaluation of a Planning Language Suitable for Formal Verification

Model Checking and Artificial Intelligence
Anticipatory Learning Classifier Systems and Factored Reinforcement Learning

Anticipatory Behavior in Adaptive Learning Systems
Learning Representation and Control in Markov Decision Processes: New Frontiers

Foundations and Trends® in Machine Learning
Solving generalized semi-Markov decision processes using continuous phase-type distributions

AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
Functional value iteration for decision-theoretic planning with general utility functions

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Decision tree methods for finding reusable MDP homomorphisms

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Considering Unseen States as Impossible in Factored Reinforcement Learning

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
DC-SSAT: a divide-and-conquer approach to solving stochastic satisfiability problems efficiently

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 1
Planning and execution with phase transitions

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 2
Using domain-configurable search control for probabilistic planning

AAAI'05 Proceedings of the 20th national conference on Artificial intelligence - Volume 3
Thresholded rewards: acting optimally in timed, zero-sum games

AAAI'07 Proceedings of the 22nd national conference on Artificial intelligence - Volume 2
Partitioned external-memory value iteration

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Symbolic heuristic search value iteration for factored POMDPs

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
Conformant planning via symbolic model checking

Journal of Artificial Intelligence Research
Efficient solution algorithms for factored MDPs

Journal of Artificial Intelligence Research
The first probabilistic track of the international planning competition

Journal of Artificial Intelligence Research
Decision-theoretic planning with non-Markovian rewards

Journal of Artificial Intelligence Research
FLUCAP: a heuristic search planner for first-order MDPs

Journal of Artificial Intelligence Research
Planning with durative actions in stochastic domains

Journal of Artificial Intelligence Research
First order decision diagrams for relational MDPs

Journal of Artificial Intelligence Research
First order decision diagrams for relational MDPs

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Topological value iteration algorithm for Markov decision processes

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Solving factored MDPs via non-homogeneous partitioning

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 1
Symbolic dynamic programming for first-order MDPs

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 1
Bounded search and symbolic inference for constraint optimization

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
A decision-theoretic approach to task assistance for persons with dementia

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Affine algebraic decision diagrams (AADDs) and their application to structured probabilistic inference

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Strong planning under partial observability

Artificial Intelligence
Intensional dynamic programming. A Rosetta stone for structured dynamic programming

Journal of Algorithms
Deliberation scheduling using GSMDPs in stochastic asynchronous domains

International Journal of Approximate Reasoning
ReTrASE: integrating paradigms for approximate probabilistic planning

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Generalized first order decision diagrams for first order Markov decision processes

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Symbolic Reasoning with Weighted and Normalized Decision Diagrams

Electronic Notes in Theoretical Computer Science (ENTCS)
The lumberjack algorithm for learning linked decision forests

PRICAI'00 Proceedings of the 6th Pacific Rim international conference on Artificial intelligence
Promela planning

SPIN'03 Proceedings of the 10th international conference on Model checking software
Automated handwashing assistance for persons with dementia using video and a partially observable Markov decision process

Computer Vision and Image Understanding
Automated large-scale control of gene regulatory networks

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Approximate dynamic programming with affine ADDs

Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
A new representation and associated algorithms for generalized planning

Artificial Intelligence
Symbolic bounded real-time dynamic programming

SBIA'10 Proceedings of the 20th Brazilian conference on Advances in artificial intelligence
Efficient solutions to factored MDPs with imprecise transition probabilities

Artificial Intelligence
Using mathematical programming to solve Factored Markov Decision Processes with Imprecise Probabilities

International Journal of Approximate Reasoning
Efficient planning in R-max

The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
Dynamic Behavior Sequencing for Hybrid Robot Architectures

Journal of Intelligent and Robotic Systems
Decision-theoretic planning with generalized first-order decision diagrams

Artificial Intelligence
A natural language argumentation interface for explanation generation in Markov decision processes

ADT'11 Proceedings of the Second international conference on Algorithmic decision theory
Probabilistic relational planning with first order decision diagrams

Journal of Artificial Intelligence Research
A framework and a mean-field algorithm for the local control of spatial processes

International Journal of Approximate Reasoning
Anytime state-based solution methods for decision processes with non-Markovian rewards

UAI'02 Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence
Value-directed belief state approximation for POMDPs

UAI'00 Proceedings of the Sixteenth conference on Uncertainty in artificial intelligence
Symbolic generalization for on-line planning

UAI'03 Proceedings of the Nineteenth conference on Uncertainty in Artificial Intelligence
Stochastic abstract policies for knowledge transfer in robotic navigation tasks

MICAI'11 Proceedings of the 10th Mexican international conference on Advances in Artificial Intelligence - Volume Part I
Topological value iteration algorithms

Journal of Artificial Intelligence Research
DetH: approximate hierarchical solution of large Markov decision processes

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Discovering hidden structure in factored MDPs

Artificial Intelligence
Proximity-based non-uniform abstractions for approximate planning

Journal of Artificial Intelligence Research
Recognizing internal states of other agents to anticipate and coordinate interactions

EUMAS'11 Proceedings of the 9th European conference on Multi-Agent Systems
An English-Language Argumentation Interface for Explanation Generation with Markov Decision Processes in the Domain of Academic Advising

ACM Transactions on Interactive Intelligent Systems (TiiS)

Quantified Score

Hi-index	0.01

Visualization

Abstract

Recently, structured methods for solving factored Markov decisions processes (MDPs) with large state spaces have been proposed recently to allow dynamic programming to be applied without the need for complete state enumeration. We propose and examine a new value iteration algorithm for MDPs that uses algebraic decision diagrams (ADDs) to represent value functions and policies, assuming an ADD input representation of the MDP. Dynamic programming is implemented via ADD manipulation. We demonstrate our method on a class of large MDPs (up to 63 million states) and show that significant gains can be had when compared to tree-structured representations (with up to a thirty-fold reduction in the number of nodes required to represent optimal value functions).