Nearly deterministic abstractions of Markov decision processes

Authors:
Terran Lane;Leslie Pack Kaelbling
Affiliations:
MIT Artificial Intelligence Laboratory, 200 Technology Square, Cambridge, MA;MIT Artificial Intelligence Laboratory, 200 Technology Square, Cambridge, MA
Venue:
Eighteenth national conference on Artificial intelligence
Year:
2002

Citing 19
Cited 4

Planning under time constraints in stochastic domains

Artificial Intelligence - Special volume on planning and scheduling
Approximation schemes for minimum latency problems

STOC '99 Proceedings of the thirty-first annual ACM symposium on Theory of computing
An improved approximation ratio for the minimum latency problem

Proceedings of the seventh annual ACM-SIAM symposium on Discrete algorithms
Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning

Artificial Intelligence
Complexity of finite-horizon Markov decision process problems

Journal of the ACM (JACM)
Stochastic dynamic programming with factored representations

Artificial Intelligence
Markov Decision Processes: Discrete Stochastic Dynamic Programming

Markov Decision Processes: Discrete Stochastic Dynamic Programming
Efficient Reinforcement Learning in Factored MDPs

IJCAI '99 Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence
Multi-Value-Functions: Efficient Automatic Action Hierarchies for Multiple Goal MDPs

IJCAI '99 Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence
Dynamic Non-uniform Abstractions for Approximate Planning in Large Structured Stochastic Domains

PRICAI '98 Proceedings of the 5th Pacific Rim International Conference on Artificial Intelligence: Topics in Artificial Intelligence
Policy Iteration for Factored MDPs

UAI '00 Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence
Robust Combination of Local Controllers

UAI '01 Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence
Hierarchical control and learning for markov decision processes

Hierarchical control and learning for markov decision processes
Temporal abstraction in reinforcement learning

Temporal abstraction in reinforcement learning
Nonapproximability results for partially observable Markov decision processes

Journal of Artificial Intelligence Research
Model minimization in Markov decision processes

AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence
Probabilistic propositional planning: representations and complexity

AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence
Hierarchical solution of Markov decision processes using macro-actions

UAI'98 Proceedings of the Fourteenth conference on Uncertainty in artificial intelligence
Flexible decomposition algorithms for weakly coupled Markov decision problems

UAI'98 Proceedings of the Fourteenth conference on Uncertainty in artificial intelligence

Controlled search over compact state representations, in nondeterministic planning domains and beyond

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Application of action selection, information gathering, and information evaluation technologies to UAV target tracking

DAMAS'05 Proceedings of the 2005 international conference on Defence Applications of Multi-Agent Systems
Desire-space analysis and action selection for multiple dynamic goals

CLIMA'04 Proceedings of the 5th international conference on Computational Logic in Multi-Agent Systems
Simulating UAV Surveillance for Analyzing Impact of Commitments in Multi-Agent Systems

International Journal of Agent Technologies and Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

We examine scaling issues for a restricted class of compactly representable Markov decision process planning problems. For one stochastic mobile robotics package delivery problem it is possible to decouple the stochastic local-navigation problem from the deterministic global-routing one and to solve each with dedicated methods. Careful construction of macro actions allows us to effectively "hide" navigational stochasticity from the global routing problem and to approximate the latter with off-the-shelf combinatorial optimization routines for the traveling salesdroid problem, yielding a net exponential speedup in planning performance. We give analytic conditions on when the macros are close enough to deterministic for the approximation to be good and demonstrate the performance of our method on small and large simulated navigation problems.