The complexity of Markov decision processes
Mathematics of Operations Research
A survey of algorithmic methods for partially observed Markov decision processes
Annals of Operations Research
Acting optimally in partially observable stochastic domains
AAAI'94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 2)
AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Value-Directed Belief State Approximation for POMDPs
UAI '00 Proceedings of the 16th Conference on Uncertainty in Artificial Intelligence
Value-Directed Sampling Methods for POMDPs
UAI '01 Proceedings of the 17th Conference in Uncertainty in Artificial Intelligence
Value-function approximations for partially observable Markov decision processes
Journal of Artificial Intelligence Research
Computing optimal policies for partially observable decision processes using compact representations
AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 2
A heuristic variable grid solution method for POMDPs
AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence
Tractable inference for complex stochastic processes
UAI'98 Proceedings of the Fourteenth conference on Uncertainty in artificial intelligence
Incremental pruning: a simple, fast, exact method for partially observable Markov decision processes
UAI'97 Proceedings of the Thirteenth conference on Uncertainty in artificial intelligence
Hi-index | 0.00 |
We propose a new approach to value-directed belief state approximation for POMDPs. The value directed model allows one to choose approximation methods for belief state monitoring that have a small impact on decision quality. Using a vector space analysis of the problem, we devise two new search procedures for selecting an approximation scheme that have much better computational properties than existing methods, Though these provide looser error bounds, we show empirically that they have a similar impact on decision quality in practice, and run up to two orders of magnitude more quickly.