We present a new approach to motion planning under sensing and motion uncertainty that computes a locally optimal solution to a continuous partially observable Markov decision process (POMDP). Our approach represents beliefs (the distributions of the robot's state estimate) by Gaussian distributions and is applicable to robot systems with non-linear dynamics and observation models. The method follows the general POMDP solution framework: we approximate the belief dynamics using an extended Kalman filter and represent the value function by a quadratic function that is valid in the vicinity of a nominal trajectory through belief space. Using a belief-space variant of iterative LQG (iLQG), our approach iterates with second-order convergence towards a linear control policy over the belief space that is locally optimal with respect to a user-defined cost function. Unlike previous work, our approach does not assume maximum-likelihood observations or fixed estimator and control gains, takes obstacles in the environment into account, and does not require discretization of the state and action spaces. The running time of the algorithm is polynomial, O(n^6), in the dimension n of the state space. We demonstrate the potential of our approach in simulation for holonomic and non-holonomic robots maneuvering through environments with obstacles, with noisy and partial sensing and non-linear dynamics and observation models.
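The belief-dynamics approximation described above, propagating a Gaussian belief (mean and covariance) through an extended Kalman filter, can be sketched as follows. This is a minimal illustrative sketch, not the paper's implementation; the function and variable names (`f`, `h`, `F`, `H`, `Q`, `R`) are our own notational choices for the dynamics model, observation model, their Jacobians, and the process and measurement noise covariances.

```python
import numpy as np

def ekf_belief_update(mean, cov, u, z, f, h, F, H, Q, R):
    """One EKF step propagating a Gaussian belief N(mean, cov)
    through non-linear dynamics f and observation model h.
    F(x, u) and H(x) return the Jacobians of f and h; Q and R
    are the process and measurement noise covariances.
    (Illustrative sketch; not the paper's notation.)"""
    # Predict: push the mean through the dynamics and linearize
    # around it to propagate the covariance.
    A = F(mean, u)
    mean_pred = f(mean, u)
    cov_pred = A @ cov @ A.T + Q

    # Update: fold in the measurement z via the Kalman gain.
    C = H(mean_pred)
    S = C @ cov_pred @ C.T + R          # innovation covariance
    K = cov_pred @ C.T @ np.linalg.inv(S)
    mean_new = mean_pred + K @ (z - h(mean_pred))
    cov_new = (np.eye(len(mean)) - K @ C) @ cov_pred
    return mean_new, cov_new
```

In the belief-space view, the pair `(mean, cov)` is itself the state that the planner controls; linearizing this update along a nominal trajectory is what lets the value function be approximated by a quadratic and the policy by a linear feedback law.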