Forward search value iteration for POMDPs

  • Authors:
  • Guy Shani;Ronen I. Brafman;Solomon E. Shimony

  • Affiliations:
  • Department of Computer Science, Ben-Gurion University, Beer-Sheva, Israel;Department of Computer Science, Ben-Gurion University, Beer-Sheva, Israel;Department of Computer Science, Ben-Gurion University, Beer-Sheva, Israel

  • Venue:
  • IJCAI'07 Proceedings of the 20th international joint conference on Artificial intelligence
  • Year:
  • 2007

Abstract

The recent scaling up of POMDP solvers towards realistic applications is largely due to point-based methods, which quickly converge to an approximate solution for medium-sized problems. Of this family, HSVI, which uses trial-based asynchronous value iteration, can handle the largest domains. In this paper we suggest a new algorithm, FSVI, that uses the underlying MDP to traverse the belief space towards rewards, finding sequences of useful backups, and show how it scales up better than HSVI on larger benchmarks.
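
The abstract gives no pseudocode, so the following Python sketch is only one plausible reading of the idea it describes: solve the underlying MDP, follow its greedy actions from a sampled start state while tracking the belief, and then apply point-based backups to the visited beliefs in reverse order. The toy three-state model, the depth limit, the zero-vector initial lower bound, and all function names (`solve_mdp`, `fsvi_trial`, `run_fsvi`, and so on) are assumptions made for illustration and are not taken from the paper.

```python
import random

# Hypothetical toy model: states 0 and 1, plus an absorbing goal state 2.
GAMMA = 0.95
GOALS = {2}
LEFT = [[1.0, 0.0, 0.0], [0.9, 0.1, 0.0], [0.0, 0.0, 1.0]]
RIGHT = [[0.1, 0.9, 0.0], [0.0, 0.1, 0.9], [0.0, 0.0, 1.0]]
T = [LEFT, RIGHT]                          # T[a][s][t] = P(t | s, a)
OBS = [[0.8, 0.2], [0.3, 0.7], [0.1, 0.9]]
Z = [OBS, OBS]                             # Z[a][t][o] = P(o | a, t)
R = [[0.0, 0.0], [0.0, 1.0], [0.0, 0.0]]   # R[s][a], all non-negative
B0 = [0.5, 0.5, 0.0]                       # initial belief
N_S, N_A, N_O = 3, 2, 2

def dot(x, y):
    return sum(p * q for p, q in zip(x, y))

def solve_mdp(sweeps=500):
    """Value iteration on the fully observable MDP; returns Q[s][a]."""
    v = [0.0] * N_S
    for _ in range(sweeps):
        q = [[R[s][a] + GAMMA * dot(T[a][s], v) for a in range(N_A)]
             for s in range(N_S)]
        v = [max(row) for row in q]
    return q

def belief_update(b, a, o):
    """Bayes filter: b'(t) is proportional to Z(a,t,o) * sum_s T(s,a,t) b(s)."""
    nb = [Z[a][t][o] * sum(T[a][s][t] * b[s] for s in range(N_S))
          for t in range(N_S)]
    norm = sum(nb)
    return [x / norm for x in nb] if norm > 0 else list(b)

def backup(b, alphas):
    """Standard point-based backup: the best new alpha-vector for belief b."""
    best = None
    for a in range(N_A):
        g_a = [R[s][a] for s in range(N_S)]
        for o in range(N_O):
            # Project every alpha-vector back through (a, o), keep the best at b.
            cands = [[sum(T[a][s][t] * Z[a][t][o] * al[t] for t in range(N_S))
                      for s in range(N_S)] for al in alphas]
            g = max(cands, key=lambda vec: dot(vec, b))
            g_a = [x + GAMMA * y for x, y in zip(g_a, g)]
        if best is None or dot(g_a, b) > dot(best, b):
            best = g_a
    return best

def value(b, alphas):
    return max(dot(al, b) for al in alphas)

def fsvi_trial(b, s, alphas, q_mdp, rng, depth=30):
    """Follow the MDP-greedy action from the hidden state s, then back up b."""
    if s not in GOALS and depth > 0:
        a = max(range(N_A), key=lambda i: q_mdp[s][i])
        t = rng.choices(range(N_S), weights=T[a][s])[0]   # sampled next state
        o = rng.choices(range(N_O), weights=Z[a][t])[0]   # sampled observation
        fsvi_trial(belief_update(b, a, o), t, alphas, q_mdp, rng, depth - 1)
    # Backups run on the way out of the recursion, deepest belief first.
    alphas.append(backup(b, alphas))

def run_fsvi(trials=20, seed=0):
    rng = random.Random(seed)
    q_mdp = solve_mdp()
    alphas = [[0.0] * N_S]   # assumed lower bound; valid since rewards are >= 0
    for _ in range(trials):
        s0 = rng.choices(range(N_S), weights=B0)[0]
        fsvi_trial(B0, s0, alphas, q_mdp, rng)
    return alphas
```

Calling `value(B0, run_fsvi())` returns the sketch's lower-bound estimate at the initial belief. Because the traversal is steered by the MDP policy, which assumes the state is known, a scheme like this would not by itself choose actions purely to gather information; the toy model above avoids that issue because the same action is optimal in every non-goal state.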