Using mathematical programming to solve Factored Markov Decision Processes with Imprecise Probabilities

  • Authors:
  • Karina Valdivia Delgado;Leliane Nunes De Barros;Fabio Gagliardi Cozman;Scott Sanner

  • Affiliations:
  • Escola de Artes, Ciências e Humanidades, Universidade de São Paulo, SP, Brazil;Instituto de Matemática e Estatística, Universidade de São Paulo, SP, Brazil;Escola Politécnica, Universidade de São Paulo, SP, Brazil;NICTA and the Australian National University, Canberra ACT 2601, Australia

  • Venue:
  • International Journal of Approximate Reasoning
  • Year:
  • 2011

Quantified Score

Hi-index 0.01

Visualization

Abstract

This paper investigates Factored Markov Decision Processes with Imprecise Probabilities (MDPIPs); that is, Factored Markov Decision Processes (MDPs) where transition probabilities are imprecisely specified. We derive efficient approximate solutions for Factored MDPIPs based on mathematical programming. To do this, we extend previous linear programming approaches for linear approximations in Factored MDPs, resulting in a multilinear formulation for robust ''maximin'' linear approximations in Factored MDPIPs. By exploiting the factored structure in MDPIPs we are able to demonstrate orders of magnitude reduction in solution time over standard exact non-factored approaches, in exchange for relatively low approximation errors, on a difficult class of benchmark problems with millions of states.