Discounted deterministic Markov decision processes and discounted all-pairs shortest paths

  • Authors:
  • Omid Madani;Mikkel Thorup;Uri Zwick

  • Affiliations:
  • SRI International, Menlo Park, CA;AT&T Labs - Research, Florham Park, NJ;Tel Aviv University, Tel Aviv, Israel

  • Venue:
  • ACM Transactions on Algorithms (TALG)
  • Year:
  • 2010

Abstract

We present algorithms for finding optimal strategies for discounted, infinite-horizon, Deterministic Markov Decision Processes (DMDPs). Our fastest algorithm has a worst-case running time of O(mn), improving the recent bound of O(mn^2) obtained by Andersson and Vorobyov [2006]. We also present a randomized O(m^{1/2} n^2)-time algorithm for finding Discounted All-Pairs Shortest Paths (DAPSP), improving an O(mn^2)-time algorithm that can be obtained using ideas of Papadimitriou and Tsitsiklis [1987].
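
To make the problem statement concrete, the following is a minimal sketch of plain value iteration on a discounted DMDP. It is not the O(mn) algorithm of the paper, only a simple baseline. It assumes the DMDP is given as a directed graph with n vertices and m edges, that each edge carries a cost to be minimized, and that every vertex has at least one outgoing edge. The function name `discounted_values` and its parameters are hypothetical and chosen for illustration.

```python
def discounted_values(n, edges, discount, tol=1e-12, max_iters=100_000):
    """Approximate optimal discounted costs of a deterministic MDP.

    Illustrative baseline only (value iteration), not the paper's method.
    n        -- number of vertices, labelled 0..n-1 (assumed representation)
    edges    -- list of (u, v, cost) triples, one per action
    discount -- discount factor, strictly between 0 and 1
    Returns (values, policy), where policy[u] is the chosen successor of u.
    """
    if not 0.0 < discount < 1.0:
        raise ValueError("discount must lie strictly between 0 and 1")

    # Adjacency lists: out[u] holds (successor, cost) pairs.
    out = [[] for _ in range(n)]
    for u, v, cost in edges:
        out[u].append((v, cost))
    if any(not successors for successors in out):
        raise ValueError("every vertex needs at least one outgoing edge")

    values = [0.0] * n
    for _ in range(max_iters):
        # Bellman update: best immediate cost plus discounted future cost.
        new_values = [
            min(cost + discount * values[v] for v, cost in out[u])
            for u in range(n)
        ]
        change = max(abs(a - b) for a, b in zip(new_values, values))
        values = new_values
        if change <= tol:
            break

    # Greedy strategy with respect to the computed values.
    policy = [
        min(out[u], key=lambda edge: edge[1] + discount * values[edge[0]])[0]
        for u in range(n)
    ]
    return values, policy


if __name__ == "__main__":
    # Vertex 0 may loop on itself at cost 1 or move to vertex 1 at cost 3;
    # vertex 1 loops on itself at cost 0.
    example_edges = [(0, 0, 1.0), (0, 1, 3.0), (1, 1, 0.0)]
    vals, pol = discounted_values(2, example_edges, discount=0.5)
    print(vals, pol)
```

In this small example, looping at vertex 0 costs 1 + 0.5 + 0.25 + ..., which sums to 2 and is cheaper than paying 3 to reach the free loop at vertex 1, so the computed strategy stays at vertex 0. Value iteration only converges in the limit, which is one reason algorithms with exact worst-case bounds such as those in the abstract are of interest.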