The convergence of value iteration in average cost Markov decision chains

  • Authors:
  • Linn I. Sennott

  • Affiliations:
  • Department of Mathematics 4520, Illinois State University, Normal, IL 61790-4520, USA

  • Venue:
  • Operations Research Letters
  • Year:
  • 1996

Quantified Score

Hi-index 0.00

Visualization

Abstract

Let J be the (constant) minimum long-run expected average cost in a Markov decision chain with countable state space. We desire the existence of an average cost optimal stationary policy and, in addition, that J is the limit of v"n(.)/n, where v"n(.) is the minimum n-step expected cost. Three sets of sufficient conditions for this to hold are given. The results generalize Ghosh and Marcus (1992).