Bias optimality for multichain continuous-time Markov decision processes

  • Authors:
  • Xianping Guo;Xinyuan Song;Junyu Zhang

  • Affiliations:
  • The School of Mathematics and Computational Science, Zhongshan University, Guangzhou, PR China;Department of Statistics, The Chinese University of Hong Kong, Hong Kong;The School of Mathematics and Computational Science, Zhongshan University, Guangzhou, PR China

  • Venue:
  • Operations Research Letters
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper deals with the bias optimality of multichain models for finite continuous-time Markov decision processes. Based on new performance difference formulas developed here, we prove the convergence of a so-called bias-optimal policy iteration algorithm, which can be used to obtain bias-optimal policies in a finite number of iterations.