Criteria for selecting the relaxation factor of the value iteration algorithm for undiscounted Markov and semi-Markov decision processes

  • Authors:
  • Meir Herzberg;Uri Yechiali

  • Affiliations:
  • Engineering and Planning Division BEZEQ, The Israel Telecommunication Co., Tel Aviv 61290, Israel;Department of Statistics, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv 69978, Israel

  • Venue:
  • Operations Research Letters
  • Year:
  • 1991

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present two criteria for selecting the adaptive relaxation factor being used in speeding-up the value iteration algorithm for undiscounted Markov decision processes. The criteria are: 1. Minimum Ratio, 2. Minimum Variance. For the problems tested it was found that the criterion of minimum variance is most effective for Markov models while an hybrid use of both criteria yields best results when solving semi-Markov decision problems. The advantage of using these criteria increases with the dimension of the models.