On constrained Markov decision processes

  • Authors:
  • Moshe Haviv

  • Affiliations:
  • Department of Statistics, The Hebrew University of Jerusalem, 91905 Jerusalem, Israel

  • Venue:
  • Operations Research Letters
  • Year:
  • 1996

Quantified Score

Hi-index 0.00

Visualization

Abstract

A multichain Markov decision process with constraints on the expected state-action frequencies may lead to a unique optimal policy which does not satisfy Bellman's principle of optimality. The model with sample-path constraints does not suffer from this drawback.