Simulation-Based Optimization Algorithms for Finite-Horizon Markov Decision Processes

  • Authors:
  • Shalabh Bhatnagar;Mohammed Shahid Abdulla

  • Affiliations:
  • Department of Computer Science and Automation IndianInstitute of Science Bangalore 560 012, India;General Motors India Science Lab Bangalore

  • Venue:
  • Simulation
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

We develop four simulation-based algorithms for finite-horizon Markov decision processes. Two of these algorithms are developed for finite state and compact action spaces while the other two are for finite state and finite action spaces. Of the former two, one algorithm uses a linear parameterization for the policy, resulting in reduced memory complexity. Convergence analysis is briefly sketched and illustrative numerical experiments with the four algorithms are shown for a problem of flow control in communication networks.