A New Learning Algorithm for Optimal Stopping

  • Authors:
  • Vivek S. Borkar;Jervis Pinto;Tarun Prabhu

  • Affiliations:
  • Tata Institute of Fundamental Research, Mumbai, India 400005;St. Francis Institute of Technology, Mumbai, India 400103 and School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, USA 97331;St. Francis Institute of Technology, Mumbai, India 400103 and School of Computing, University of Utah, Salt Lake City, USA 84112

  • Venue:
  • Discrete Event Dynamic Systems
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

A linear programming formulation of the optimal stopping problem for Markov decision processes is approximated using linear function approximation. Using this formulation, a reinforcement learning scheme based on a primal-dual method and incorporating a sampling device called `split sampling' is proposed and analyzed. An illustrative example from option pricing is also included.