Information Relaxations and Duality in Stochastic Dynamic Programs
Operations Research
Regression Methods for Stochastic Control Problems and Their Convergence Analysis
SIAM Journal on Control and Optimization
An Iterative Procedure for Constructing Subsolutions of Discrete-Time Optimal Control Problems
SIAM Journal on Control and Optimization
Robust bayesian reinforcement learning through tight lower bounds
EWRL'11 Proceedings of the 9th European conference on Recent Advances in Reinforcement Learning
Pathwise Optimization for Optimal Stopping Problems
Management Science
Hi-index | 0.00 |
This paper approaches optimal control problems for discrete-time controlled Markov processes by representing the value of the problem in a dual Lagrangian form; the value is expressed as an infimum over a family of Lagrangian martingales of an expectation of a pathwise supremum of the objective adjusted by the Lagrangian martingale term. This representation opens up the possibility of numerical methods based on Monte Carlo simulation, which may be advantageous in high-dimensional problems or in problems with complicated constraints.