Structural Properties of Stochastic Dynamic Programs

Authors:
James E. Smith;Kevin F. McCardle
Affiliations:
-;-
Venue:
Operations Research
Year:
2002

Citing 0
Cited 10

Effects of system parameters on the optimal policy structure in a class of queueing control problems

Queueing Systems: Theory and Applications
Using control theory for analysis of reinforcement learning and optimal policy properties in grid-world problems

ICIC'09 Proceedings of the Intelligent computing 5th international conference on Emerging intelligent computing technology and applications
Media Revenue Management with Audience Uncertainty: Balancing Upfront and Spot Market Sales

Manufacturing & Service Operations Management
A dynamical games approach to transmission-rate adaptation in multimedia WLAN

IEEE Transactions on Signal Processing
Smarter sampling in model-based Bayesian reinforcement learning

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I
Parametric concavity in stochastic dynamic programs

Computers and Industrial Engineering
A decision support model for tax revenue collection in Greece

Decision Support Systems
A new method of proving structural properties for certain class of stochastic dynamic control problems

Operations Research Letters
Monotonicity in multidimensional Markov decision processes for the batch dispatch problem

Operations Research Letters
Technology Adoption with Uncertain Future Costs and Quality

Operations Research

Quantified Score

Hi-index	0.00

Visualization

Abstract

In Markov models of sequential decision processes, one is often interested in showing that the value function is monotonic, convex, and/or supermodular in the state variables. These kinds of results can be used to develop a qualitative understanding of the model and characterize how the results will change with changes in model parameters. In this paper we present several fundamental results for establishing these kinds of properties. The results are, in essence, "metatheorems" showing that the value functions satisfy propertyP if the reward functions satisfy propertyP and the transition probabilities satisfy a stochastic version of this property. We focus our attention on closed convex cone properties, a large class of properties that includes monotonicity, convexity, and supermodularity, as well as combinations of these and many other properties of interest.