Bias and Variance Approximation in Value Function Estimates

  • Authors:
  • Shie Mannor;Duncan Simester;Peng Sun;John N. Tsitsiklis

  • Affiliations:
  • Department of Electrical and Computer Engineering, McGill University, Montreal, Quebec, Canada H3A 2A7;Sloan School of Management, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139;Fuqua School of Business, Duke University, Durham, North Carolina 27708;Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139

  • Venue:
  • Management Science
  • Year:
  • 2007

Quantified Score

Hi-index 0.01

Visualization

Abstract

We consider a finite-state, finite-action, infinite-horizon, discounted reward Markov decision process and study the bias and variance in the value function estimates that result from empirical estimates of the model parameters. We provide closed-form approximations for the bias and variance, which can then be used to derive confidence intervals around the value function estimates. We illustrate and validate our findings using a large database describing the transaction and mailing histories for customers of a mail-order catalog firm.