An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning

  • Authors:
  • Ronald Parr;Lihong Li;Gavin Taylor;Christopher Painter-Wakefield;Michael L. Littman

  • Affiliations:
  • Duke University, Durham, NC;Rutgers University, Piscataway, NJ;Duke University, Durham, NC;Duke University, Durham, NC;Rutgers University, Piscataway, NJ

  • Venue:
  • Proceedings of the 25th international conference on Machine learning
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

We show that linear value-function approximation is equivalent to a form of linear model approximation. We then derive a relationship between the model-approximation error and the Bellman error, and show how this relationship can guide feature selection for model improvement and/or value-function improvement. We also show how these results give insight into the behavior of existing feature-selection algorithms.