Q-learning with linear function approximation

  • Authors:
  • Francisco S. Melo;M. Isabel Ribeiro

  • Affiliations:
  • Institute for Systems and Robotics, Instituto Superior Técnico, Lisboa, Portugal;Institute for Systems and Robotics, Instituto Superior Técnico, Lisboa, Portugal

  • Venue:
  • COLT'07 Proceedings of the 20th annual conference on Learning theory
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we analyze the convergence of Q-learning with linear function approximation. We identify a set of conditions that implies the convergence of this method with probability 1, when a fixed learning policy is used. We discuss the differences and similarities between our results and those obtained in several related works. We also discuss the applicability of this method when a changing policy is used. Finally, we describe the applicability of this approximate method in partially observable scenarios.