Temporal difference learning with interpolated table value functions

  • Authors:
  • Simon M. Lucas

  • Affiliations:
  • School of Computer Science and Electronic Engineering, University of Essex, Colchester, United Kingdom

  • Venue:
  • CIG'09 Proceedings of the 5th international conference on Computational Intelligence and Games
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper introduces a novel function approximation architecture especially well suited to temporal difference learning. The architecture is based on using sets of interpolated table look-up functions. These offer rapid and stable learning, and are efficient when the number of inputs is small. An empirical investigation is conducted to test their performance on a supervised learning task, and on the mountain car problem, a standard reinforcement learning benchmark. In each case, the interpolated table functions offer competitive performance.