The kNN-TD Reinforcement Learning Algorithm

  • Authors:
  • José Antonio Martín H.;Javier Lope;Darío Maravall

  • Affiliations:
  • Dep. Sistemas Informáticos y Computación, Universidad Complutense de Madrid,;Perception for Computers and Robots, Universidad Politécnica de Madrid,;Perception for Computers and Robots, Universidad Politécnica de Madrid,

  • Venue:
  • IWINAC '09 Proceedings of the 3rd International Work-Conference on The Interplay Between Natural and Artificial Computation: Part I: Methods and Models in Artificial and Natural Computation. A Homage to Professor Mira's Scientific Legacy
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

A reinforcement learning algorithm called k NN-TD is introduced. This algorithm has been developed using the classical formulation of temporal difference methods and a k -nearest neighbors scheme as its expectations memory. By means of this kind of memory the algorithm is able to generalize properly over continuous state spaces and also take benefits from collective action selection and learning processes. Furthermore, with the addition of probability traces, we obtain the k NN-TD(*** ) algorithm which exhibits a state of the art performance. Finally the proposed algorithm has been tested on a series of well known reinforcement learning problems and also at the Second Annual RL Competition with excellent results.