Sequential constant size compressors for reinforcement learning

  • Authors:
  • Linus Gisslén;Matt Luciw;Vincent Graziano;Jürgen Schmidhuber

  • Affiliations:
  • IDSIA, University of Lugano, Manno-Lugano, Switzerland;IDSIA, University of Lugano, Manno-Lugano, Switzerland;IDSIA, University of Lugano, Manno-Lugano, Switzerland;IDSIA, University of Lugano, Manno-Lugano, Switzerland

  • Venue:
  • AGI'11 Proceedings of the 4th international conference on Artificial general intelligence
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Traditional Reinforcement Learning methods are insufficient for AGIs who must be able to learn to deal with Partially Observable Markov Decision Processes. We investigate a novel method for dealing with this problem: standard RL techniques using as input the hidden layer output of a Sequential Constant-Size Compressor (SCSC). The SCSC takes the form of a sequential Recurrent Auto-Associative Memory, trained through standard back-propagation. Results illustrate the feasibility of this approach -- this system learns to deal with highdimensional visual observations (up to 640 pixels) in partially observable environments where there are long time lags (up to 12 steps) between relevant sensory information and necessary action.