ISO learning approximates a solution to the inverse-controller problem in an unsupervised behavioral paradigm

  • Authors:
  • Bernd Porr;Christian von Ferber;Florentin Wörgötter

  • Affiliations:
  • Department of Psychology, University of Stirling, Stirling FK9 4LA, Scotland;Theoretical Polymer Physics, Freiburg University, 79104 Freiburg, Germany;Department of Psychology, University of Stirling, Stirling FK9 4LA, Scotland

  • Venue:
  • Neural Computation
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

In "Isotropic Sequence Order Learning" (pp. 831-864 in this issue), we introduced a novel algorithm for temporal sequence learning (ISO learning). Here, we embed this algorithm into a formal nonevaluating (teacher free) environment, which establishes a sensor-motor feedback. The system is initially guided by a fixed reflex reaction, which has the objective disadvantage that it can react only after a disturbance has occurred. ISO learning eliminates this disadvantage by replacing the reflex-loop reactions with earlier anticipatory actions. In this article, we analytically demonstrate that this process can be understood in terms of control theory, showing that the system learns the inverse controller of its own reflex. Thereby, this system is able to learn a simple form of feedforward motor control.