On the asymptotic equivalence between differential Hebbian and temporal difference learning

  • Authors:
  • Christoph Kolodziejski;Bernd Porr;Florentin Wörgötter

  • Affiliations:
  • Bernstein Center for Computational Neuroscience, University of Göttingen, Göttingen, Germany;Department of Electronics and Electrical Engineering, University of Glasgow, Glasgow, Scotland;Bernstein Center for Computational Neuroscience, University of Göttingen, Göttingen, Germany

  • Venue:
  • Neural Computation
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this theoretical contribution, we provide mathematical proof that two of the most important classes of network learning-correlation-based differential Hebbian learning and reward-based temporal difference learning--are asymptotically equivalent when timing the learning with a modulatory signal. This opens the opportunity to consistently reformulate most of the abstract reinforcement learning framework from a correlation-based perspective more closely related to the biophysics of neurons.