Modelling coordination of learning systems: a reservoir systems approach to dopamine modulated pavlovian conditioning

  • Authors:
  • Robert Lowe;Francesco Mannella;Tom Ziemke;Gianluca Baldassarre

  • Affiliations:
  • University of Skövde, Informatics Research Centre, Cognition & Interaction Lab;Consiglio Nazionale delle Ricerche, Istituto di Scienze e Tecnologie della Cognizione, Laboratory of Computational Embodied Neuroscience;University of Skövde, Informatics Research Centre, Cognition & Interaction Lab;Consiglio Nazionale delle Ricerche, Istituto di Scienze e Tecnologie della Cognizione, Laboratory of Computational Embodied Neuroscience

  • Venue:
  • ECAL'09 Proceedings of the 10th European conference on Advances in artificial life: Darwin meets von Neumann - Volume Part I
  • Year:
  • 2009

Quantified Score

Hi-index 0.01

Visualization

Abstract

This paper presents a biologically constrained reward prediction model capable of learning cue-outcome associations involving temporally distant stimuli without using the commonly used temporal difference model. The model incorporates a novel use of an adapted echo state network to substitute the biologically implausible delay chains usually used, in relation to dopamine phenomena, for tackling temporally structured stimuli. Moreover, the model is based on a novel algorithm which successfully coordinates two sub systems: one providing Pavlovian conditioning, one providing timely inhibition of dopamine responses to salient anticipated stimuli. The model is validated against the typical profile of phasic dopamine in first and second order Pavlovian conditioning. The model is relevant not only to explaining the mechanisms underlying the biological regulation of dopamine signals, but also for applications in autonomous robotics involving reinforcement-based learning.