Learning and Reversal Learning in the Subcortical Limbic System: A Computational Model
Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
When neurons fire together, they wire together: this is Donald Hebb's famous postulate. However, Hebbian learning is inherently unstable because synaptic weights self-amplify: the more a synapse is able to drive its postsynaptic cell, the more its weight grows. We present a new, biologically realistic way to stabilise synaptic weights by introducing a third factor which switches learning on or off, so that self-amplification is minimised. This third factor can be identified with the activity of dopaminergic neurons in the VTA, which fire when a reward is encountered. This leads to a new interpretation of the dopamine signal which goes beyond the classical prediction-error hypothesis. The model is tested in a real-world task in which a robot has to find "food disks" in its environment.
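The gating idea in the abstract can be sketched in a few lines of code. The snippet below is a minimal illustration, not the authors' actual model: the function name `three_factor_hebb`, the learning rate `eta`, and the binary gate are all assumptions chosen to show how a third factor limits Hebbian self-amplification.

```python
def three_factor_hebb(w, pre, post, reward_signal, eta=0.01):
    """One Hebbian weight update gated by a third factor.

    Plain Hebbian learning (dw = eta * pre * post) lets weights
    self-amplify without bound; here the update is applied only
    while the third factor (a dopamine-like reward signal) is on.
    """
    third_factor = 1.0 if reward_signal > 0 else 0.0  # on/off learning switch
    return w + eta * third_factor * pre * post

# Toy run: the weight grows only while the gate is open.
w = 0.5
for t in range(100):
    pre, post = 1.0, 1.0               # correlated pre/post activity
    reward = 1.0 if t < 10 else 0.0    # "dopamine burst" for 10 steps only
    w = three_factor_hebb(w, pre, post, reward)
# Without the gate, w would keep growing for all 100 steps;
# with it, growth stops once the reward signal vanishes.
```

In this sketch the third factor is a hard on/off switch, matching the abstract's description; a graded (continuous) dopamine signal could be substituted by multiplying the update by its magnitude instead.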