Minimal model of strategy switching in the plus-maze navigation task

Authors:
Denis Sheynikhovich;Laurent Dollé;Ricardo Chavarriaga;Angelo Arleo
Affiliations:
Laboratoire de Neurobiologie des Processus Adaptatifs, UPMC-Paris 6, CNRS, UMR, Paris, France;Institut des Systèmes Intelligents et de Robotique, UPMC-Paris 6, CNRS, UMR, Paris Cedex 05, France;CNBI, EPFL, Lausanne, Switzerland;-
Venue:
SAB'10 Proceedings of the 11th international conference on Simulation of adaptive behavior: from animals to animats
Year:
2010

Citing 2
Cited 0

Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Analyzing Interactions between Navigation Strategies Using a Computational Model of Action Selection

Proceedings of the international conference on Spatial Cognition VI: Learning, Reasoning, and Talking about Space

Quantified Score

Hi-index	0.00

Visualization

Abstract

Prefrontal cortex (PFC) has been implicated in the ability to switch behavioral strategies in response to changes in reward contingencies. A recent experimental study has shown that separate subpopulations of neurons in the prefrontal cortex were activated when rats switched between allocentric place strategies and egocentric response strategies in the plus maze. In this paper we propose a simple neural-network model of strategy switching, in which the learning of the two strategies as well as learning to select between those strategies is governed by the same temporal-difference (TD) learning algorithm. We show that the model reproduces the experimental data on both behavioral and neural levels. On the basis of our results we derive testable prediction concerning a spatial dynamics of the phasic dopamine signal in the PFC, which is thought to encode reward-prediction error in the TD-learning theory.