Neural mechanism for stochastic behaviour during a competitive game

Authors:
Alireza Soltani;Daeyeol Lee;Xiao-Jing Wang
Affiliations:
Department of Neurobiology and Kavli Institute for Neuroscience, Yale University School of Medicine, New Haven, CT and Department of Physics and Volen Center for Complex Systems, Brandeis Universi ...;Department of Neurobiology and Kavli Institute for Neuroscience, Yale University School of Medicine, New Haven, CT and Department of Brain and Cognitive Sciences, Center for Visual Science, Univer ...;Department of Neurobiology and Kavli Institute for Neuroscience, Yale University School of Medicine, New Haven, CT and Department of Physics and Volen Center for Complex Systems, Brandeis Universi ...
Venue:
Neural Networks - 2006 Special issue: Neurobiology of decision making
Year:
2006

Citing 7
Cited 3

The perception of randomness

Advances in Applied Mathematics
Learning in neural networks with material synapses

Neural Computation
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Metalearning and neuromodulation

Neural Networks - Computational models of neuromodulation
Dopamine-dependent plasticity of corticostriatal synapses

Neural Networks - Computational models of neuromodulation
Meta-learning in reinforcement learning

Neural Networks
A Computational Model of How the Basal Ganglia Produce Sequences

Journal of Cognitive Neuroscience

A model study on the circuit mechanism underlying decision-making in Drosophila

Neural Networks
Statistical mechanics of reward-modulated learning in decision-making networks

Neural Computation
Dynamical regimes in neural network models of matching behavior

Neural Computation

Quantified Score

Hi-index	0.00

Visualization

Abstract

Previous studies have shown that non-human primates can generate highly stochastic choice behaviour, especially when this is required during a competitive interaction with another agent. To understand the neural mechanism of such dynamic choice behaviour, we propose a biologically plausible model of decision making endowed with synaptic plasticity that follows a reward-dependent stochastic Hebbian learning rule. This model constitutes a biophysical implementation of reinforcement learning, and it reproduces salient features of behavioural data from an experiment with monkeys playing a matching pennies game. Due to interaction with an opponent and learning dynamics, the model generates quasi-random behaviour robustly in spite of intrinsic biases. Furthermore, non-random choice behaviour can also emerge when the model plays against a noninteractive opponent, as observed in the monkey experiment. Finally, when combined with a meta-learning algorithm, our model accounts for the slow drift in the animal's strategy based on a process of reward maximization.