Reinforcement learning of two-joint virtual arm reaching in a computer model of sensorimotor cortex

Authors:
Samuel A. Neymotin;George L. Chadderdon;Cliff C. Kerr;Joseph T. Francis;William W. Lytton
Affiliations:
-;-;-;-;-
Venue:
Neural Computation
Year:
2013

Citing 10
Cited 1

A neural cocktail-party processor

Biological Cybernetics
Dopamine-dependent plasticity of corticostriatal synapses

Neural Networks - Computational models of neuromodulation
Brain-Based Devices for the Study of Nervous Systems and the Development of Intelligent Machines

Artificial Life
Rapid Temporal Modulation of Synchrony by Competition in Cortical Interneuron Networks

Neural Computation
The NEURON Book

The NEURON Book
Reinforcement Learning Through Modulation of Spike-Timing-Dependent Synaptic Plasticity

Neural Computation
Just-in-time connectivity for large spiking networks

Neural Computation
A spiking neural network model of an actor-critic learning agent

Neural Computation
Rule-based firing for network simulations

Neurocomputing
Synaptic information transfer in computer models of neocortical columns

Journal of Computational Neuroscience

Towards a real-time interface between a biomimetic model of sensorimotor cortex and a robotic arm

Pattern Recognition Letters

Quantified Score

Hi-index	0.00

Visualization

Abstract

Neocortical mechanisms of learning sensorimotor control involve a complex series of interactions at multiple levels, from synaptic mechanisms to cellular dynamics to network connectomics. We developed a model of sensory and motor neocortex consisting of 704 spiking model neurons. Sensory and motor populations included excitatory cells and two types of interneurons. Neurons were interconnected with AMPA/NMDA and GABAA synapses. We trained our model using spike-timing-dependent reinforcement learning to control a two-joint virtual arm to reach to a fixed target. For each of 125 trained networks, we used 200 training sessions, each involving 15ï戮 s reaches to the target from 16 starting positions. Learning altered network dynamics, with enhancements to neuronal synchrony and behaviorally relevant information flow between neurons. After learning, networks demonstrated retention of behaviorally relevant memories by using proprioceptive information to perform reach-to-target from multiple starting positions. Networks dynamically controlled which joint rotations to use to reach a target, depending on current arm position. Learning-dependent network reorganization was evident in both sensory and motor populations: learned synaptic weights showed target-specific patterning optimized for particular reach movements. Our model embodies an integrative hypothesis of sensorimotor cortical learning that could be used to interpret future electrophysiological data recorded in vivo from sensorimotor learning experiments. We used our model to make the following predictions: learning enhances synchrony in neuronal populations and behaviorally relevant information flow across neuronal populations, enhanced sensory processing aids task-relevant motor performance and the relative ease of a particular movement in vivo depends on the amount of sensory information required to complete the movement.