Actor-Critic Models of Reinforcement Learning in the Basal Ganglia: From Natural to Artificial Rats

  • Authors:
  • Mehdi Khamassi; Loïc Lachèze; Benoît Girard; Alain Berthoz; Agnès Guillot

  • Affiliations:
  • AnimatLab, LIP6, Paris, France / LPPA, CNRS-Collège de France, Paris, France; AnimatLab, LIP6, Paris, France; AnimatLab, LIP6, Paris, France / LPPA, CNRS-Collège de France, Paris, France; LPPA, CNRS-Collège de France, Paris, France; AnimatLab, LIP6, Paris, France

  • Venue:
  • Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems
  • Year:
  • 2005


Abstract

Since 1995, numerous Actor-Critic architectures for reinforcement learning have been proposed as models of dopamine-like reinforcement learning mechanisms in the rat's basal ganglia. However, these models were usually tested on different tasks, making it difficult to compare their efficiency for an autonomous animat. We present here a comparison of four architectures in an animat performing the same reward-seeking task. This illustrates the consequences of different hypotheses about the management of different Actor sub-modules and Critic units, and about their more or less autonomously determined coordination. We show that the classical method of coordinating modules by a mixture of experts, gated by each module's performance, did not allow our task to be solved. We then address the question of which principle should be applied to combine these units efficiently. Improvements to Critic modeling, and the accuracy of Actor-Critic models on a natural task, are finally discussed in the perspective of our Psikharpax project--an artificial rat that must survive autonomously in unpredictable environments.
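The Actor-Critic scheme referred to above can be illustrated by a minimal tabular sketch: a Critic learns state values and emits a temporal-difference (TD) error, which plays the role of the dopamine-like reinforcement signal, and an Actor updates its action preferences from that same signal. This is a generic textbook version under illustrative assumptions (a toy chain environment, softmax action selection, hypothetical names), not the specific architectures compared in the paper:

```python
import numpy as np

# Minimal tabular Actor-Critic sketch (illustrative, not the paper's model).
# Critic: state values V(s), updated by the TD error delta, which plays the
# role of the dopamine-like reinforcement signal.
# Actor: action preferences H(s, a), turned into probabilities by softmax
# and updated by the same TD error.

rng = np.random.default_rng(0)

n_states, n_actions = 5, 2
V = np.zeros(n_states)               # Critic: value of each state
H = np.zeros((n_states, n_actions))  # Actor: preference for each action
alpha_c, alpha_a, gamma = 0.1, 0.1, 0.95

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def step(s, a):
    """Toy chain environment (hypothetical): action 1 moves right,
    action 0 resets to the start; reward only at the last state."""
    s2 = min(s + 1, n_states - 1) if a == 1 else 0
    r = 1.0 if s2 == n_states - 1 else 0.0
    return s2, r

for episode in range(500):
    s = 0
    for t in range(20):
        a = rng.choice(n_actions, p=softmax(H[s]))
        s2, r = step(s, a)
        delta = r + gamma * V[s2] - V[s]  # TD error (dopamine-like signal)
        V[s] += alpha_c * delta           # Critic update
        H[s, a] += alpha_a * delta        # Actor update
        s = s2
        if r > 0:
            break
```

After training on this toy chain, the Actor comes to prefer the rewarded direction and the Critic assigns positive value to the start state; the paper's question is how several such Actor sub-modules and Critic units should be coordinated when a single pair does not suffice.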