Thalamic cooperation between the cerebellum and basal ganglia with a new tropism-based action-dependent heuristic dynamic programming method

Authors:
Xiaogang Ruan;Jing Chen;Naigong Yu
Affiliations:
Institute of Artificial Intelligence and Robots, Beijing University of Technology, Beijing, China;Institute of Artificial Intelligence and Robots, Beijing University of Technology, Beijing, China;Institute of Artificial Intelligence and Robots, Beijing University of Technology, Beijing, China
Venue:
Neurocomputing
Year:
2012

Citing 21
Cited 0

What are the computations of the cerebellum, the basal ganglia and the cerebral cortex?

Neural Networks - Special issue on organisation of computation in brain-like systems
Brains, Behavior and Robotics

Brains, Behavior and Robotics
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
TD Models of reward predictive responses in dopamine neurons

Neural Networks - Computational models of neuromodulation
Actor-critic models of the basal ganglia: new anatomical and computational perspectives

Neural Networks - Computational models of neuromodulation
Handbook of Learning and Approximate Dynamic Programming (IEEE Press Series on Computational Intelligence)

Handbook of Learning and Approximate Dynamic Programming (IEEE Press Series on Computational Intelligence)
Stochastic Optimal Control and Estimation Methods Adapted to the Noise Characteristics of the Sensorimotor System

Neural Computation
MOSAIC Model for Sensorimotor Learning and Control

Neural Computation
A fuzzy Actor-Critic reinforcement learning network

Information Sciences: an International Journal
Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions

Neurocomputing
Learning and generation of goal-directed arm reaching from scratch

Neural Networks
Using continuous action spaces to solve discrete problems

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Design of neural-fuzzy-based controller for two autonomously driven wheeled robot

Neurocomputing
Support vector machine optimal control for mobile wheeled inverted pendulums with unmodelled dynamics

Neurocomputing
Optimal control laws for time-delay systems with saturating actuators based on heuristic dynamic programming

Neurocomputing
Intelligent backstepping control for wheeled inverted pendulum

Expert Systems with Applications: An International Journal
A three-network architecture for on-line learning and optimization based on adaptive dynamic programming

Neurocomputing
Velocity and position control of a wheeled inverted pendulum by partial feedback linearization

IEEE Transactions on Robotics
Fuzzy inference system learning by reinforcement methods

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Adaptive critic designs

IEEE Transactions on Neural Networks
Online learning control by association and reinforcement

IEEE Transactions on Neural Networks

Quantified Score

Hi-index	0.01

Visualization

Abstract

In order to explore the possible cooperation mechanism between the cerebellum and basal ganglia in the central nervous system and to establish a more intelligent learning mechanism for robots, a new tropism-based ADHDP (action-dependent heuristic dynamic programming) learning mechanism involving the cortico-basal ganglia and cerebellar circuitry and the thalamic function is proposed. The cerebellum specializes in the actor part, while the basal ganglia are related to critic prediction. The thalamic function is considered as the tropism mechanism. Tropism value denoting the biological propensity is introduced to illustrate the degree of closing to the target. Although several motor control models have been proposed to explain the control and learning mechanism in the cerebellum and basal ganglia separately, it seems that the cooperation mechanism between them has not received much attention. In our proposed learning mechanism, the thalamic function and the cooperation between the cerebellum and basal ganglia are considered, and with a neurophysiological view, a striato-striatal lateral weight in the basal ganglia was added in the critic network. We present the detailed design architecture and explain how effective learning and optimization can be achieved with this novel tropism-based ADHDP architecture. Furthermore, we test its performance on the balance learning task of a two-wheeled self-balancing robot (TWSBR), which simulates the typical motor control and learning of the human body. In order to illustrate the effect of the thalamic function, some comparison researches about the balance learning problem have been done.