A novel model of motor learning capable of developing an optimal movement control law online from scratch

Authors:
Yury P. Shimansky; Tao Kang; Jiping He
Affiliations:
Arizona Biodesign Institute, Arizona State University, Harrington Department of Bioengineering, USA; Arizona Biodesign Institute, Arizona State University, Harrington Department of Bioengineering, USA; Arizona Biodesign Institute, Arizona State University, Harrington Department of Bioengineering, USA and Huazhong University of Science and Technology, Control Engineering, China
Venue:
Biological Cybernetics
Year:
2004

Abstract

A computational model of a learning system (LS) is described that acquires the knowledge and skill necessary for optimal control of the dynamics of a multisegmental limb (the controlled object, or CO), starting from “knowing” only the dimensionality of the object’s state space. It is based on an optimal control problem setup different from that of reinforcement learning. The LS solves the optimal control problem online while practicing the manipulation of the CO. The system’s functional architecture comprises several adaptive components, each of which incorporates a number of mapping functions approximated by artificial neural networks. Besides the internal model of the CO’s dynamics and the adaptive controller that computes the control law, the LS includes a new type of internal model: a model of the minimal cost (IMmc) of moving the controlled object between a pair of states. That internal model appears critical for the LS’s capacity to develop an optimal movement trajectory. The IMmc interacts with the adaptive controller in a cooperative manner: the controller provides an initial approximation of an optimal control action, which is further optimized in real time based on the IMmc, and the IMmc in turn provides information for updating the controller. The LS’s performance was tested on the task of center-out reaching to eight randomly selected targets with a two-degree-of-freedom (2DOF) limb model. The LS reached an optimal level of performance in a few tens of trials. It also quickly adapted to movement perturbations produced by two different types of external force field. The results suggest that the proposed design of a self-optimized control system can serve as a basis for modeling motor learning that includes the formation and adaptive modification of the plan of a goal-directed movement.
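
The cooperative loop between the controller and the IMmc can be illustrated with a deliberately simplified sketch. It is not the authors' implementation: the 1-D integrator dynamics, the fixed quadratic stand-in for the IMmc, the linear controller gain, and every name and constant below (R, P, immc, forward_model, refine_action, train_controller) are assumptions made only for illustration. In the paper these mappings are learned by neural networks, and the limb has two degrees of freedom.

```python
import random

R = 1.0    # hypothetical weight on control effort
P = 1.618  # hypothetical curvature of the stand-in minimal-cost model


def immc(state, goal):
    """Stand-in for the IMmc: estimated minimal cost of moving state -> goal."""
    return P * (state - goal) ** 2


def forward_model(state, action):
    """Stand-in internal model of the controlled object (1-D integrator)."""
    return state + action


def refine_action(state, goal, action, steps=50, lr=0.1, eps=1e-4):
    """Improve the controller's initial guess by gradient descent on
    effort cost plus the IMmc estimate at the predicted next state."""
    def total_cost(u):
        return R * u * u + immc(forward_model(state, u), goal)

    for _ in range(steps):
        # Central-difference estimate of d(total_cost)/du.
        grad = (total_cost(action + eps) - total_cost(action - eps)) / (2 * eps)
        action -= lr * grad
    return action


def train_controller(trials=200, lr=0.2, seed=0):
    """Pull a linear controller u = gain * (goal - state) toward the
    refined actions, one random state/goal pair per trial."""
    rng = random.Random(seed)
    gain = 0.0  # the controller starts with no knowledge of the task
    for _ in range(trials):
        state, goal = rng.uniform(-1, 1), rng.uniform(-1, 1)
        error = goal - state
        guess = gain * error                      # controller's initial action
        refined = refine_action(state, goal, guess)  # optimized via the IMmc
        gain += lr * (refined - guess) * error    # delta-rule controller update
    return gain
```

Under these assumptions, train_controller() settles near P / (R + P), about 0.618, which is the gain that minimizes the one-step effort cost plus the stand-in IMmc. The sketch holds the IMmc fixed, whereas the abstract describes it as one of several adaptive components.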