Stochastic Optimal Control and Estimation Methods Adapted to the Noise Characteristics of the Sensorimotor System

Authors:
Emanuel Todorov
Affiliations:
Department of Cognitive Science, University of California San Diego, La Jolla CA 92093-0515.
Venue:
Neural Computation
Year:
2005

Citing 7
Cited 14

Numerical methods for stochastic control problems in continuous time

Numerical methods for stochastic control problems in continuous time
A computational description of the organization of human reaching and prehension

A computational description of the organization of human reaching and prehension
State-feedback control of systems with multiplicative noise via linear matrix inequalities

Systems & Control Letters
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Neuro-Dynamic Programming

Neuro-Dynamic Programming
Cosine tuning minimizes motor errors

Neural Computation
Technical Communique: Discrete-Time Optimal Control with Control-Dependent Noise and Generalized Riccati Difference Equations

Automatica (Journal of IFAC)

Different Predictions by the Minimum Variance and Minimum Torque-Change Models on the Skewness of Movement Velocity Profiles

Neural Computation
A State-Space Analysis for Reconstruction of Goal-Directed Movements Using Neural Signals

Neural Computation
Neural network learning of optimal Kalman prediction and control

Neural Networks
Neural learning of Kalman filtering, Kalman control, and system identification

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Influences of data filtering on human-computer interaction by gaze-contingent display and eye-tracking applications

Computers in Human Behavior
A Generalized Path Integral Control Approach to Reinforcement Learning

The Journal of Machine Learning Research
Brief paper: Improving the state estimation for optimal control of stochastic processes subject to multiplicative noise

Automatica (Journal of IFAC)
A fingerprint method for variability and robustness analysis of stochastically controlled cellular actuator arrays

International Journal of Robotics Research
An optimal feedback control framework for grasping objects with position uncertainty

Neural Computation
Evaluating gaze control on a multi-target sequencing task: The distribution of fixations is evidence of exploration optimisation

Computers in Biology and Medicine
Correlations in state space can cause sub-optimal adaptation of optimal feedback control models

Journal of Computational Neuroscience
Thalamic cooperation between the cerebellum and basal ganglia with a new tropism-based action-dependent heuristic dynamic programming method

Neurocomputing
Energy-based stochastic control of neural mass models suggests time-varying effective connectivity in the resting state

Journal of Computational Neuroscience
Movement duration, fitts's law, and an infinite-horizon optimal feedback control model for biological motor systems

Neural Computation

Quantified Score

Hi-index	0.00

Visualization

Abstract

Optimality principles of biological movement are conceptually appealing and straightforward to formulate. Testing them empirically, however, requires the solution to stochastic optimal control and estimation problems for reasonably realistic models of the motor task and the sensorimotor periphery. Recent studies have highlighted the importance of incorporating biologically plausible noise into such models. Here we extend the linear-quadratic-gaussian framework—currently the only framework where such problems can be solved efficiently—to include controldependent, state-dependent, and internal noise. Under this extended noise model, we derive a coordinate-descent algorithm guaranteed to converge to a feedback control law and a nonadaptive linear estimator optimal with respect to each other. Numerical simulations indicate that convergence is exponential, local minima do not exist, and the restriction to nonadaptive linear estimators has negligible effects in the control problems of interest. The application of the algorithm is illustrated in the context of reaching movements. A Matlab implementation is available at www.cogsci.ucsd.edu/~todorov.