Asynchronous neurocomputing for optimal control and reinforcement learning with large state spaces

Authors:
Bruno Scherrer
Affiliations:
Computer Science and Artificial Intelligence Laboratory, NE43-783, 200 Technology Square, Cambridge, MA 02139-4307, USA and LORIA INRIA-Lorraine, CORTEX/MAIA Teams, Campus Scientifique BP 239, 545 ...
Venue:
Neurocomputing
Year:
2005

Citing 8
Cited 1

Parallel and distributed computation: numerical methods

Parallel and distributed computation: numerical methods
An adaptive neural network: the cerebral cortex

An adaptive neural network: the cerebral cortex
An Algorithm for Finding Best Matches in Logarithmic Expected Time

ACM Transactions on Mathematical Software (TOMS)
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Rates of Convergence for Variable Resolution Schemes in Optimal Control

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
DEA: An Architecture for Goal Planning and Classification

Neural Computation
Reinforcement learning: a survey

Journal of Artificial Intelligence Research
On the complexity of solving Markov decision problems

UAI'95 Proceedings of the Eleventh conference on Uncertainty in artificial intelligence

Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions

Neurocomputing

Quantified Score

Hi-index	0.01

Visualization

Abstract

We consider two machine learning related problems, optimal control and reinforcement learning. We show that, even when their state space is very large (possibly infinite), natural algorithmic solutions can be implemented in an asynchronous neurocomputing way, that is by an assembly of interconnected simple neuron-like units which does not require any synchronization. From a neuroscience perspective, this work might help understanding how an asynchronous assembly of simple units can give rise to efficient control. From a computational point of view, such neurocomputing architectures can exploit their massively parallel structure and be significantly faster than standard sequential approaches. The contributions of this paper are the following: (1) We introduce a theoretically sound methodology for designing a whole class of asynchronous neurocomputing algorithms. (2) We build an original asynchronous neurocomputing architecture for optimal control in a small state space, then we show how to improve this architecture so that also solves the reinforcement learning problem. (3) Finally, we show how to extend this architecture to address the case where the state space is large (possibly infinite) by using an asynchronous neurocomputing adaptive approximation scheme. We illustrate this approximation scheme on two continuous space control problems.