Parallel and distributed computation: numerical methods
Parallel and distributed computation: numerical methods
An adaptive neural network: the cerebral cortex
An adaptive neural network: the cerebral cortex
An Algorithm for Finding Best Matches in Logarithmic Expected Time
ACM Transactions on Mathematical Software (TOMS)
Introduction to Reinforcement Learning
Introduction to Reinforcement Learning
Rates of Convergence for Variable Resolution Schemes in Optimal Control
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
DEA: An Architecture for Goal Planning and Classification
Neural Computation
Reinforcement learning: a survey
Journal of Artificial Intelligence Research
On the complexity of solving Markov decision problems
UAI'95 Proceedings of the Eleventh conference on Uncertainty in artificial intelligence
Hi-index | 0.01 |
We consider two machine learning related problems, optimal control and reinforcement learning. We show that, even when their state space is very large (possibly infinite), natural algorithmic solutions can be implemented in an asynchronous neurocomputing way, that is by an assembly of interconnected simple neuron-like units which does not require any synchronization. From a neuroscience perspective, this work might help understanding how an asynchronous assembly of simple units can give rise to efficient control. From a computational point of view, such neurocomputing architectures can exploit their massively parallel structure and be significantly faster than standard sequential approaches. The contributions of this paper are the following: (1) We introduce a theoretically sound methodology for designing a whole class of asynchronous neurocomputing algorithms. (2) We build an original asynchronous neurocomputing architecture for optimal control in a small state space, then we show how to improve this architecture so that also solves the reinforcement learning problem. (3) Finally, we show how to extend this architecture to address the case where the state space is large (possibly infinite) by using an asynchronous neurocomputing adaptive approximation scheme. We illustrate this approximation scheme on two continuous space control problems.