Reinforcement learning based neural controllers for dynamic processes without exploration

  • Authors:
Frank-Florian Steege, André Hartmann, Erik Schaffernicht, Horst-Michael Gross

  • Affiliations:
Frank-Florian Steege: Ilmenau Technical University, Neuroinformatics and Cognitive Robotics Lab, Ilmenau, Germany, and Powitec Intelligent Technologies GmbH, Essen-Kettwig, Germany; André Hartmann: Powitec Intelligent Technologies GmbH, Essen-Kettwig, Germany; Erik Schaffernicht, Horst-Michael Gross: Ilmenau Technical University, Neuroinformatics and Cognitive Robotics Lab, Ilmenau, Germany

  • Venue:
ICANN'10: Proceedings of the 20th International Conference on Artificial Neural Networks, Part II
  • Year:
  • 2010

Abstract

In this paper we present a Reinforcement Learning (RL) approach for training neural adaptive controllers on complex control problems without expensive online exploration. The controller is based on Neural Fitted Q-Iteration (NFQ); its network is trained on the set of observed examples enriched with artificial data. With this training scheme, and unlike most existing approaches, the controller can learn offline from observed data of an already closed-loop controlled process, whose training samples are often sparse and uninformative. The proposed neural controller is evaluated on a modified, extended cartpole simulator and on the combustion control of a real waste-incineration plant, and successfully demonstrates its superiority.
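
The batch-mode idea behind the abstract can be illustrated with a short sketch. The following is a minimal, illustrative implementation of generic Neural Fitted Q-Iteration on a fixed transition batch, not the authors' exact training scheme: the function name nfq_iteration, the network size, the reward-maximization convention (the original NFQ is often phrased as cost minimization), and the discrete action set are all assumptions. In the paper's setting, the artificial data would simply be appended to the transition batch before training, so no online exploration is needed.

    import numpy as np
    from sklearn.neural_network import MLPRegressor

    def nfq_iteration(transitions, actions, gamma=0.95, n_sweeps=20):
        # transitions: list of (state, action, reward, next_state) tuples
        # collected offline from the already closed-loop controlled process
        states      = np.array([t[0] for t in transitions], dtype=float)
        acts        = np.array([[t[1]] for t in transitions], dtype=float)
        rewards     = np.array([t[2] for t in transitions], dtype=float)
        next_states = np.array([t[3] for t in transitions], dtype=float)

        X = np.hstack([states, acts])       # network input: (state, action)
        q_net = MLPRegressor(hidden_layer_sizes=(20, 20), max_iter=1000)

        targets = rewards.copy()            # first sweep: Q ~ immediate reward
        for _ in range(n_sweeps):
            # supervised fit on the whole batch; like NFQ, the net is
            # re-trained from scratch on the current pattern set
            q_net.fit(X, targets)
            # rebuild targets r + gamma * max_a' Q(s', a') with the freshly
            # fitted network (terminal-state handling omitted for brevity)
            q_next = np.column_stack([
                q_net.predict(np.hstack([next_states,
                                         np.full((len(next_states), 1), a)]))
                for a in actions
            ])
            targets = rewards + gamma * q_next.max(axis=1)
        return q_net

    def greedy_action(q_net, state, actions):
        # greedy policy: pick the action with the highest predicted Q-value
        q_vals = [q_net.predict(np.hstack([state, [a]]).reshape(1, -1))[0]
                  for a in actions]
        return actions[int(np.argmax(q_vals))]

Because every update in this sketch is a supervised fit on a fixed batch, the controller never has to act exploratorily on the plant, which is the property the abstract emphasizes for processes where exploration is expensive or unsafe.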