Online learning control by association and reinforcement

Authors:
J. Si;Yu-Tsung Wang
Affiliations:
Dept. of Electr. Eng., Arizona State Univ., Tempe, AZ;-
Venue:
IEEE Transactions on Neural Networks
Year:
2001

Citing 0
Cited 44

New internal optimal neurocontrol for a series FACTS device in a power transmission line

Neural Networks - 2003 Special issue: Advances in neural networks research — IJCNN'03
Brief paper: Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control

Automatica (Journal of IFAC)
Dual heuristic programming based nonlinear optimal control for a synchronous generator

Engineering Applications of Artificial Intelligence
Approximate Dynamic Programming for Ship Course Control

ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Advances in Neural Networks
On-Line Learning Control for Discrete Nonlinear Systems Via an Improved ADDHP Method

ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Advances in Neural Networks
ADHDP for the pH Value Control in the Clarifying Process of Sugar Cane Juice

ISNN '08 Proceedings of the 5th international symposium on Neural Networks: Advances in Neural Networks
Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions

Neurocomputing
Performance Evaluation of Direct Heuristic Dynamic Programming using Control-Theoretic Measures

Journal of Intelligent and Robotic Systems
Temperature Control in Cement Rotary Kiln with Neural Network-Based Heuristic Dynamic Programming

ISNN 2009 Proceedings of the 6th International Symposium on Neural Networks: Advances in Neural Networks - Part II
Reinforcement-learning-based output-feedback control of nonstrict nonlinear discrete-time systems with application to engine emission control

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints

IEEE Transactions on Neural Networks
Reinforcement learning and adaptive dynamic programming for feedback control

IEEE Circuits and Systems Magazine
Constrained controller design for a class of nonlinear discrete-time uncertain systems

ACC'09 Proceedings of the 2009 conference on American Control Conference
LS-SVM based neural controller as optimized by particle swarm algorithm using dual heuristic dynamic programming

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Adaptive dynamic programming for discrete-time systems with infinite horizon and Ɛ -error bound in the performance cost

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
An improved method of DHP for optimal control in the clarifying process of sugar cane juice

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Coordinated multiple ramps metering based on neuro-fuzzy adaptive dynamic programming

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Adaptive neural controller design for synchronous generator based on heuristic dynamic programming

CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
Direct heuristic dynamic programming for nonlinear tracking control with filtered tracking error

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Adaptive dynamic programming: an introduction

IEEE Computational Intelligence Magazine
A new approach to fuzzy classifier systems and its application in self-generating neuro-fuzzy systems

Neurocomputing
Online adaptive utilization control for real-time embedded multiprocessor systems

Journal of Systems Architecture: the EUROMICRO Journal
PH optimal control in the clarifying process of sugar cane juice based on DHP

ICIC'10 Proceedings of the 6th international conference on Advanced intelligent computing theories and applications: intelligent computing
Adaptive critic design with ESN critic for bioprocess optimization

ICANN'10 Proceedings of the 20th international conference on Artificial neural networks: Part II
The implementation of Q-learning for problems in continuous state and action space using SOM-based fuzzy systems

ICCOMP'06 Proceedings of the 10th WSEAS international conference on Computers
Optimal control for a class of unknown nonlinear systems via the iterative GDHP algorithm

ISNN'11 Proceedings of the 8th international conference on Advances in neural networks - Volume Part II
An adaptive dynamic programming approach for closely-coupled MIMO system control

ISNN'11 Proceedings of the 8th international conference on Advances in neural networks - Volume Part III
Application of dual heuristic programming in excitation system of synchronous generators

ISNN'11 Proceedings of the 8th international conference on Advances in neural networks - Volume Part III
A three-network architecture for on-line learning and optimization based on adaptive dynamic programming

Neurocomputing
Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach

Neurocomputing
Reinforcement learning-based tuning algorithm applied to fuzzy identification

ISNN'06 Proceedings of the Third international conference on Advances in Neural Networks - Volume Part I
A new fuzzy identification method based on adaptive critic designs

ISNN'06 Proceedings of the Third international conference on Advances in Neural Networks - Volume Part I
Book reviews: Neural networks for modeling and control of dynamic systems: a practitioner's handbook

Automatica (Journal of IFAC)
Book review: Stochastic controls-Hamiltonian systems and HJB equations

Automatica (Journal of IFAC)
2012 Special Issue: A boundedness result for the direct heuristic dynamic programming

Neural Networks
2012 Special Issue: An iterative ε-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state

Neural Networks
Thalamic cooperation between the cerebellum and basal ganglia with a new tropism-based action-dependent heuristic dynamic programming method

Neurocomputing
Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming

Automatica (Journal of IFAC)
Optimal battery management with ADHDP in smart home environments

ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part II
An iterative adaptive dynamic programming algorithm for optimal control of unknown discrete-time nonlinear systems with constrained inputs

Information Sciences: an International Journal
Neuro-optimal control for a class of unknown nonlinear dynamic systems using SN-DHP technique

Neurocomputing
Neural-network-based optimal tracking control scheme for a class of unknown discrete-time nonlinear systems using iterative ADP algorithm

Neurocomputing
Full-range adaptive cruise control based on supervised adaptive dynamic programming

Neurocomputing
Reactive power control of grid-connected wind farm based on adaptive dynamic programming

Neurocomputing

Quantified Score

Hi-index	0.01

Visualization

Abstract

This paper focuses on a systematic treatment for developing a generic online learning control system based on the fundamental principle of reinforcement learning or more specifically neural dynamic programming. This online learning system improves its performance over time in two aspects: 1) it learns from its own mistakes through the reinforcement signal from the external environment and tries to reinforce its action to improve future performance; and 2) system states associated with the positive reinforcement is memorized through a network learning process where in the future, similar states will be more positively associated with a control action leading to a positive reinforcement. A successful candidate of online learning control design is introduced. Real-time learning algorithms is derived for individual components in the learning system. Some analytical insight is provided to give guidelines on the learning process took place in each module of the online learning control system