Adaptive dynamic programming

Authors:
J. J. Murray;C. J. Cox;G. G. Lendaris;R. Saeks
Affiliations:
Dept. of Electr. Eng., State Univ. of New York, Stony Brook, NY;-;-;-
Venue:
IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Year:
2002

Citing 0
Cited 29

A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems

Neural Networks
Multiple UAVs path planning algorithms: a comparative study

Fuzzy Optimization and Decision Making
On-Line Learning Control for Discrete Nonlinear Systems Via an Improved ADDHP Method

ISNN '07 Proceedings of the 4th international symposium on Neural Networks: Advances in Neural Networks
2009 Special Issue: Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence

Neural Networks
Reinforcement-learning-based output-feedback control of nonstrict nonlinear discrete-time systems with application to engine emission control

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Nonlinear system control using adaptive neural fuzzy networks based on a modified differential evolution

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews - Special issue on information reuse and integration
Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints

IEEE Transactions on Neural Networks
Reinforcement learning and adaptive dynamic programming for feedback control

IEEE Circuits and Systems Magazine
Optimal control of nonlinear systems using RBF neural network and adaptive extended Kalman filter

ACC'09 Proceedings of the 2009 conference on American Control Conference
Online actor critic algorithm to solve the continuous-time infinite horizon optimal control problem

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
LS-SVM based neural controller as optimized by particle swarm algorithm using dual heuristic dynamic programming

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Adaptive dynamic programming for discrete-time systems with infinite horizon and Ɛ -error bound in the performance cost

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
A retrospective on adaptive dynamic programming for control

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Adaptive dynamic programming-based optimal control of unknown affine nonlinear discrete-time systems

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
An improved method of DHP for optimal control in the clarifying process of sugar cane juice

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Generalized policy iteration for continuous-time systems

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Neural dynamic programming based temperature optimal control for cement calcined process

CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
Adaptive dynamic programming: an introduction

IEEE Computational Intelligence Magazine
Optimal control laws for time-delay systems with saturating actuators based on heuristic dynamic programming

Neurocomputing
PH optimal control in the clarifying process of sugar cane juice based on DHP

ICIC'10 Proceedings of the 6th international conference on Advanced intelligent computing theories and applications: intelligent computing
Optimal control for a class of unknown nonlinear systems via the iterative GDHP algorithm

ISNN'11 Proceedings of the 8th international conference on Advances in neural networks - Volume Part II
Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach

Neurocomputing
A new fuzzy identification method based on adaptive critic designs

ISNN'06 Proceedings of the Third international conference on Advances in Neural Networks - Volume Part I
Temperature control in water-gas shift reaction with adaptive dynamic programming

ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part II
The optimal control of discrete-time delay nonlinear system with dual heuristic dynamic programming

ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part I
Neural-network-based zero-sum game for discrete-time nonlinear systems via iterative adaptive dynamic programming algorithm

Neurocomputing
Neural-network-based optimal tracking control scheme for a class of unknown discrete-time nonlinear systems using iterative ADP algorithm

Neurocomputing
Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems

Automatica (Journal of IFAC)
Dual Heuristic dynamic Programming for nonlinear discrete-time uncertain systems with state delay

Neurocomputing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Unlike the many soft computing applications where it suffices to achieve a "good approximation most of the time," a control system must be stable all of the time. As such, if one desires to learn a control law in real-time, a fusion of soft computing techniques to learn the appropriate control law with hard computing techniques to maintain the stability constraint and guarantee convergence is required. The objective of the paper is to describe an adaptive dynamic programming algorithm (ADPA) which fuses soft computing techniques to learn the optimal cost (or return) functional for a stabilizable nonlinear system with unknown dynamics and hard computing techniques to verify the stability and convergence of the algorithm. Specifically, the algorithm is initialized with a (stabilizing) cost functional and the system is run with the corresponding control law (defined by the Hamilton-Jacobi-Bellman equation), with the resultant state trajectories used to update the cost functional in a soft computing mode. Hard computing techniques are then used to show that this process is globally convergent with stepwise stability to the optimal cost functional/control law pair for an (unknown) input affine system with an input quadratic performance measure (modulo the appropriate technical conditions). Three specific implementations of the ADPA are developed for 1) the linear case, 2) for the nonlinear case using a locally quadratic approximation to the cost functional, and 3) the nonlinear case using a radial basis function approximation of the cost functional; illustrated by applications to flight control.