Unlike many soft computing applications, where it suffices to achieve a "good approximation most of the time," a control system must be stable all of the time. Learning a control law in real time therefore requires a fusion of soft computing techniques, to learn the appropriate control law, with hard computing techniques, to maintain the stability constraint and guarantee convergence. The objective of the paper is to describe an adaptive dynamic programming algorithm (ADPA) that fuses soft computing techniques, which learn the optimal cost (or return) functional for a stabilizable nonlinear system with unknown dynamics, with hard computing techniques, which verify the stability and convergence of the algorithm. Specifically, the algorithm is initialized with a (stabilizing) cost functional, and the system is run with the corresponding control law (defined by the Hamilton-Jacobi-Bellman equation); the resulting state trajectories are then used to update the cost functional in a soft computing mode. Hard computing techniques are used to show that this process is globally convergent, with stepwise stability, to the optimal cost functional/control law pair for an (unknown) input-affine system with an input-quadratic performance measure (modulo the appropriate technical conditions). Three specific implementations of the ADPA are developed: 1) for the linear case, 2) for the nonlinear case using a locally quadratic approximation to the cost functional, and 3) for the nonlinear case using a radial basis function approximation of the cost functional. The implementations are illustrated by applications to flight control.
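The loop described in the abstract (initialize with a stabilizing cost functional, run the system under the corresponding control law, update the cost functional from the observed trajectories) can be illustrated for the linear case with a minimal sketch. This is not the paper's implementation. It assumes a hypothetical two-state plant, a quadratic cost functional V(x) = xᵀPx, a known input matrix B, and a least-squares fit of P to simulated cost-to-go values. The names `rollout_costs`, `fit_quadratic_cost`, and `adp_linear` are made up for this example, and the matrix A is used only inside the simulator that stands in for the real plant.

```python
import numpy as np

# Hypothetical plant (assumption). A is used only by the simulator below;
# the learning step never reads it.
A = np.array([[0.0, 1.0], [-1.0, -2.0]])
B = np.array([[0.0], [1.0]])   # input matrix, assumed known
Q = np.eye(2)                  # state weight of the performance measure
R = np.array([[1.0]])          # input weight (input-quadratic cost)


def rollout_costs(K, X0, dt=0.01, T=20.0):
    """Run the closed loop u = -K x from each column of X0 (RK4 steps)
    and return the accumulated cost for each start state."""
    Acl = A - B @ K
    W = Q + K.T @ R @ K        # running cost is x^T W x when u = -K x

    def deriv(X):
        return Acl @ X, np.sum(X * (W @ X), axis=0)

    X = X0.copy()
    J = np.zeros(X0.shape[1])
    for _ in range(int(round(T / dt))):
        k1x, k1j = deriv(X)
        k2x, k2j = deriv(X + 0.5 * dt * k1x)
        k3x, k3j = deriv(X + 0.5 * dt * k2x)
        k4x, k4j = deriv(X + dt * k3x)
        X = X + dt / 6.0 * (k1x + 2 * k2x + 2 * k3x + k4x)
        J = J + dt / 6.0 * (k1j + 2 * k2j + 2 * k3j + k4j)
    return J


def fit_quadratic_cost(X0, J):
    """Least-squares fit of a symmetric P such that J ~= x0^T P x0."""
    x1, x2 = X0
    Phi = np.column_stack([x1**2, 2 * x1 * x2, x2**2])
    p11, p12, p22 = np.linalg.lstsq(Phi, J, rcond=None)[0]
    return np.array([[p11, p12], [p12, p22]])


def adp_linear(P0, iterations=10):
    """Alternate between the control law implied by P and a cost update
    fitted to trajectories generated under that control law."""
    # Start states chosen so the regression matrix has full column rank.
    X0 = np.array([[1.0, 0.0, 1.0, -1.0],
                   [0.0, 1.0, 1.0, 1.0]])
    P = P0
    for _ in range(iterations):
        # For V(x) = x^T P x the HJB control law is u = -R^{-1} B^T P x.
        K = np.linalg.solve(R, B.T @ P)
        J = rollout_costs(K, X0)
        P = fit_quadratic_cost(X0, J)
    return P, np.linalg.solve(R, B.T @ P)


# P0 = I gives a stabilizing initial control law for this particular plant.
P, K = adp_linear(np.eye(2))
print("P =", P)
print("K =", K)
```

In the linear case with exact cost evaluation, this alternation coincides with the classical policy iteration for the algebraic Riccati equation, so each intermediate gain should remain stabilizing and P should approach the Riccati solution. For the plant assumed here, solving that equation by hand gives K = [√2 - 1, √2 - 1]. The finite horizon `T` and step size `dt` are simulation conveniences, not part of the algorithm in the abstract.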