Adaptive dynamic programming: an introduction

Authors:
Fei-Yue Wang;Huaguang Zhang;Derong Liu
Affiliations:
Chinese Academy of Sciences, China and University of Arizona;Northeastern University, China;Chinese Academy of Sciences, China
Venue:
IEEE Computational Intelligence Magazine
Year:
2009

Citing 52
Cited 0

Building and understanding adaptive systems: a statistical/numerical approach to factory automation and brain research

IEEE Transactions on Systems, Man and Cybernetics
A menu of designs for reinforcement learning over time

Neural networks for control
Practical Issues in Temporal Difference Learning

Machine Learning
Adaptive critic designs: a case study for neurocontrol

Neural Networks
Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation

Automatica (Journal of IFAC)
Approximate solutions to the time-invariant Hamilton-Jacobi-Bellman equation

Journal of Optimization Theory and Applications
Applied Optimal Control and Estimation

Applied Optimal Control and Estimation
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Neuro-Dynamic Programming

Neuro-Dynamic Programming
Art and Theory of Dynamic Programming

Art and Theory of Dynamic Programming
Dynamic Programming

Dynamic Programming
A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems

Neural Networks
Brief paper: Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control

Automatica (Journal of IFAC)
Brief paper: A neural network solution for fixed-final time optimal control of nonlinear systems

Automatica (Journal of IFAC)
Approximate Dynamic Programming: Solving the Curses of Dimensionality (Wiley Series in Probability and Statistics)

Approximate Dynamic Programming: Solving the Curses of Dimensionality (Wiley Series in Probability and Statistics)
Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions

Neurocomputing
Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints

IEEE Transactions on Neural Networks
Adaptive dynamic programming

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Reinforcement learning-based output feedback control of nonlinear systems with input constraints

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Adaptive critic autopilot design of Bank-to-turn missiles using fuzzy basis function networks

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Second-order training of adaptive critics for online process control

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Adaptive Critic Designs for Discrete-Time Zero-Sum Games With Application to Control

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Reinforcement Learning Neural-Network-Based Controller for Nonlinear Discrete-Time Systems With Input Constraints

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Ensemble Algorithms in Reinforcement Learning

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Comparison of Adaptive Critic-Based and Classical Wide-Area Controllers for Power Systems

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Incoherent Control of Quantum Systems With Wavefunction-Controllable Subspaces via Quantum Reinforcement Learning

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Higher Level Application of ADP: A Next Phase for the Control Field?

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Adaptive Critic Learning Techniques for Engine Torque and Air–Fuel Ratio Control

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Direct Heuristic Dynamic Programming for Damping Oscillations in a Large Power System

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Discrete-Time Nonlinear HJB Solution Using Approximate Dynamic Programming: Convergence Proof

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Issues on Stability of ADP Feedback Controllers for Dynamical Systems

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Hamilton–Jacobi–Bellman Equations and Approximate Dynamic Programming on Time Scales

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
A Novel Infinite-Time Optimal Tracking Control Scheme for a Class of Discrete-Time Nonlinear Systems via the Greedy HDP Iteration Algorithm

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Adaptive Feedback Control by Constrained Approximate Dynamic Programming

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Decentralized Bayesian Search Using Approximate Dynamic Programming Methods

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Missile defense and interceptor allocation by neuro-dynamic programming

IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
Intelligent supply chain management using adaptive critic learning

IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
Fully Evolvable Optimal Neurofuzzy Controller Using Adaptive Critic Designs

IEEE Transactions on Fuzzy Systems
Stable adaptive neuro-control design via Lyapunov function derivative estimation

Automatica (Journal of IFAC)
Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach

Automatica (Journal of IFAC)
A neighboring optimal adaptive critic for missile guidance

Mathematical and Computer Modelling: An International Journal
Adaptive critic designs

IEEE Transactions on Neural Networks
Online learning control by association and reinforcement

IEEE Transactions on Neural Networks
Dynamic re-optimization of a fed-batch fermentor using adaptive critic designs

IEEE Transactions on Neural Networks
Comparison of heuristic dynamic programming and dual heuristic programming adaptive critics for neurocontrol of a turbogenerator

IEEE Transactions on Neural Networks
Helicopter trimming and tracking control using direct neural dynamic programming

IEEE Transactions on Neural Networks
A self-learning call admission control scheme for CDMA cellular networks

IEEE Transactions on Neural Networks
Continuous-Time Adaptive Critics

IEEE Transactions on Neural Networks
SVM-Based Tree-Type Neural Networks as a Critic in Adaptive Critic Designs for Control

IEEE Transactions on Neural Networks
Robust/Optimal Temperature Profile Control of a High-Speed Aerospace Vehicle Using Neural Networks

IEEE Transactions on Neural Networks
Fixed-Final-Time-Constrained Optimal Control of Nonlinear Systems Using Neural Network HJB Approach

IEEE Transactions on Neural Networks
Generalized Hamilton–Jacobi–Bellman Formulation -Based Neural Network Control of Affine Nonlinear Discrete-Time Systems

IEEE Transactions on Neural Networks

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this article, we introduce some recent research trends within the field of adaptive/approximate dynamic programming (ADP), including the variations on the structure of ADP schemes, the development of ADP algorithms and applications of ADP schemes. For ADP algorithms, the point of focus is that iterative algorithms of ADP can be sorted into two classes: one class is the iterative algorithm with initial stable policy; the other is the one without the requirement of initial stable policy. It is generally believed that the latter one has less computation at the cost of missing the guarantee of system stability during iteration process. In addition, many recent papers have provided convergence analysis associated with the algorithms developed. Furthermore, we point out some topics for future studies.