In this article, we introduce some recent research trends in adaptive/approximate dynamic programming (ADP), including variations in the structure of ADP schemes, the development of ADP algorithms, and applications of ADP schemes. For ADP algorithms, the focus is on sorting the iterative algorithms into two classes: those that require an initial stable policy and those that do not. It is generally believed that the latter class requires less computation, at the cost of losing the guarantee of system stability during the iteration process. In addition, many recent papers have provided convergence analyses for the algorithms they develop. Finally, we point out some topics for future study.
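
To make the two classes of iterative algorithms concrete, the following is a minimal sketch rather than anything taken from the article. It assumes a scalar discrete-time linear-quadratic problem, x[k+1] = a*x[k] + b*u[k] with stage cost q*x^2 + r*u^2 and value function V(x) = p*x^2, and it reads the two classes as policy iteration (which needs an initial stabilizing gain) and value iteration (which can start from p = 0). The function names, the plant parameters, and this mapping onto the article's two classes are all illustrative assumptions.

```python
def greedy_gain(a, b, r, p):
    """Gain k of the policy u = -k*x that is greedy with respect to V(x) = p*x^2."""
    return a * b * p / (r + b * b * p)


def value_iteration(a, b, q, r, p0=0.0, tol=1e-12, max_iter=10_000):
    """Iterate on the value directly; no stabilizing policy is needed to start.

    Returns the final p and the list of greedy gains visited. Early gains
    may fail to stabilize the plant (assumed scalar LQ setting).
    """
    p, gains = p0, []
    for _ in range(max_iter):
        k = greedy_gain(a, b, r, p)
        gains.append(k)
        # One-step backup of the current value under the greedy gain.
        p_next = q + r * k * k + (a - b * k) ** 2 * p
        if abs(p_next - p) < tol:
            return p_next, gains
        p = p_next
    return p, gains


def policy_iteration(a, b, q, r, k0, tol=1e-12, max_iter=1_000):
    """Alternate full policy evaluation and improvement from a stabilizing k0."""
    k, gains, p_prev = k0, [k0], None
    for _ in range(max_iter):
        c = a - b * k  # closed-loop pole under the current gain
        if abs(c) >= 1.0:
            raise ValueError("policy evaluation needs a stabilizing gain")
        # Policy evaluation: solve p = q + r*k^2 + c^2 * p exactly.
        p = (q + r * k * k) / (1.0 - c * c)
        # Policy improvement.
        k = greedy_gain(a, b, r, p)
        gains.append(k)
        if p_prev is not None and abs(p - p_prev) < tol:
            return p, gains
        p_prev = p
    return p, gains
```

With the hypothetical plant a = 1.2, b = 1, q = r = 1, calling `value_iteration(1.2, 1.0, 1.0, 1.0)` and `policy_iteration(1.2, 1.0, 1.0, 1.0, k0=1.0)` should lead to the same p. The first gain visited by value iteration is zero, which leaves the unstable plant uncontrolled, while every gain produced by policy iteration keeps the closed loop stable. In this scalar case the evaluation step is a single division; in higher dimensions it amounts to solving a Lyapunov-type equation at every iteration, which is one way to read the abstract's remark about computation.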