2009 Special Issue: Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence

Authors:
Travis Dierks;Balaje T. Thumati;S. Jagannathan
Affiliations:
Department of Electrical and Computer Engineering, Missouri University of Science and Technology (formerly University of Missouri-Rolla), 1870 Miner Circle, Rolla, MO 65409, United States;Department of Electrical and Computer Engineering, Missouri University of Science and Technology (formerly University of Missouri-Rolla), 1870 Miner Circle, Rolla, MO 65409, United States;Department of Electrical and Computer Engineering, Missouri University of Science and Technology (formerly University of Missouri-Rolla), 1870 Miner Circle, Rolla, MO 65409, United States
Venue:
Neural Networks
Year:
2009

Citing 6
Cited 16

Handbook of Learning and Approximate Dynamic Programming (IEEE Press Series on Computational Intelligence)

Handbook of Learning and Approximate Dynamic Programming (IEEE Press Series on Computational Intelligence)
Adaptive dynamic programming-based optimal control of unknown affine nonlinear discrete-time systems

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Adaptive dynamic programming

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Discrete-Time Nonlinear HJB Solution Using Approximate Dynamic Programming: Convergence Proof

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Adaptive critic designs

IEEE Transactions on Neural Networks
Generalized Hamilton–Jacobi–Bellman Formulation -Based Neural Network Control of Affine Nonlinear Discrete-Time Systems

IEEE Transactions on Neural Networks

Learning Non-linear Multivariate Dynamics of Motion in Robotic Manipulators

International Journal of Robotics Research
Optimal control for a class of unknown nonlinear systems via the iterative GDHP algorithm

ISNN'11 Proceedings of the 8th international conference on Advances in neural networks - Volume Part II
Neural network solution of optimal control problem with control and state constraints

ICANN'11 Proceedings of the 21st international conference on Artificial neural networks - Volume Part II
Adaptive dynamic programming-based optimal control of unknown nonaffine nonlinear discrete-time systems with proof of convergence

Neurocomputing
2012 Special Issue: An iterative ε-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state

Neural Networks
Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming

Automatica (Journal of IFAC)
Temperature control in water-gas shift reaction with adaptive dynamic programming

ISNN'12 Proceedings of the 9th international conference on Advances in Neural Networks - Volume Part II
An iterative adaptive dynamic programming algorithm for optimal control of unknown discrete-time nonlinear systems with constrained inputs

Information Sciences: an International Journal
Output feedback direct adaptive neural network control for uncertain SISO nonlinear systems using a fuzzy estimator of the control error

Neural Networks
A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems

Automatica (Journal of IFAC)
Generalized Hamilton-Jacobi-Isaacs formulation-based neural network H∞ control for constrained input nonlinear systems

ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part I
Neural-network-based zero-sum game for discrete-time nonlinear systems via iterative adaptive dynamic programming algorithm

Neurocomputing
Neural network H∞ tracking control of nonlinear systems using GHJI method

ISNN'13 Proceedings of the 10th international conference on Advances in Neural Networks - Volume Part II
Neuro-optimal control for a class of unknown nonlinear dynamic systems using SN-DHP technique

Neurocomputing
Neural-network-based optimal tracking control scheme for a class of unknown discrete-time nonlinear systems using iterative ADP algorithm

Neurocomputing
Fixed-final-time optimal tracking control of input-affine nonlinear systems

Neurocomputing

Quantified Score

Hi-index	0.01

Visualization

Abstract

The optimal control of linear systems accompanied by quadratic cost functions can be achieved by solving the well-known Riccati equation. However, the optimal control of nonlinear discrete-time systems is a much more challenging task that often requires solving the nonlinear Hamilton-Jacobi-Bellman (HJB) equation. In the recent literature, discrete-time approximate dynamic programming (ADP) techniques have been widely used to determine the optimal or near optimal control policies for affine nonlinear discrete-time systems. However, an inherent assumption of ADP requires the value of the controlled system one step ahead and at least partial knowledge of the system dynamics to be known. In this work, the need of the partial knowledge of the nonlinear system dynamics is relaxed in the development of a novel approach to ADP using a two part process: online system identification and offline optimal control training. First, in the system identification process, a neural network (NN) is tuned online using novel tuning laws to learn the complete plant dynamics so that a local asymptotic stability of the identification error can be shown. Then, using only the learned NN system model, offline ADP is attempted resulting in a novel optimal control law. The proposed scheme does not require explicit knowledge of the system dynamics as only the learned NN model is needed. The proof of convergence is demonstrated. Simulation results verify theoretical conjecture.