Reinforcement Learning Neural-Network-Based Controller for Nonlinear Discrete-Time Systems With Input Constraints

Authors:
Pingan He;S. Jagannathan
Affiliations:
Dept. of Electr. & Comput. Eng., Missouri Univ., Rolla, MO;-
Venue:
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Year:
2007

Citing 0
Cited 18

Fuzzy rules emulated network and its application on nonlinear control systems

Applied Soft Computing
Wavelet based adaptive backstepping controller for a class of nonregular systems with input constraints

Expert Systems with Applications: An International Journal
A hybrid approach for design of stable adaptive fuzzy controllers employing Lyapunov theory and particle swarm optimization

IEEE Transactions on Fuzzy Systems
Nonlinear discrete-time controller with unknown systems identification based on fuzzy rules emulated network

Applied Soft Computing
Nonlinear discrete-time controller based on fuzzy-rule emulated network and shuttering condition

Applied Intelligence
Reinforcement learning and adaptive dynamic programming for feedback control

IEEE Circuits and Systems Magazine
Asymptotically stable adaptive critic design for uncertain nonlinear systems

ACC'09 Proceedings of the 2009 conference on American Control Conference
Adaptive dynamic programming: an introduction

IEEE Computational Intelligence Magazine
Direct adaptive controller for nonaffine discrete-time systems based on fuzzy rules emulated networks

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Optimal control laws for time-delay systems with saturating actuators based on heuristic dynamic programming

Neurocomputing
Wavelet based control for a class of delayed nonlinear systems with input constraints

Expert Systems with Applications: An International Journal
Nonlinear system control using self-evolving neural fuzzy inference networks with reinforcement evolutionary learning

Applied Soft Computing
A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems

Automatica (Journal of IFAC)
A multivariable predictive fuzzy PID control system

Applied Soft Computing
Neural-network-based zero-sum game for discrete-time nonlinear systems via iterative adaptive dynamic programming algorithm

Neurocomputing
Adaptive NN control for a class of strict-feedback discrete-time nonlinear systems with input saturation

ISNN'13 Proceedings of the 10th international conference on Advances in Neural Networks - Volume Part II
Neural-network-based optimal tracking control scheme for a class of unknown discrete-time nonlinear systems using iterative ADP algorithm

Neurocomputing
Adaptive Neural Control of a Hypersonic Vehicle in Discrete Time

Journal of Intelligent and Robotic Systems

Quantified Score

Hi-index	0.01

Visualization

Abstract

A novel adaptive-critic-based neural network (NN) controller in discrete time is designed to deliver a desired tracking performance for a class of nonlinear systems in the presence of actuator constraints. The constraints of the actuator are treated in the controller design as the saturation nonlinearity. The adaptive critic NN controller architecture based on state feedback includes two NNs: the critic NN is used to approximate the "strategic" utility function, whereas the action NN is employed to minimize both the strategic utility function and the unknown nonlinear dynamic estimation errors. The critic and action NN weight updates are derived by minimizing certain quadratic performance indexes. Using the Lyapunov approach and with novel weight updates, the uniformly ultimate boundedness of the closed-loop tracking error and weight estimates is shown in the presence of NN approximation errors and bounded unknown disturbances. The proposed NN controller works in the presence of multiple nonlinearities, unlike other schemes that normally approximate one nonlinearity. Moreover, the adaptive critic NN controller does not require an explicit offline training phase, and the NN weights can be initialized at zero or random. Simulation results justify the theoretical analysis