Reinforcement learning-based output feedback control of nonlinear systems with input constraints

Authors:
P. He;S. Jagannathan
Affiliations:
Dept. of Electr. & Comput. Eng., Univ. of Missouri, Rolla, MO, USA;-
Venue:
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Year:
2005

Citing 0
Cited 6

Brief paper: Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control

Automatica (Journal of IFAC)
Reinforcement-learning-based output-feedback control of nonstrict nonlinear discrete-time systems with application to engine emission control

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Direct heuristic dynamic programming for nonlinear tracking control with filtered tracking error

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Adaptive dynamic programming: an introduction

IEEE Computational Intelligence Magazine
Behavioral-fusion control based on reinforcement learning

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Neural observer-based adaptive compensation control for nonlinear time-varying delays systems with input constraints

Expert Systems with Applications: An International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

A novel neural network (NN)-based output feedback controller with magnitude constraints is designed to deliver a desired tracking performance for a class of multi-input and multi-output (MIMO) strict feedback nonlinear discrete-time systems. Reinforcement learning is proposed for the output feedback controller, which uses three NNs: 1) an NN observer to estimate the system states with the input-output data, 2) a critic NN to approximate certain strategic utility function, and 3) an action NN to minimize both the strategic utility function and the unknown dynamics estimation errors. Using the Lyapunov approach, the uniformly ultimate boundedness (UUB) of the state estimation errors, the tracking errors and weight estimates is shown.