In this correspondence, adaptive critic approximate dynamic programming designs are derived to solve the discrete-time zero-sum game in which the state and action spaces are continuous. This yields a forward-in-time reinforcement learning algorithm that converges to the Nash equilibrium of the corresponding zero-sum game. The results can be viewed as a way to solve the Riccati equation of the well-known discrete-time H∞ optimal control problem forward in time. Two schemes are presented: 1) heuristic dynamic programming, which solves for the value function of the game, and 2) dual heuristic dynamic programming, which solves for its costate. An H∞ autopilot design for an F-16 aircraft is presented to illustrate the results.
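To make the forward-in-time idea concrete, here is a minimal sketch for the special case of a linear discrete-time zero-sum game with quadratic cost, where a heuristic-dynamic-programming-style value iteration on V_j(x) = xᵀP_jx reduces to iterating the game Riccati recursion forward until it reaches the H∞ Riccati solution. The system matrices, weights, and attenuation level γ below are illustrative assumptions, not values from the paper, and the full scheme in the paper uses neural-network critics rather than this closed-form quadratic update.

```python
import numpy as np

# Hypothetical linear zero-sum game (illustrative values, not from the paper):
#   x_{k+1} = A x_k + B u_k + E w_k
#   stage cost: x'Qx + u'Ru - gamma^2 w'w  (u minimizes, w maximizes)
A = np.array([[0.9, 0.1],
              [0.0, 0.8]])
B = np.array([[0.0], [1.0]])   # control input matrix
E = np.array([[0.1], [0.0]])   # disturbance input matrix
Q = np.eye(2)
R = np.array([[1.0]])
gamma = 5.0                    # assumed attenuation level above the H-infinity bound

def riccati_step(P):
    """One HDP-style value-iteration step on the quadratic kernel P."""
    # Block matrix coupling the minimizing control and maximizing disturbance
    M = np.block([[R + B.T @ P @ B, B.T @ P @ E],
                  [E.T @ P @ B,     E.T @ P @ E - gamma**2 * np.eye(1)]])
    N = np.vstack([B.T @ P @ A,
                   E.T @ P @ A])
    return Q + A.T @ P @ A - N.T @ np.linalg.solve(M, N)

# Forward-in-time iteration starting from the zero value function
P = np.zeros((2, 2))
for _ in range(500):
    P_next = riccati_step(P)
    if np.linalg.norm(P_next - P) < 1e-12:
        P = P_next
        break
    P = P_next

# At convergence, P satisfies the game algebraic Riccati equation,
# i.e. it is a fixed point of the recursion above.
residual = np.linalg.norm(riccati_step(P) - P)
```

The saddle-point feedback policies for u and w then follow from the converged P via the same block matrices, which is the quantity the dual-heuristic variant tracks through the costate.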