SIAM Journal on Control and Optimization
Robust nonlinear control design: state-space and Lyapunov techniques
Robust nonlinear control design: state-space and Lyapunov techniques
Analysis and modification of Newton's method for algebraic Riccati equations
Mathematics of Computation
Stabilization of Nonlinear Uncertain Systems
Stabilization of Nonlinear Uncertain Systems
Automatica (Journal of IFAC)
Reinforcement learning and adaptive dynamic programming for feedback control
IEEE Circuits and Systems Magazine
Online actor critic algorithm to solve the continuous-time infinite horizon optimal control problem
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Generalized policy iteration for continuous-time systems
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
Automatica (Journal of IFAC)
Optimal control for a class of unknown nonlinear systems via the iterative GDHP algorithm
ISNN'11 Proceedings of the 8th international conference on Advances in neural networks - Volume Part II
Automatica (Journal of IFAC)
Automatica (Journal of IFAC)
Automatica (Journal of IFAC)
Automatica (Journal of IFAC)
Information Sciences: an International Journal
Reinforcement learning algorithms with function approximation: Recent advances and applications
Information Sciences: an International Journal
Hi-index | 22.16 |
In this paper we propose a new scheme based on adaptive critics for finding online the state feedback, infinite horizon, optimal control solution of linear continuous-time systems using only partial knowledge regarding the system dynamics. In other words, the algorithm solves online an algebraic Riccati equation without knowing the internal dynamics model of the system. Being based on a policy iteration technique, the algorithm alternates between the policy evaluation and policy update steps until an update of the control policy will no longer improve the system performance. The result is a direct adaptive control algorithm which converges to the optimal control solution without using an explicit, a priori obtained, model of the system internal dynamics. The effectiveness of the algorithm is shown while finding the optimal-load-frequency controller for a power system.