Technical Note: \cal Q-Learning
Machine Learning
A Theory of Learning and Generalization: With Applications to Neural Networks and Control Systems
A Theory of Learning and Generalization: With Applications to Neural Networks and Control Systems
Introduction to Reinforcement Learning
Introduction to Reinforcement Learning
Neuro-Dynamic Programming
Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping
ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Springer Handbook of Robotics
Self-teaching adaptive dynamic programming for Gomoku
Neurocomputing
Online learning control by association and reinforcement
IEEE Transactions on Neural Networks
DHP Method for Ramp Metering of Freeway Traffic
IEEE Transactions on Intelligent Transportation Systems
Randomized Algorithms for Analysis and Control of Uncertain Systems: With Applications
Randomized Algorithms for Analysis and Control of Uncertain Systems: With Applications
Hi-index | 0.01 |
The paper proposes a supervised adaptive dynamic programming (SADP) algorithm for a full-range adaptive cruise control (ACC) system, which can be formulated as a dynamic programming problem with stochastic demands. The suggested ACC system has been designed to allow the host vehicle to drive both in highways and in Stop and Go (SG) urban scenarios. The ACC system can autonomously drive the host vehicle to a desired speed and/or a given distance from the target vehicle in both operational cases. Traditional adaptive dynamic programming (ADP) is a suitable tool to address the problem but training usually suffers from low convergence rates and hardly achieves an effective controller. A SADP algorithm which introduces the concept of inducing region is here introduced to overcome such training drawbacks. The SADP algorithm performs very well in all simulation scenarios and always better than more traditional controllers. The conclusion is that the proposed SADP algorithm is an effective control methodology able to effectively address the full-range ACC problem.