A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal-Difference Learning
Discrete Event Dynamic Systems
Performance Loss Bounds for Approximate Value Iteration with State Aggregation
Mathematics of Operations Research
An analysis of reinforcement learning with function approximation
Proceedings of the 25th international conference on Machine learning
ICIC'09 Proceedings of the Intelligent computing 5th international conference on Emerging intelligent computing technology and applications
Approximate Dynamic Programming via a Smoothed Linear Program
Operations Research
The Journal of Machine Learning Research
Policy oscillation is overshooting
Neural Networks
Hi-index | 0.00 |