A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal-Difference Learning
Discrete Event Dynamic Systems
Performance Loss Bounds for Approximate Value Iteration with State Aggregation
Mathematics of Operations Research
Hi-index | 0.00 |