Gradient estimation for discrete-event systems by measure-valued differentiation
ACM Transactions on Modeling and Computer Simulation (TOMACS)
Reinforcement learning and adaptive dynamic programming for feedback control
IEEE Circuits and Systems Magazine
On-Line Policy Gradient Estimation with Multi-Step Sampling
Discrete Event Dynamic Systems
A Convergent Online Single Time Scale Actor Critic Algorithm
The Journal of Machine Learning Research
A Perturbation Analysis Approach to Phantom Estimators for Waiting Times in the G/G/1 Queue
Discrete Event Dynamic Systems
Delay-optimal resource allocation for OFDMA systems via stochastic approximation
GLOBECOM'09 Proceedings of the 28th IEEE conference on Global telecommunications
Delay-optimal power and subcarrier allocation for OFDMA systems via stochastic approximation
IEEE Transactions on Wireless Communications
Threshold optimization for rate adaptation algorithms in IEEE 802.11 WLANs
IEEE Transactions on Wireless Communications
Distributive stochastic learning for delay-optimal OFDMA power and subband allocation
IEEE Transactions on Signal Processing
IEEE Transactions on Wireless Communications
Stochastic control via direct comparison
Discrete Event Dynamic Systems
Simulation model calibration with correlated knowledge-gradients
Winter Simulation Conference
Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming
Mathematics of Operations Research
Admission control with elastic QoS for video on demand systems
International Journal of Automation and Computing
On-line coordination: Event interaction and state communication between cooperative agents
Web Intelligence and Agent Systems
Hi-index | 0.00 |