Two timescale analysis of the Alopex algorithm for optimization
Neural Computation
ACM Transactions on Modeling and Computer Simulation (TOMACS)
On the Lock-in Probability of Stochastic Approximation
Combinatorics, Probability and Computing
A LEARNING ALGORITHM FOR DISCRETE-TIME STOCHASTIC CONTROL
Probability in the Engineering and Informational Sciences
Distributed Topology Control of Wireless Networks
WIOPT '05 Proceedings of the Third International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks
STEWARD: demo of spatio-textual extraction on the web aiding the retrieval of documents
dg.o '07 Proceedings of the 8th annual international conference on Digital government research: bridging disciplines & domains
Brief paper: New algorithms of the Q-learning type
Automatica (Journal of IFAC)
An analysis of reinforcement learning with function approximation
Proceedings of the 25th international conference on Machine learning
Distributed topology control of wireless networks
Wireless Networks
A New Learning Algorithm for Optimal Stopping
Discrete Event Dynamic Systems
Reinforcement Learning: A Tutorial Survey and Recent Advances
INFORMS Journal on Computing
Fast gradient-descent methods for temporal-difference learning with linear function approximation
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
IEEE/ACM Transactions on Networking (TON)
Natural actor-critic algorithms
Automatica (Journal of IFAC)
From Q(λ) to average Q-learning: efficient implementation of an asymptotic approximation
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
A Convergent Online Single Time Scale Actor Critic Algorithm
The Journal of Machine Learning Research
Distributive stochastic learning for delay-optimal OFDMA power and subband allocation
IEEE Transactions on Signal Processing
Adaptive bases for reinforcement learning
ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I
Stochastic approximation algorithms for constrained optimization via simulation
ACM Transactions on Modeling and Computer Simulation (TOMACS)
Reinforcement learning for model building and variance-penalized control
Winter Simulation Conference
Dynamic robust power allocation games under channel uncertainty and time delays
Computer Communications
Learning to use the spectrum in self-configuring heterogenous networks: a logit equilibrium approach
Proceedings of the 5th International ICST Conference on Performance Evaluation Methodologies and Tools
Satisfying demands in a multicellular network: A universal power allocation algorithm
Computer Communications
Reinforcement learning algorithms with function approximation: Recent advances and applications
Information Sciences: an International Journal
Hi-index | 0.01 |