Stochastic approximation with two time scales

Authors:
Vivek S. Borkar
Venue:
Systems & Control Letters
Year:
1997

Cited 25

Two timescale analysis of the Alopex algorithm for optimization
Neural Computation

Two-timescale simultaneous perturbation stochastic approximation using deterministic perturbation sequences
ACM Transactions on Modeling and Computer Simulation (TOMACS)

On the Lock-in Probability of Stochastic Approximation
Combinatorics, Probability and Computing

A LEARNING ALGORITHM FOR DISCRETE-TIME STOCHASTIC CONTROL
Probability in the Engineering and Informational Sciences

Distributed Topology Control of Wireless Networks
WIOPT '05 Proceedings of the Third International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks

Online calibrated forecasts: Memory efficiency versus universality for learning in games
Machine Learning

STEWARD: demo of spatio-textual extraction on the web aiding the retrieval of documents
dg.o '07 Proceedings of the 8th annual international conference on Digital government research: bridging disciplines & domains

Brief paper: New algorithms of the Q-learning type
Automatica (Journal of IFAC)

An analysis of reinforcement learning with function approximation
Proceedings of the 25th international conference on Machine learning

Distributed topology control of wireless networks
Wireless Networks

A New Learning Algorithm for Optimal Stopping
Discrete Event Dynamic Systems

Reinforcement Learning: A Tutorial Survey and Recent Advances
INFORMS Journal on Computing

Fast gradient-descent methods for temporal-difference learning with linear function approximation
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning

Distributed iterative optimal resource allocation with concurrent updates of routing and flow control variables
IEEE/ACM Transactions on Networking (TON)

Natural actor-critic algorithms
Automatica (Journal of IFAC)

From Q(λ) to average Q-learning: efficient implementation of an asymptotic approximation
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2

A Convergent Online Single Time Scale Actor Critic Algorithm
The Journal of Machine Learning Research

Distributive stochastic learning for delay-optimal OFDMA power and subband allocation
IEEE Transactions on Signal Processing

Adaptive bases for reinforcement learning
ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I

Stochastic approximation algorithms for constrained optimization via simulation
ACM Transactions on Modeling and Computer Simulation (TOMACS)

Reinforcement learning for model building and variance-penalized control
Winter Simulation Conference

Dynamic robust power allocation games under channel uncertainty and time delays
Computer Communications

Learning to use the spectrum in self-configuring heterogenous networks: a logit equilibrium approach
Proceedings of the 5th International ICST Conference on Performance Evaluation Methodologies and Tools

Satisfying demands in a multicellular network: A universal power allocation algorithm
Computer Communications

Reinforcement learning algorithms with function approximation: Recent advances and applications
Information Sciences: an International Journal


Abstract