Data networks
Technical Note: \cal Q-Learning
Machine Learning
Risk sensitive control of Markov processes in countable state space
Systems & Control Letters
Stochastic approximation with two time scales
Systems & Control Letters
A one-measurement form of simultaneous perturbation stochastic approximation
Automatica (Journal of IFAC)
Asynchronous Stochastic Approximations
SIAM Journal on Control and Optimization
Some Pathological Traps for Stochastic Approximation
SIAM Journal on Control and Optimization
Actor-Critic--Type Learning Algorithms for Markov Decision Processes
SIAM Journal on Control and Optimization
Dynamic Programming and Optimal Control
Dynamic Programming and Optimal Control
Simulation Modeling and Analysis
Simulation Modeling and Analysis
Introduction to Reinforcement Learning
Introduction to Reinforcement Learning
Neuro-Dynamic Programming
The Relations Among Potentials, Perturbation Analysis,and Markov Decision Processes
Discrete Event Dynamic Systems
Use of the SAND spatial browser for digital government applications
Communications of the ACM
Q-Learning for Risk-Sensitive Control
Mathematics of Operations Research
Risk-Sensitive Optimal Control for Markov Decision Processes with Monotone Cost
Mathematics of Operations Research
ACM Transactions on Modeling and Computer Simulation (TOMACS)
Computer Networks: The International Journal of Computer and Telecommunications Networking
ACM Transactions on Modeling and Computer Simulation (TOMACS)
Infinite-horizon policy-gradient estimation
Journal of Artificial Intelligence Research
Experiments with infinite-horizon, policy-gradient estimation
Journal of Artificial Intelligence Research
Call admission control and routing in integrated services networks using neuro-dynamic programming
IEEE Journal on Selected Areas in Communications
Reinforcement learning in the presence of rare events
Proceedings of the 25th international conference on Machine learning
Hi-index | 0.00 |
A spatio-textual search engine, termed "STEWARD" is demonstrated where document similarity is based on both the textual similarity as well as the spatial proximity of the locations in the document to the spatial search input. STEWARD's performance is enhanced by the presence of a document tagger that is able to identify textual references to geographical entities. The user-interface of STEWARD provides the ability to browse results, thereby making it a valuable "knowledge discovery" tool.