Adele Howe;Larry Don Pyeatt
-;-
Complexity of finite-horizon Markov decision process problems
Journal of the ACM (JACM)
Teaching robots to plan through Q-learning
Robotica
Using control theory for analysis of reinforcement learning and optimal policy properties in grid-world problems
ICIC'09 Proceedings of the Intelligent computing 5th international conference on Emerging intelligent computing technology and applications