Reinforcement learning for POMDPs based on action values and stochastic optimization
Eighteenth national conference on Artificial intelligence
A Cultural Algorithm for POMDPs from Stochastic Inventory Control
HM '08 Proceedings of the 5th International Workshop on Hybrid Metaheuristics
Improving the performance of complex agent plans through reinforcement learning
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems: volume 1 - Volume 1
LearnPNP: a tool for learning agent behaviors
RoboCup 2010
Reinforcement learning through global stochastic search in N-MDPs
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part II
Policy oscillation is overshooting
Neural Networks
Hi-index | 0.00 |