Dopamine: generalization and bonuses
Neural Networks - Computational models of neuromodulation
Control of exploitation-exploration meta-parameter in reinforcement learning
Neural Networks - Computational models of neuromodulation
Exploration Strategies for Model-based Learning in Multi-agent Systems: Exploration Strategies
Autonomous Agents and Multi-Agent Systems
Biasing Exploration in an Anticipatory Learning Classifier System
IWLCS '01 Revised Papers from the 4th International Workshop on Advances in Learning Classifier Systems
Hidden-Mode Markov Decision Processes for Nonstationary Sequential Decision Making
Sequence Learning - Paradigms, Algorithms, and Applications
Sequential Decision Making Based on Direct Search
Sequence Learning - Paradigms, Algorithms, and Applications
The Two Facets of the Exploration-Exploitation Dilemma
IAT '06 Proceedings of the IEEE/WIC/ACM international conference on Intelligent Agent Technology
Robots that learn language: developmental approach to human-machine conversations
EELC'06 Proceedings of the Third international conference on Emergence and Evolution of Linguistic Communication: symbol Grounding and Beyond
Scalable and efficient bayes-adaptive reinforcement learning based on monte-carlo tree search
Journal of Artificial Intelligence Research
Hi-index | 0.00 |