Learning to coordinate without sharing information
AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Temporal difference learning and TD-Gammon
Communications of the ACM
Intelligent agents in computer games
AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Multiagent learning using a variable learning rate
Artificial Intelligence
Introduction to Reinforcement Learning
Introduction to Reinforcement Learning
An Algorithm for Distributed Reinforcement Learning in Cooperative Multi-Agent Systems
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
SHOP: Simple Hierarchical Ordered Planner
IJCAI '99 Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence
ECCBR '08 Proceedings of the 9th European conference on Advances in Case-Based Reasoning
Real-time team-mate AI in games: a definition, survey, & critique
Proceedings of the Fifth International Conference on the Foundations of Digital Games
ICCBR'10 Proceedings of the 18th international conference on Case-Based Reasoning Research and Development
Imitating inscrutable enemies: learning from stochastic policy observation, retrieval and reuse
ICCBR'10 Proceedings of the 18th international conference on Case-Based Reasoning Research and Development
Integrated learning for goal-driven autonomy
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Bootstrapping learning from abstract models in games
International Journal of Bio-Inspired Computation
Hi-index | 0.00 |
In this paper we present RETALIATE, an online reinforcement learning algorithm for developing winning policies in team first-person shooter games. RETALIATE has three crucial characteristics: (1) individual BOT behavior is fixed although not known in advance, therefore individual BOTS work as "plugins", (2) RETALIATE models the problem of learning team tactics through a simple state formulation, (3) discount rates commonly used in Q-Iearning are not used. As a result of these characteristics, the application of the Q-learning algorithm results in the rapid exploration towards a winning policy against an opponent team. In our empirical evaluation we demonstrate that RETALIATE adapts well when the environment changes.