Machine Learning
Temporal difference learning and TD-Gammon
Communications of the ACM
On verifying game designs and playing strategies using reinforcement learning
Proceedings of the 2001 ACM symposium on Applied computing
Sokoban: enhancing general single-agent search methods using domain knowledge
Artificial Intelligence - Special issue on heuristic search in artificial intelligence
Introduction to Reinforcement Learning
Introduction to Reinforcement Learning
Learning to Predict by the Methods of Temporal Differences
Machine Learning
Interactive Verification of Game Design and Playing Strategies
ICTAI '02 Proceedings of the 14th IEEE International Conference on Tools with Artificial Intelligence
Player Co-Modelling in a Strategy Board Game: Discovering How to Play Fast
Cybernetics and Systems
Learning and applying competitive strategies
AAAI'04 Proceedings of the 19th national conference on Artifical intelligence
Ensemble pruning using reinforcement learning
SETN'06 Proceedings of the 4th Helenic conference on Advances in Artificial Intelligence
Designing an Evolutionary Strategizing Machine for Game Playing and Beyond
IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Hi-index | 0.00 |
In this paper we review our work on the acquisition of game-playing capabilities by a computer, when the only source of knowledge comes from extended self-play and sparsely dispersed human (expert) play. We summarily present experiments that show how a reinforcement learning backbone coupled with neural networks for approximation can indeed serve as a mechanism of the acquisition of game playing skill and we derive game interestingness measures that are inexpensive and straightforward to compute, yet also capture the relative quality of the game playing engine. We draw direct analogues to classical genetic algorithms and we stress that evolutionary development should be coupled with more traditional, expert-designed paths. That way the learning computer is exposed to tutorial games without having to revert to domain knowledge, thus facilitating the knowledge engineering life-cycle.