Machine Learning
Finite-time Analysis of the Multiarmed Bandit Problem
Machine Learning
Gambling in a rigged casino: The adversarial multi-armed bandit problem
FOCS '95 Proceedings of the 36th Annual Symposium on Foundations of Computer Science
Game Player Strategy Pattern Recognition by Using Radial Basis Function
CSSE '08 Proceedings of the 2008 International Conference on Computer Science and Software Engineering - Volume 01
Bandit based monte-carlo planning
ECML'06 Proceedings of the 17th European conference on Machine Learning
Hi-index | 0.00 |
Game AI controlled by UCT which achieves excellent performance in computer go can be applied to control non-player characters (NPCs) in video games. While, it is computation intensive algorithm, so applying it to on-line game is not suitable. But data collected from NPC controlled by UCT is able to be utilized to train Neuro-Controler. Furthermore, Neuro-Controler is an efficient algorithm due to its capability of extracting knowledge from training data which is generated from UCT. In order to obtain outstanding peiformance of Neurol-Controler, training data is a key factor but the structure of Neurol-Controler is also important. In this paper, the prey and predator game genre of Dead- End is utilized as a test-bed, the basic principle of UCT and Neurol-Controller is drawn, and the effectiveness of their application to game AI development is demonstrated.