To create neuro-controlled game opponent from UCT-created data

Authors:
Fan Xie;Suoju He;Xiao Liu;Xingguo Li;Junping Du;Jiajian Yang;Yiwen Fu;Yang Chen;Junping Wang;Zhiqing Liu;Qiliang Zhu
Affiliations:
Beijing University of Posts and Telecommunications, Beijing, China, 100876, Beijing, China;Beijing University of Posts and Telecommunications, Beijing, China, 100876, beijing, China;Beijing University of Posts and Telecommunications, Beijing, China, 100876, beijing, China;Beijing University of Posts and Telecommunications, Beijing, China, 100876, beijing, China;Beijing University of Posts and Telecommunications, Beijing, China, 100876, beijing, China;Beijing University of Posts and Telecommunications, Beijing, China, 100876, beijing, China;Beijing University of Posts and Telecommunications, Beijing, China, 100876, beijing, China;Beijing University of Posts and Telecommunications, Beijing, China, 100876, beijing, China;Beijing University of Posts and Telecommunications, Beijing, China, 100876, beijing, China;Beijing University of Posts and Telecommunications, Beijing, China, 100876, beijing, China;Beijing University of Posts and Telecommunications, Beijing, China, 100876, beijing, China
Venue:
Proceedings of the first ACM/SIGEVO Summit on Genetic and Evolutionary Computation
Year:
2009

Citing 4
Cited 0

Machine Learning

Machine Learning
Finite-time Analysis of the Multiarmed Bandit Problem

Machine Learning
Gambling in a rigged casino: The adversarial multi-armed bandit problem

FOCS '95 Proceedings of the 36th Annual Symposium on Foundations of Computer Science
Bandit based monte-carlo planning

ECML'06 Proceedings of the 17th European conference on Machine Learning

Quantified Score

Hi-index	0.00

Visualization

Abstract

Adaptive Game AI improves adaptability of opponent AI as well as the challenge level of the gameplay, as a result the entertainment of game is augmented. Opponent game AI is usually implemented by scripted rules in video games, but the most updated algorithm of UCT (Upper Confidence bound for Trees) which perform well in computer go can also be used to achieve excellent result to control non-player characters (NPCs) in video games. However, due to computational intensiveness of UCT, it is actually not suitable for Online Games. As it is already known that UCT can create near optimal control, so it is possible to create Neuro-Controlled Game Opponent by off-line learning from the UCT created sample data; finally Neuro-Controlled Game Opponent for Online Games from UCT-Created Data without worry about computational intensiveness is generated. And also if the optimization approach of Neuro-Evolution is applied to the above generated Neuro-Controller, the performance of the opponent AI is enhanced even further.