Achieving master level play in 9×9 computer go

Authors:
Sylvain Gelly;David Silver
Affiliations:
Google, Zurich and Univ. Paris Sud, LRI, CNRS, INRIA, France;University of Alberta, Edmonton, Alberta, Canada
Venue:
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Year:
2008

Citing 8
Cited 22

The History Heuristic and Alpha-Beta Search Enhancements in Practice

IEEE Transactions on Pattern Analysis and Machine Intelligence
Computer Go

Artificial Intelligence - Chips challenging champions: games, computers and Artificial Intelligence
Learning to Predict by the Methods of Temporal Differences

Machine Learning
Combining online and offline knowledge in UCT

Proceedings of the 24th international conference on Machine learning
Reinforcement learning of local shape in the game of go

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Temporal difference learning applied to a high-performance game-playing program

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 1
Efficient selectivity and backup operators in Monte-Carlo tree search

CG'06 Proceedings of the 5th international conference on Computers and games
Bandit based monte-carlo planning

ECML'06 Proceedings of the 17th European conference on Machine Learning

2009 Special Issue: Goal-directed control and its antipodes

Neural Networks
Monte-Carlo exploration for deterministic planning

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Nested Monte-Carlo Expression Discovery

Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
Score bounded Monte-Carlo tree search

CG'10 Proceedings of the 7th international conference on Computers and games
Node-expansion operators for the UCT algorithm

CG'10 Proceedings of the 7th international conference on Computers and games
Monte-Carlo tree search and rapid action value estimation in computer Go

Artificial Intelligence
Evolving neural networks for geometric game-tree pruning

Proceedings of the 13th annual conference on Genetic and evolutionary computation
X-Armed Bandits

The Journal of Machine Learning Research
Applying UCT to boolean satisfiability

SAT'11 Proceedings of the 14th international conference on Theory and application of satisfiability testing
A methodology for learning players| styles from game records

International Journal of Artificial Intelligence and Soft Computing
Multi-armed bandits with episode context

Annals of Mathematics and Artificial Intelligence
Evolutionary learning of policies for MCTS simulations

Proceedings of the International Conference on the Foundations of Digital Games
Bayesian policy search with policy priors

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
Learning data transformation rules through examples: preliminary results

Proceedings of the Ninth International Workshop on Information Integration on the Web
Guiding combinatorial optimization with UCT

CPAIOR'12 Proceedings of the 9th international conference on Integration of AI and OR Techniques in Constraint Programming for Combinatorial Optimization Problems
Strong mitigation: nesting search for good policies within search for good reward

Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
UCD: Upper confidence bound for rooted directed acyclic graphs

Knowledge-Based Systems
Learning-Based test programming for programmers

ISoLA'12 Proceedings of the 5th international conference on Leveraging Applications of Formal Methods, Verification and Validation: technologies for mastering change - Volume Part I
Learning non-myopically from human-generated reward

Proceedings of the 2013 international conference on Intelligent user interfaces
Design with shape grammars and reinforcement learning

Advanced Engineering Informatics
Integrated task and motion planning in belief space

International Journal of Robotics Research
BoostingTree: parallel selection of weak learners in boosting, with application to ranking

Machine Learning

Quantified Score

Hi-index	0.00

Visualization

Abstract

Spatial scaffolding is a naturally occurring human teaching behavior, in which teachers use their bodies to spatially structure the learning environment to direct the attention of the learner. Robotic systems can take advantage of simple, highly reliable ...