Monte-Carlo Tree Search in Poker Using Expected Reward Distributions

Authors:
Guy Broeck;Kurt Driessens;Jan Ramon
Affiliations:
Department of Computer Science, Katholieke Universiteit Leuven, Belgium;Department of Computer Science, Katholieke Universiteit Leuven, Belgium;Department of Computer Science, Katholieke Universiteit Leuven, Belgium
Venue:
ACML '09 Proceedings of the 1st Asian Conference on Machine Learning: Advances in Machine Learning
Year:
2009

Citing 7
Cited 9

Finite-time Analysis of the Multiarmed Bandit Problem

Machine Learning
Artificial Intelligence: A Modern Approach

Artificial Intelligence: A Modern Approach
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Algorithms and assessment in computer poker

Algorithms and assessment in computer poker
A heads-up no-limit Texas Hold'em poker player: discretized betting models and automatically generated equilibrium-finding programs

Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 2
Efficient selectivity and backup operators in Monte-Carlo tree search

CG'06 Proceedings of the 5th international conference on Computers and games
Bandit based monte-carlo planning

ECML'06 Proceedings of the 17th European conference on Machine Learning

Computer poker: A review

Artificial Intelligence
Poker vision: playing cards and chips identification based on image processing

IbPRIA'11 Proceedings of the 5th Iberian conference on Pattern recognition and image analysis
Building a no limit texas hold'em poker agent based on game logs using supervised learning

AIS'11 Proceedings of the Second international conference on Autonomous and intelligent systems
Similarity-Based retrieval and solution re-use policies in the game of texas hold'em

ICCBR'10 Proceedings of the 18th international conference on Case-Based Reasoning Research and Development
Successful performance via decision generalisation in no limit texas hold'em

ICCBR'11 Proceedings of the 19th international conference on Case-Based Reasoning Research and Development
On combining decisions from multiple expert imitators for performance

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume One
Case-based strategies in computer poker

AI Communications
Adapting strategies to opponent models in incomplete information games: a reinforcement learning approach for poker

AIS'12 Proceedings of the Third international conference on Autonomous and Intelligent Systems
Decision generalisation from game logs in no limit texas Hold'em

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

We investigate the use of Monte-Carlo Tree Search (MCTS) within the field of computer Poker, more specifically No-Limit Texas Hold'em. The hidden information in Poker results in so called miximax game trees where opponent decision nodes have to be modeled as chance nodes. The probability distribution in these nodes is modeled by an opponent model that predicts the actions of the opponents. We propose a modification of the standard MCTS selection and backpropagation strategies that explicitly model and exploit the uncertainty of sampled expected values. The new strategies are evaluated as a part of a complete Poker bot that is, to the best of our knowledge, the first exploiting no-limit Texas Hold'em bot that can play at a reasonable level in games of more than two players.