A Markovian process modeling for Pickomino

Authors:
Stéphane Cardon;Nathalie Chetcuti-Sperandio;Fabien Delorme;Sylvain Lagrue
Affiliations:
Univ Lille Nord de France, Lille, France and UArtois, CRIL, Lens, France and CNRS, UMR, Lens, France;Univ Lille Nord de France, Lille, France and UArtois, CRIL, Lens, France and CNRS, UMR, Lens, France;Univ Lille Nord de France, Lille, France and UArtois, CRIL, Lens, France and CNRS, UMR, Lens, France;Univ Lille Nord de France, Lille, France and UArtois, CRIL, Lens, France and CNRS, UMR, Lens, France
Venue:
CG'10 Proceedings of the 7th international conference on Computers and games
Year:
2010

Citing 7
Cited 0

Dynamic programming: deterministic and stochastic models

Dynamic programming: deterministic and stochastic models
Deep Blue

Artificial Intelligence - Chips challenging champions: games, computers and Artificial Intelligence
Programming backgammon using self-teaching neural nets

Artificial Intelligence - Chips challenging champions: games, computers and Artificial Intelligence
World-championship-caliber Scrabble

Artificial Intelligence - Chips challenging champions: games, computers and Artificial Intelligence
Markov Decision Processes: Discrete Stochastic Dynamic Programming

Markov Decision Processes: Discrete Stochastic Dynamic Programming
Introduction to Reinforcement Learning

Introduction to Reinforcement Learning
Solving checkers

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper deals with a nondeterministic game based on die rolls and on the "stop or continue" principle: Pickomino. During his turn, each participant has to make the best decisions first to choose the dice to keep, then to choose between continuing or stopping depending on the previous rolls and on the available resources. Markov Decision Processes (MDPs) offer the formal framework to model this game. The two main problems are first to determine the set of states, then to compute the transition probabilities. We provide in this paper original solutions to both problems: we provide (1) a compact representation of states and (2) a constructive method to compute the probability distributions, based on the partitioning of the space of roll results depending on a set of marked values. Finally, we show the efficiency of the proposed method through numerous experimental results: it turns out to be impressive compared to previous programs we developed.