Tuning Bandit Algorithms in Stochastic Environments
ALT '07 Proceedings of the 18th international conference on Algorithmic Learning Theory
Hoeffding and Bernstein races for selecting policies in evolutionary direct policy search
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
On the evolution of artificial Tetris players
CIG'09 Proceedings of the 5th international conference on Computational Intelligence and Games
Hi-index | 0.00 |
This article describes how racing procedures in evolution strategies can help reduce the number of evaluations. This idea is illustrated on learning Tetris players which can be addressed as a stochastic optimization problem. Different experiments show the benefits of the racing procedures in evolution strategies which can significantly reduce the number of evaluations.