Learning tetris using the noisy cross-entropy method

  • Authors:
  • István Szita;András Lörincz

  • Affiliations:
  • Department of Information Systems, Eötvös Loránd University, Budapest, Hungary;Department of Information Systems, Eötvös Loránd University, Budapest, Hungary

  • Venue:
  • Neural Computation
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

The cross-entropy method is an efficient and general optimization algorithm. However, its applicability in reinforcement learning (RL) seems to be limited because it often converges to suboptimal policies. We apply noise for preventing early convergence of the cross-entropy method, using Tetris, a computer game, for demonstration. The resulting policy outperforms previous RL algorithms by almost two orders of magnitude.