Temporal difference learning and TD-Gammon
Communications of the ACM
Co-Evolution in the Successful Learning of Backgammon Strategy
Machine Learning
A view of the EM algorithm that justifies incremental, sparse, and other variants
Learning in graphical models
Hi-index | 0.00 |
A probabilistic neural network is applied as a tool to approximate the statistical evaluation function for a simple version of the game Tic-Tac-Toe. We solve the problem by a sequential estimation of the underlying discrete distribution mixture of product components.