The History Heuristic and Alpha-Beta Search Enhancements in Practice
IEEE Transactions on Pattern Analysis and Machine Intelligence
Artificial Intelligence - Chips challenging champions: games, computers and Artificial Intelligence
Learning to Predict by the Methods of Temporal Differences
Machine Learning
Combining online and offline knowledge in UCT
Proceedings of the 24th international conference on Machine learning
Reinforcement learning of local shape in the game of go
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Temporal difference learning applied to a high-performance game-playing program
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 1
Efficient selectivity and backup operators in Monte-Carlo tree search
CG'06 Proceedings of the 5th international conference on Computers and games
Bandit based monte-carlo planning
ECML'06 Proceedings of the 17th European conference on Machine Learning
2009 Special Issue: Goal-directed control and its antipodes
Neural Networks
Monte-Carlo exploration for deterministic planning
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Nested Monte-Carlo Expression Discovery
Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
Score bounded Monte-Carlo tree search
CG'10 Proceedings of the 7th international conference on Computers and games
Node-expansion operators for the UCT algorithm
CG'10 Proceedings of the 7th international conference on Computers and games
Monte-Carlo tree search and rapid action value estimation in computer Go
Artificial Intelligence
Evolving neural networks for geometric game-tree pruning
Proceedings of the 13th annual conference on Genetic and evolutionary computation
The Journal of Machine Learning Research
Applying UCT to boolean satisfiability
SAT'11 Proceedings of the 14th international conference on Theory and application of satisfiability testing
A methodology for learning players| styles from game records
International Journal of Artificial Intelligence and Soft Computing
Multi-armed bandits with episode context
Annals of Mathematics and Artificial Intelligence
Evolutionary learning of policies for MCTS simulations
Proceedings of the International Conference on the Foundations of Digital Games
Bayesian policy search with policy priors
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
Learning data transformation rules through examples: preliminary results
Proceedings of the Ninth International Workshop on Information Integration on the Web
Guiding combinatorial optimization with UCT
CPAIOR'12 Proceedings of the 9th international conference on Integration of AI and OR Techniques in Constraint Programming for Combinatorial Optimization Problems
Strong mitigation: nesting search for good policies within search for good reward
Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
UCD: Upper confidence bound for rooted directed acyclic graphs
Knowledge-Based Systems
Learning-Based test programming for programmers
ISoLA'12 Proceedings of the 5th international conference on Leveraging Applications of Formal Methods, Verification and Validation: technologies for mastering change - Volume Part I
Learning non-myopically from human-generated reward
Proceedings of the 2013 international conference on Intelligent user interfaces
Design with shape grammars and reinforcement learning
Advanced Engineering Informatics
Integrated task and motion planning in belief space
International Journal of Robotics Research
Hi-index | 0.00 |
Spatial scaffolding is a naturally occurring human teaching behavior, in which teachers use their bodies to spatially structure the learning environment to direct the attention of the learner. Robotic systems can take advantage of simple, highly reliable ...