Learning heuristic functions for large state spaces

  • Authors:
  • Shahab Jabbari Arfaee; Sandra Zilles; Robert C. Holte

  • Affiliations:
  • University of Alberta, Department of Computing Science, Edmonton, Alberta, Canada T6G 2H8; University of Regina, Department of Computer Science, Regina, Saskatchewan, Canada S4S 0A2; University of Alberta, Department of Computing Science, Edmonton, Alberta, Canada T6G 2H8

  • Venue:
  • Artificial Intelligence
  • Year:
  • 2011

Abstract

We investigate the use of machine learning to create effective heuristics for search algorithms such as IDA* or heuristic-search planners such as FF. Our method aims to generate a sequence of heuristics from a given weak heuristic h_0 and a set of unsolved training instances using a bootstrapping procedure. The training instances that can be solved using h_0 provide training examples for a learning algorithm that produces a heuristic h_1 that is expected to be stronger than h_0. If h_0 is so weak that it cannot solve any of the given instances, we use random walks backward from the goal state to create a sequence of successively more difficult training instances, starting with ones that are guaranteed to be solvable by h_0. The bootstrap process is then repeated using h_i in lieu of h_{i-1} until a sufficiently strong heuristic is produced. We test this method on the 24-sliding-tile puzzle, the 35-pancake puzzle, Rubik's Cube, and the 20-blocks world. In every case our method produces a heuristic that allows IDA* to solve randomly generated problem instances quickly with solutions close to optimal. The total time for the bootstrap process to create strong heuristics for these large state spaces is on the order of days. To make the process effective when only a single problem instance needs to be solved, we present a variation in which the bootstrap learning of new heuristics is interleaved with problem-solving using the initial heuristic and whatever heuristics have been learned so far. This substantially reduces the total time needed to solve a single instance, while the solutions obtained are still close to optimal.
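
The bootstrapping loop described above can be sketched compactly. The following Python sketch is illustrative only, not the authors' implementation: the solve and learn callables, the per-round resource budget, and the choice to label every state on a solution path with its remaining cost are assumptions introduced here, and the backward-random-walk generation of easier instances and the interleaved single-instance variant are omitted.

```python
from typing import Callable, List, Optional, Tuple

# Hypothetical type aliases for illustration only (not from the paper).
State = object                              # a search state, e.g. a tile configuration
Heuristic = Callable[[State], float]        # maps a state to an estimated cost-to-goal
Example = Tuple[State, int]                 # (state, observed solution cost)


def bootstrap_heuristic(
    h0: Heuristic,
    instances: List[State],
    solve: Callable[[State, Heuristic], Optional[List[State]]],
    learn: Callable[[List[Example]], Heuristic],
    max_rounds: int = 10,
) -> Heuristic:
    """Strengthen a weak heuristic by bootstrapping on solvable instances.

    Each round, the instances solvable with the current heuristic (within
    the solver's time/node budget) yield labelled training examples; a new
    heuristic is learned from them and the process repeats on the
    remaining, harder instances.
    """
    h = h0
    for _ in range(max_rounds):
        examples: List[Example] = []
        unsolved: List[State] = []
        for s in instances:
            path = solve(s, h)              # e.g. IDA* with a resource limit
            if path is None:
                unsolved.append(s)
                continue
            # Label every state on the solution path with its remaining cost.
            for depth, state in enumerate(path):
                examples.append((state, len(path) - 1 - depth))
        if not examples or not unsolved:
            break                           # nothing learned, or everything solved
        h = learn(examples)                 # train the next, hopefully stronger, heuristic
        instances = unsolved                # continue on the instances still too hard
    return h
```

In this reading, each learned heuristic plays the role of h_i and is produced from examples gathered with h_{i-1}; how the learner generalizes from those examples is left entirely to the supplied learn callable.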