Learning subgoal sequences for planning

Authors:
David Ruby;Dennis Kibler
Affiliations:
Information & Computer Science, University of California, Irvine, Irvine, CA;Information & Computer Science, University of California, Irvine, Irvine, CA
Venue:
IJCAI'89 Proceedings of the 11th international joint conference on Artificial intelligence - Volume 1
Year:
1989

Citing 4
Cited 2

Depth-first iterative-deepening: an optimal admissible tree search

Artificial Intelligence
Chunking in Soar: The Anatomy of a General Learning Mechanism

Machine Learning
Experimental Goal Regression: A Method for Learning Problem-Solving Heuristics

Machine Learning
Learning effective search control knowledge: an explanation-based approach

Learning effective search control knowledge: an explanation-based approach

SteppingStone: an empirical and analytical evaluation

AAAI'91 Proceedings of the ninth National conference on Artificial intelligence - Volume 2
Linear-space best-first search: summary of results

AAAI'92 Proceedings of the tenth national conference on Artificial intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

A learning problem solver consists of three components: (1) a problem solver, (2) a memory of problem-solving knowledge, and (3) a learning component for deriving new problemsolving knowledge from experience. Previous learning problem solvers have acquired knowledge as macros, control knowledge, or cases. SteppingStone is a general learning problem solver that improves its performance by learning subgoal sequences. The underlying problem solver for SteppingStone is a combination of means-ends analysis and brute-force search. It depends primarily upon means-ends analysis and its problem-solving knowledge to solve the subgoals of the problem. Learning occurs only when this approach fails. Upon failure, search is applied and a new subgoal sequence is derived and added to its problem-solving knowledge. Before SteppingStone attempts to solve any of the problem subgoals, it first orders them with a domain independent heuristic which we call openness. Openness is used to order the subgoals to minimize interactions. Stepping-Stone's ability to improve its performance and scale to difficult problems is demonstrated with an implemented system. We show that a small memory of appropriate subgoals yields multiple orders of magnitude savings in problem-solving cost.