Transferring learned control-knowledge between planners

Authors:
Susana Fernández;Ricardo Aler;Daniel Borrajo
Affiliations:
Universidad Carlos III de Madrid, Madrid, Spain;Universidad Carlos III de Madrid, Madrid, Spain;Universidad Carlos III de Madrid, Madrid, Spain
Venue:
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Year:
2007

Citing 7
Cited 1

Lazy Incremental Learning of Control Knowledge for EfficientlyObtaining Quality Plans

Artificial Intelligence Review - Special issue on lazy learning
Explanation-Based Generalization: A Unifying View

Machine Learning
Explanation-Based Learning: An Alternative View

Machine Learning
Temporal Planning with Mutual Exclusion Reasoning

IJCAI '99 Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence
Learning-assisted automated planning: looking back, taking stock, going forward

AI Magazine
Behavior transfer for value-function-based reinforcement learning

Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Probabilistic policy reuse in a reinforcement learning agent

AAMAS '06 Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems

Combining Macro-operators with Control Knowledge

Inductive Logic Programming

Quantified Score

Hi-index	0.00

Visualization

Abstract

As any other problem solving task that employs search, AI Planning needs heuristics to efficiently guide the problem-space exploration. Machine learning (ML) provides several techniques for automatically acquiring those heuristics. Usually, a planner solves a problem, and a ML technique generates knowledge from the search episode in terms of complete plans (macro-operators or cases), or heuristics (also named control knowledge in planning). In this paper, we present a novel way of generating planning heuristics: we learn heuristics in one planner and transfer them to another planner. This approach is based on the fact that different planners employ different search bias. We want to extract knowledge from the search performed by one planner and use the learned knowledge on another planner that uses a different search bias. The goal is to improve the efficiency of the second planner by capturing regularities of the domain that it would not capture by itself due to its bias. We employ a deductive learning method (EBL) that is able to automatically acquire control knowledge by generating bounded explanations of the problem-solving episodes in a Graphplan-based planner. Then, we transform the learned knowledge so that it can be used by a bidirectional planner.