Assisting Data Mining through Automated Planning

Authors:
Fernando Fernández;Daniel Borrajo;Susana Fernández;David Manzano
Affiliations:
Universidad Carlos III de Madrid, Leganés, Spain;Universidad Carlos III de Madrid, Leganés, Spain;Universidad Carlos III de Madrid, Leganés, Spain;Ericsson España, Madrid, Spain
Venue:
MLDM '09 Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition
Year:
2009

Citing 10
Cited 0

Automating Image Processing for Scientific Data Analysis of a Large Image Database

IEEE Transactions on Pattern Analysis and Machine Intelligence
Evaluation of a semi-autonomous assistant for exploratory data analysis

AGENTS '97 Proceedings of the first international conference on Autonomous agents
Data mining: practical machine learning tools and techniques with Java implementations

Data mining: practical machine learning tools and techniques with Java implementations
Learning-assisted automated planning: looking back, taking stock, going forward

AI Magazine
Toward Intelligent Assistance for a Data Mining Process: An Ontology-Based Approach for Cost-Sensitive Classification

IEEE Transactions on Knowledge and Data Engineering
IPSS: A Hybrid Approach to Planning and Scheduling Integration

IEEE Transactions on Knowledge and Data Engineering
PLTOOL: A knowledge engineering tool for planning and learning

The Knowledge Engineering Review
Using Cases Utility for Heuristic Planning Improvement

ICCBR '07 Proceedings of the 7th international conference on Case-Based Reasoning: Case-Based Reasoning Research and Development
Planning through stochastic local search and temporal action graphs in LPG

Journal of Artificial Intelligence Research
The metric-FF planning system: translating "Ignoring delete lists" to numeric state variables

Journal of Artificial Intelligence Research

Quantified Score

Hi-index	0.02

Visualization

Abstract

The induction of knowledge from a data set relies in the execution of multiple data mining actions: to apply filters to clean and select the data, to train different algorithms (clustering, classification, regression, association), to evaluate the results using different approaches (cross validation, statistical analysis), to visualize the results, etc. In a real data mining process, previous actions are executed several times, sometimes in a loop, until an accurate result is obtained. However, performing previous tasks requires a data mining engineer or expert which supervises the design and evaluate the whole process. The goal of this paper is to describe MOLE, an architecture to automatize the data mining process. The architecture assumes that the data mining process can be seen from a classical planning perspective, and hence, that classical planning tools can be used to design the process. MOLE is built and instantiated on the basis of i) standard languages to describe the data set and the data mining process; ii) available tools to design, execute and evaluate the data mining processes.