Exploratory mining in cube space

Authors:
Raghu Ramakrishnan;Bee-Chung Chen
Affiliations:
Department of Computer Science, University of Wisconsion, Mdison, USA 53706;Department of Computer Science, University of Wisconsion, Mdison, USA 53706
Venue:
Data Mining and Knowledge Discovery
Year:
2007

Citing 0
Cited 4

Bellwether analysis: Searching for cost-effective query-defined predictors in large databases

ACM Transactions on Knowledge Discovery from Data (TKDD)
Adversarial-knowledge dimensions in data privacy

The VLDB Journal — The International Journal on Very Large Data Bases
Region-based online promotion analysis

Proceedings of the 13th International Conference on Extending Database Technology
Toward boosting distributed association rule mining by data de-clustering

Information Sciences: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

Data Mining has evolved as a new discipline at the intersection of several existing areas, including Database Systems, Machine Learning, Optimization, and Statistics. An important question is whether the field has matured to the point where it has originated substantial new problems and techniques that distinguish it from its parent disciplines. In this paper, we discuss a class of new problems and techniques that show great promise for exploratory mining, while synthesizing and generalizing ideas from the parent disciplines. While the class of problems we discuss is broad, there is a common underlying objective--to look beyond a single data-mining step (e.g., data summarization or model construction) and address the combined process of data selection and transformation, parameter and algorithm selection, and model construction. The fundamental difficulty lies in the large space of alternative choices at each step, and good solutions must provide a natural framework for managing this complexity. We regard this as a grand challenge for Data Mining, and see the ideas discussed here as promising initial steps towards a rigorous exploratory framework that supports the entire process.