The hows, whys, and whens of constraints in itemset and rule discovery

Authors:
Roberto J. Bayardo
Affiliations:
IBM Almaden Research Center
Venue:
Proceedings of the 2004 European conference on Constraint-Based Mining and Inductive Databases
Year:
2004

Citing 18
Cited 10

Mining association rules between sets of items in large databases

SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
Dynamic itemset counting and implication rules for market basket data

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
The process of knowledge discovery in databases

Advances in knowledge discovery and data mining
Fast discovery of association rules

Advances in knowledge discovery and data mining
Exploratory mining and pruning optimizations of constrained associations rules

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Efficiently mining long patterns from databases

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Efficient mining of association rules using closed itemset lattices

Information Systems
Mining the most interesting rules

KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Using a knowledge cache for interactive discovery of association rules

KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Interestingness via what is not interesting

KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Generating non-redundant association rules

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Is pushing constraints deeply into the mining algorithms really what we want?: an alternative approach for association rule mining

ACM SIGKDD Explorations Newsletter
Approximation of Frequency Queris by Means of Free-Sets

PKDD '00 Proceedings of the 4th European Conference on Principles of Data Mining and Knowledge Discovery
Using Condensed Representations for Interactive Association Rule Mining

PKDD '02 Proceedings of the 6th European Conference on Principles of Data Mining and Knowledge Discovery
DualMiner: a dual-pruning algorithm for itemsets with constraints

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Constraint-Based Rule Mining in Large, Dense Databases

ICDE '99 Proceedings of the 15th International Conference on Data Engineering
A New Algorithm for Generating Prime Implicants

IEEE Transactions on Computers
OPUS: an efficient admissible algorithm for unordered search

Journal of Artificial Intelligence Research

Combining linguistic and structural descriptors for mining biomedical literature

Proceedings of the 2006 ACM symposium on Document engineering
Soft constraint based pattern mining

Data & Knowledge Engineering
Mining constraint-based patterns using automatic relaxation

Intelligent Data Analysis
Exploring ant-based algorithms for gene expression data analysis

Artificial Intelligence in Medicine
Extending the soft constraint based mining paradigm

KDID'06 Proceedings of the 5th international conference on Knowledge discovery in inductive databases
Feature construction based on closedness properties is not that simple

PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Using trees to mine multirelational databases

Data Mining and Knowledge Discovery
Mining frequent δ-free patterns in large databases

DS'05 Proceedings of the 8th international conference on Discovery Science
Exploiting virtual patterns for automatically pruning the search space

KDID'05 Proceedings of the 4th international conference on Knowledge Discovery in Inductive Databases
Pushing constraints into data streams

Proceedings of the 2nd International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

Many researchers in our community (this author included) regularly emphasize the role constraints play in improving performance of data-mining algorithms. This emphasis has led to remarkable progress – current algorithms allow an incredibly rich and varied set of hidden patterns to be efficiently elicited from massive datasets, even under the burden of NP-hard problem definitions and disk-resident or distributed data. But this progress has come at a cost. In our single-minded drive towards maximum performance, we have often neglected and in fact hindered the important role of discovery in the knowledge discovery and data-mining (KDD) process. In this paper, I propose various strategies for applying constraints within algorithms for itemset and rule mining in order to escape this pitfall.