Pushing Convertible Constraints in Frequent Itemset Mining

  • Authors:
  • Jian Pei;Jiawei Han;Laks V. S. Lakshmanan

  • Affiliations:
  • University at Buffalo, The State University of New York, 201 Bell Hall, Buffalo, NY 14260-2000, USA. jianpei@cse.buffalo.edu;University of Illinois at Urbana-Champaign, 2123 DCL, 1304 West Springfield Avenue, Urbana, IL 61801, USA. hanj@cs.uiuc.edu;University of British Columbia, 201-2366 Main Mall, Vancouver, B.C. Canada V6T 1Z4. laks@cs.ubc.ca

  • Venue:
  • Data Mining and Knowledge Discovery
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Recent work has highlighted the importance of the constraint-based mining paradigm in the context of frequent itemsets, associations, correlations, sequential patterns, and many other interesting patterns in large databases. Constraint pushing techniques have been developed for mining frequent patterns and associations with antimonotonic, monotonic, and succinct constraints. In this paper, we study constraints which cannot be handled with existing theory and techniques in frequent pattern mining. For example, avg(S)θv, median(S)θv, sum(S)θv (S can contain items of arbitrary values, θ ∈ {, v is a real number.) are customarily regarded as “tough” constraints in that they cannot be pushed inside an algorithm such as Apriori. We develop a notion of convertible constraints and systematically analyze, classify, and characterize this class. We also develop techniques which enable them to be readily pushed deep inside the recently developed FP-growth algorithm for frequent itemset mining. Results from our detailed experiments show the effectiveness of the techniques developed.