Speed-up Iterative Frequent Itemset Mining with Constraint Changes

Authors:
Gao Cong;Bing Liu
Affiliations:
-;-
Venue:
ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
Year:
2002

Citing 0
Cited 9

Sequential Association Rule Mining with Time Lags

Journal of Intelligent Information Systems
Go Green: Recycle and Reuse Frequent Patterns

ICDE '04 Proceedings of the 20th International Conference on Data Engineering
CFP-tree: A compact disk-based structure for storing and querying frequent itemsets

Information Systems
Empirical likelihood confidence intervals for differences between two datasets with missing data

Pattern Recognition Letters
GAM: a guidance enabled association mining environment

International Journal of Business Intelligence and Data Mining
A data mining proxy approach for efficient frequent itemset mining

The VLDB Journal — The International Journal on Very Large Data Bases
Estimating confidence intervals for structural differences between contrast groups with missing data

Expert Systems with Applications: An International Journal
Difference detection between two contrast sets

DaWaK'06 Proceedings of the 8th international conference on Data Warehousing and Knowledge Discovery
Mining frequent itemsets with dualistic constraints

PRICAI'12 Proceedings of the 12th Pacific Rim international conference on Trends in Artificial Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

Mining of frequent itemsets is a fundamental datamining task. Past research has proposed many efficientalgorithms for the purpose. Recent work also highlightedthe importance of using constraints to focus the miningprocess to mine only those relevant itemsets. In practice,data mining is often an interactive and iterative process.The user typically changes constraints and runs the miningalgorithm many times before satisfied with the finalresults. This interactive process is very time consuming.Existing mining algorithms are unable to take advantageof this iterative process to use previous mining results tospeed up the current mining process. This results inenormous waste in time and in computation. In this paper,we propose an efficient technique to utilize previousmining results to improve the efficiency of current miningwhen constraints are changed. We first introduce theconcept of tree boundary to summarize the usefulinformation available from previous mining. We then showthat the tree boundary provides an effective and efficientframework for the new mining. The proposed techniquehas been implemented in the contexts of two existingfrequent itemset mining algorithms, FP-tree and TreeProjection. Experiment results on both synthetic and real-lifedatasets show that the proposed approach achievesdramatic saving in computation.