A pattern decomposition algorithm for data mining of frequent patterns

  • Authors:
  • Qinghua Zou;Wesley Chu;David Johnson;Henry Chiu

  • Affiliations:
  • Department of Computer Science, University of California at Los Angeles, CA;Department of Computer Science, University of California at Los Angeles, CA;UCLA Telemedicine, CA;IBM Almaden, San Jose, CA

  • Venue:
  • Knowledge and Information Systems
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

Efficient algorithms to mine frequent patterns are crucial to many tasks in data mining. Since the Apriori algorithm was proposed in 1994, there have been several methods proposed to improve its performance. However, most still adopt its candidate set generation-and-test approach. In addition, many methods do not generate all frequent patterns, making them inadequate to derive association rules. We propose a pattern decomposition (PD) algorithm that can significantly reduce the size of the dataset on each pass, making it more efficient to mine all frequent patterns in a large dataset. The proposed algorithm avoids the costly process of candidate set generation and saves time by reducing the size of the dataset. Our empirical evaluation shows that the algorithm outperforms Apriori by one order of magnitude and is faster than FP-tree algorithm.