An Efficient Algorithm for Mining Large Item Sets

  • Authors:
  • Hong-Zhen Zheng;Dian-Hui Chu;De-Chen Zhan;Xiao-Fei Xu

  • Affiliations:
  • -;-;-;-

  • Venue:
  • FSKD '08 Proceedings of the 2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery - Volume 02
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

It propose Online Mining Algorithm ( OMA) which online discover large item sets. Without pre-setting a default threshold, the OMA algorithm achieves its efficiency and threshold-flexibility by calculating item-sets’ counts. It is unnecessary and independent of the default threshold and can flexibly adapt to any user’s input threshold. In addition, we propose Cluster-Based Association Rule Algorithm (CARA) creates cluster tables to aid discovery of large item sets. It only requires a single scan of the database, followed by contrasts with the partial cluster tables. It not only prunes considerable amounts of data reducing the time needed to perform data scans and requiring less contrast, but also ensures the correctness of the mined results. By using the CARA algorithm to create cluster tables in advance, each CPU can be utilized to process a cluster table; thus large item sets can be immediately mined even when the database is very large.