Association Analysis with One Scan of Databases

  • Authors:
  • Hao Huang;Xindong Wu;Richard Relue

  • Affiliations:
  • -;-;-

  • Venue:
  • ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

Mining frequent patterns with an FP-tree avoids costlycandidate generation and repeatedly occurrence frequencychecking against the support threshold. It thereforeachieves better performance and efficiency than Apriori-likealgorithms. However, the database still needs tobe scanned twice to get the FP-tree. This can be verytime-consuming when new data are added to an existingdatabase because two scans may be needed for not only thenew data but also the existing data. This paper presentsa new data structure P-tree, Pattern Tree, and a new technique,which can get the P-tree through only one scan of thedatabase and can obtain the corresponding FP-tree with aspecified support threshold. Updating a P-tree with newdata needs one scan of the new data only, and the existingdata do not need to be re-scanned.