Mining Top.K Frequent Closed Patterns without Minimum Support

  • Authors:
  • Jiawei Han;Jianyong Wang;Ying Lu;Petre Tzvetkov

  • Affiliations:
  • -;-;-;-

  • Venue:
  • ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
  • Year:
  • 2002

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper, we propose a new mining task: mining top-kfrequent closed patterns of length no less than min_l, wherek is the desired number of frequent closed patterns to bemined, and min _l is the minimal length of each pattern.An efficient algorithm, called TFP, is developed for mining such patterns without minimum support. Two methods, closed_node_count and descendant_sum are proposedto effiectively raise support threshold and prune FP-tree bothduring and after the construction of FP-tree. During themining process, a novel top-down and bottom-up combinedFP-tree mining strategy is developed to speed-up support-raising and closed frequent pattern discovering. In addition,a fast hash-based closed pattern verification scheme has beenemployed to check efficiently if a potential closed pattern isreally closed.Our performance study shows that in most cases, TFPoutperforms CLOSET and CHARM, two efficient frequentclosed pattern mining algorithms, even when both are running with the best tuned min_support. Furthermore, themethod can be extended to generate association rules andto incorporate user-specified constraints. Thus we concludethat for frequent pattern mining, mining top-k frequent closedpatterns without min support is more preferable than thetraditional min_support-based mining.