From Path Tree To Frequent Patterns: A Framework for Mining Frequent Patterns

  • Authors:
  • Yabo Xu;Jeffrey Xu Yu;Guimei Liu;Hongjun Lu

  • Affiliations:
  • -;-;-;-

  • Venue:
  • ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we propose a new framework for miningfrequent patterns from large transactional databases. Thecore of the framework is of a novel coded prefix-path treewith two representations, namely, a memory-based prefix-pathtree and a disk-based prefix-path tree. The disk-basedprefix-path tree is simple in its data structure yet rich ininformation contained, and is small in size. The memory-basedprefix-path tree is simple and compact. Upon thememory-based prefix-path tree, a new depth-first frequentpattern discovery algorithm, called P P-Mine, is proposedin this paper that outperforms FP-growth significantly. Thememory-based prefix-path tree can be stored on disk usinga disk-based prefix-path tree with assistance of the new codingscheme. We present efficient loading algorithms to loadthe minimal required disk-based prefix-path tree into mainmemory. Our technique is to push constraints into the loadingprocess, which has not been well studied yet.