A condensed representation to find frequent patterns

  • Authors:
  • Artur Bykowski;Christophe Rigotti

  • Affiliations:
  • Laboratoire d'Ingénierie des Systèmes d'Information, INSA Lyon, Bâtiment 501, F-69621 Villeurbanne Cedex, France;Laboratoire d'Ingénierie des Systèmes d'Information, INSA Lyon, Bâtiment 501, F-69621 Villeurbanne Cedex, France

  • Venue:
  • PODS '01 Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

Given a large set of data, a common data mining problem is to extract the frequent patterns occurring in this set. The idea presented in this paper is to extract a condensed representation of the frequent patterns called disjunction-free sets, instead of extracting the whole frequent pattern collection. We show that this condensed representation can be used to regenerate all frequent patterns and their exact frequencies. Moreover, this regeneration can be performed without any access to the original data. Practical experiments show that this representation can be extracted very efficiently even in difficult cases. We compared it with another representation of frequent patterns previously investigated in the literature called frequent closed sets. In nearly all experiments we have run, the disjunction-free sets have been extracted much more efficiently than frequent closed sets.