Upper Bound on the Length of Generalized Disjunction-Free Patterns

  • Authors: Marzena Kryszkiewicz
  • Affiliations: Warsaw University of Technology, Poland
  • Venue: SSDBM '04 Proceedings of the 16th International Conference on Scientific and Statistical Database Management
  • Year: 2004

Abstract

A number of lossless representations of frequent patterns were proposed in recent years. The representation that consists of all frequent closed itemsets and the representations based on generalized disjunction-free patterns or on non-derivable itemsets have proven to be the most concise ones. Experiments further show that the latter are a few orders of magnitude more concise (and determinable) than the former. Experiments also show that the representations based on generalized disjunction-free patterns are more concise than the representations of frequent patterns available in the literature that determine the supports of patterns only approximately. In this paper, we provide an upper bound on the length of generalized disjunction-free patterns. The bound determines the maximum number of database scans carried out by Apriori-like algorithms that discover the representations based on generalized disjunction-free patterns.