Compressed disjunction-free pattern representation versus essential pattern representation

  • Authors:
  • Marzena Kryszkiewicz

  • Affiliations:
  • Institute of Computer Science, Warsaw University of Technology, Warsaw, Poland

  • Venue:
  • IDEAL'09 Proceedings of the 10th international conference on Intelligent data engineering and automated learning
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

The discovery of frequent patterns is one of the most important issues in the data mining area. A major difficulty concerning frequent patterns is huge amount of discovered patterns. The problem can be solved or at least significantly alleviated by applying concise representations of frequent patterns. A number of most concise representations use generalized disjunctive rules for reasoning about patterns. Recently, the representation based on essential patterns has been introduced, but was not confronted with the representations using generalized disjunctive rules. In this paper, we 1) prove that essential patterns with at least two elements can be defined equivalently in terms of generalized disjunctive rules of a particular subtype and that singleton patterns are essential if their supports do not equal 0, 2) identify the relationship between compressed disjunction-free patterns and essential ones, 3) propose new lossless representation E-CDFR of frequent patterns that is primarily based on compressed disjunction-free patterns and uses generalized disjunctive rules to reason about other patterns, 4) prove that the new representation is never less concise than the representation based on essential patterns.