Extensions for continuous pattern mining

  • Authors:
  • Marcin Gorawski; Pawel Jureczek

  • Affiliations:
  • Silesian University of Technology, Institute of Computer Science, Gliwice, Poland and Wroclaw University of Technology, Institute of Computer Science, Wrocław, Poland; Silesian University of Technology, Institute of Computer Science, Gliwice, Poland

  • Venue:
  • IDEAL'11: Proceedings of the 12th International Conference on Intelligent Data Engineering and Automated Learning
  • Year:
  • 2011

Abstract

In this paper we present extensions for continuous pattern mining. Our previous continuous pattern mining algorithm mines the set of all frequent sequences satisfying the minSup condition. However, those sequences contain an explosive number of frequent subsequences, which makes the analysis and understanding of patterns very difficult. To overcome these difficulties, we propose four new algorithms for mining maximal and closed continuous patterns. These algorithms return a superset of the result patterns, and a post-pruning algorithm then eliminates redundant sequences. For each type of pattern (maximal or closed), two algorithms are presented (with and without some improvements). The key idea is to omit as many redundant sequences as possible during the exploration. The proposed algorithms allow one to reduce the size of the result set when input sequences have low uniqueness.
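
The post-pruning step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes that a continuous pattern is a contiguous subsequence, that a pattern is maximal when no other candidate contains it, and that it is closed when no other candidate with the same support contains it. The function names and the toy candidate set are hypothetical.

```python
from typing import Dict, Tuple

Pattern = Tuple[str, ...]


def is_contiguous_subsequence(short: Pattern, long: Pattern) -> bool:
    """Return True if `short` occurs in `long` as an unbroken run of items."""
    n, m = len(short), len(long)
    return any(long[i:i + n] == short for i in range(m - n + 1))


def prune_maximal(candidates: Dict[Pattern, int]) -> Dict[Pattern, int]:
    """Keep patterns that are not contained in any other candidate (assumed definition)."""
    return {
        p: s
        for p, s in candidates.items()
        if not any(q != p and is_contiguous_subsequence(p, q) for q in candidates)
    }


def prune_closed(candidates: Dict[Pattern, int]) -> Dict[Pattern, int]:
    """Keep patterns with no containing candidate of equal support (assumed definition)."""
    return {
        p: s
        for p, s in candidates.items()
        if not any(
            q != p and candidates[q] == s and is_contiguous_subsequence(p, q)
            for q in candidates
        )
    }


# Hypothetical superset of result patterns, mapped to their support counts.
candidates: Dict[Pattern, int] = {
    ("a",): 3,
    ("b",): 2,
    ("c",): 2,
    ("a", "b"): 2,
    ("b", "c"): 2,
    ("a", "b", "c"): 2,
}

maximal = prune_maximal(candidates)
closed = prune_closed(candidates)
print(maximal)
print(closed)
```

In this toy input every shorter pattern is contained in `("a", "b", "c")`, so only that pattern is maximal, while `("a",)` also survives as closed because its support is higher than that of every pattern containing it. The pairwise comparison is quadratic in the number of candidates, which is why omitting redundant sequences during exploration, as the abstract proposes, matters in practice.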