Extensions for continuous pattern mining

Authors:
Marcin Gorawski;Pawel Jureczek
Affiliations:
Silesian University of Technology, Institute of Computer Science, Gliwice Poland and Wroclaw University of Technology, Institute of Computer Science, Wrocław, Poland;Silesian University of Technology, Institute of Computer Science, Gliwice Poland
Venue:
IDEAL'11 Proceedings of the 12th international conference on Intelligent data engineering and automated learning
Year:
2011

Citing 10
Cited 0

Mining frequent patterns without candidate generation

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
SPADE: an efficient algorithm for mining frequent sequences

Machine Learning
A Framework for Generating Network-Based Moving Objects

Geoinformatica
PrefixSpan: Mining Sequential Patterns by Prefix-Projected Growth

Proceedings of the 17th International Conference on Data Engineering
Efficiently Mining Maximal Frequent Itemsets

ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
Mining Access Patterns Efficiently from Web Logs

PADKK '00 Proceedings of the 4th Pacific-Asia Conference on Knowledge Discovery and Data Mining, Current Issues and New Applications
WUM - A Tool for WWW Ulitization Analysis

WebDB '98 Selected papers from the International Workshop on The World Wide Web and Databases
Sequential PAttern mining using a bitmap representation

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
BIDE: Efficient Mining of Frequent Closed Sequences

ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Efficient mining and prediction of user behavior patterns in mobile web systems

Information and Software Technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we present extensions for continuous pattern mining. Our previous continuous pattern mining algorithm mines the set of all frequent sequences satisfying the minSup condition. However, those sequences contain an explosive number of frequent subsequences, which makes the analysis and understanding of patterns very diæcult. In order to overcome these diæculties, we propose four new algorithms for mining maximal and closed continuous patterns. These algorithms return a superset of the result patterns and then a post-pruning algorithm is performed to eliminate redundant sequences. For each type of patterns (maximal or closed) two algorithms are presented (with and without some improvements). The key idea is to omit as many redundant sequences as possible during the exploration. The proposed algorithms allow one to reduce the size of the result set when input sequences have low uniqueness.