Pre-Processing Time Constraints for Efficiently Mining Generalized Sequential Patterns

  • Authors:
  • Florent Masseglia;Pascal Poncelet;Maguelonne Teisseire

  • Affiliations:
  • INRIA Sophia Antipolis;École des Mines dýAlès;LIRMM UMR CNRS

  • Venue:
  • TIME '04 Proceedings of the 11th International Symposium on Temporal Representation and Reasoning
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we consider the problem of discovering sequential patterns by handling time constraints. While sequential patterns could be seen as temporal relationships between facts embedded in the database, generalized sequential patterns aim at providing the end user with a more flexible handling of the transactions embedded in the database. We propose a new efficient algorithm, called GTC (Graph for Time Constraints) for mining such patterns in very large databases. It is based on the idea that handling time constraints in the earlier stage of the algorithm can be highly beneficial since it minimizes computational costs by preprocessing data sequences. Our test shows that the proposed algorithm performs significantly faster than a stateof- the-art sequence mining algorithm.