Condensed Representation of Sequential Patterns According to Frequency-Based Measures

  • Authors:
  • Marc Plantevit;Bruno Crémilleux

  • Affiliations:
  • GREYC-CNRS-UMR 6072, Université de Caen Basse-Normandie, Caen Cedex, France 14032;GREYC-CNRS-UMR 6072, Université de Caen Basse-Normandie, Caen Cedex, France 14032

  • Venue:
  • IDA '09 Proceedings of the 8th International Symposium on Intelligent Data Analysis: Advances in Intelligent Data Analysis VIII
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Condensed representations of patterns are at the core of many data mining works and there are a lot of contributions handling data described by items. In this paper, we tackle sequential data and we define an exact condensed representation for sequential patterns according to the frequency-based measures. These measures are often used, typically in order to evaluate classification rules. Furthermore, we show how to infer the best patterns according to these measures, i.e., the patterns which maximize them. These patterns are immediately obtained from the condensed representation so that this approach is easily usable in practice. Experiments conducted on various datasets demonstrate the feasibility and the interest of our approach.