Taxonomy-Driven Lumping for Sequence Mining

  • Authors:
  • Francesco Bonchi;Carlos Castillo;Debora Donato;Aristides Gionis

  • Affiliations:
  • Yahoo! Research, Barcelona, Spain 080018;Yahoo! Research, Barcelona, Spain 080018;Yahoo! Research, Barcelona, Spain 080018;Yahoo! Research, Barcelona, Spain 080018

  • Venue:
  • ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In many application domains, events are naturally organized in a hierarchy. Whether events describe human activities, system failures, coordinates in a trajectory, or biomedical phenomena, there is often a taxonomy that should be taken into consideration. A taxonomy allow us to represent the information at a more general description level, if we choose carefully the most suitable level of granularity. Given a taxonomy of events and a dataset of sequences of these events, we study the problem of finding efficient and effective ways to produce a compact representation of the sequences. This can be valuable by itself, or can be used to help solving other problems, such as clustering.