Mining approximate motifs in time series

  • Authors:
  • Pedro G. Ferreira;Paulo J. Azevedo;Cândida G. Silva;Rui M. M. Brito

  • Affiliations:
  • Department of Informatics, University of Minho, Braga, Portugal;Department of Informatics, University of Minho, Braga, Portugal;Chemistry Department, Faculty of Sciences and Technology, and Centre of Neurosciences of Coimbra, University of Coimbra, Coimbra, Portugal;Chemistry Department, Faculty of Sciences and Technology, and Centre of Neurosciences of Coimbra, University of Coimbra, Coimbra, Portugal

  • Venue:
  • DS'06 Proceedings of the 9th international conference on Discovery Science
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

The problem of discovering previously unknown frequent patterns in time series, also called motifs, has been recently introduced. A motif is a subseries pattern that appears a significant number of times. Results demonstrate that motifs may provide valuable insights about the data and have a wide range of applications in data mining tasks. The main motivation for this study was the need to mine time series data from protein folding/unfolding simulations. We propose an algorithm that extracts approximate motifs, i.e. motifs that capture portions of time series with a similar and eventually symmetric behavior. Preliminary results on the analysis of protein unfolding data support this proposal as a valuable tool. Additional experiments demonstrate that the application of utility of our algorithm is not limited to this particular problem. Rather it can be an interesting tool to be applied in many real world problems.