Time series symbolization and search for frequent patterns

  • Authors:
  • Mai Van Hoan;Matthieu Exbrayat

  • Affiliations:
  • University of Information and Communication Technology, Thai Nguyen, Viet Nam;University of Orleans, Orleans Cedex, France

  • Venue:
  • Proceedings of the Fourth Symposium on Information and Communication Technology
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we focus on two aspects of time series mining: first on the transformation of numerical data to symbolic data; then on the search for frequent patterns in the resulting symbolic time series. We are thus interested in some patterns which have a high frequency in our database of time series and might help to generate candidates for various tasks in the area of time series mining. During the symbolization phase, we transform the numerical time series into a symbolic time series by i) splitting this latter into consecutive subsequences, ii) using a clustering algorithm to cluster these subsequences, each subsequence being then replaced by the name of its cluster to produce the symbolic time series. In the second phase, we use a sliding window to create a collection of transactions from the symbolic time series, then we use some algorithm for mining sequential pattern to find out some interesting motifs in the original time series. An example experiment based on environmental data is presented.