Quantizing time series for efficient similarity search under time warping

  • Authors:
  • Inés F. Vega-López;Bongki Moon

  • Affiliations:
  • School of Informatics, Autonomous University of Sinaloa, Culiacán, Sinaloa, México;Department of Computer Science, University of Arizona, Tucson, AZ

  • Venue:
  • ACST'06 Proceedings of the 2nd IASTED international conference on Advances in computer science and technology
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Indexing Time Series Data is an interesting problem that has attracted much interest in the research community for the last decade. Traditional indexing methods organize the data space using different metrics. For time series, however, there are some cases when a metric is not suited for properly assessing the similarity between sequences. For instance, to detect similarities between sequences that are locally out of phase Dynamic Time Warping (DTW) must be used. DTW is not a metric as it does not satisfy the triangular inequality. Therefore, traditional spatial access methods cannot be used without introducing false dismissals. In such cases, alternative methods for organizing and searching time series data must be proposed. In this paper we propose the use of quantization to generate small and homogeneous representations of time series. We compute upper- and lower-bounds on the DTW distance to a query sequence using this quantized representation to filter-out sequences that cannot be a best match for the query. In the proposed approach, efficient search is achieved by organizing the quantized representation of data in a linear array that can be efficiently read from disk. The computational cost of processing the query is shadowed by the IO cost required to scan the file containing the linear array and it does affect the total query cost.