Adaptive similarity search in streaming time series with sliding windows

  • Authors:
  • Maria Kontaki;Apostolos N. Papadopoulos;Yannis Manolopoulos

  • Affiliations:
  • Data Engineering Research Laboratory, Department of Informatics, Aristotle University, 54124 Thessaloniki, Greece;Data Engineering Research Laboratory, Department of Informatics, Aristotle University, 54124 Thessaloniki, Greece;Data Engineering Research Laboratory, Department of Informatics, Aristotle University, 54124 Thessaloniki, Greece

  • Venue:
  • Data & Knowledge Engineering
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

The challenge in a database of evolving time series is to provide efficient algorithms and access methods for query processing, taking into consideration the fact that the database changes continuously as new data become available. Traditional access methods that continuously update the data are considered inappropriate, due to significant update costs. In this paper, we use the IDC-Index (Incremental DFT Computation - Index), an efficient technique for similarity query processing in streaming time series. The index is based on a multidimensional access method enhanced with a deferred update policy and an incremental computation of the Discrete Fourier Transform (DFT), which is used as a feature extraction method. We focus both on range and nearest-neighbor queries, since both types are frequently used in modern applications. An important characteristic of the proposed approach is its ability to adapt to the update frequency of the data streams. By using a simple heuristic approach, we manage to keep the update frequency at a specified level to guarantee efficiency. In order to investigate the efficiency of the proposed method, experiments have been performed for range queries and k-nearest-neighbor queries on real-life data sets. The proposed method manages to reduce the number of false alarms examined, achieving high answers vs. candidates ratio. Moreover, the results have shown that the new techniques exhibit consistently better performance in comparison to previously proposed approaches.