The influence of global constraints on similarity measures for time-series databases

Authors:
Vladimir Kurbalija;Miloš Radovanović;Zoltan Geler;Mirjana Ivanović
Affiliations:
Department of Mathematics and Informatics, Faculty of Sciences, University of Novi Sad, Trg D. Obradovića 4, 21000 Novi Sad, Serbia;Department of Mathematics and Informatics, Faculty of Sciences, University of Novi Sad, Trg D. Obradovića 4, 21000 Novi Sad, Serbia;Faculty of Philosophy, University of Novi Sad, Dr Zorana inića 2, 21000 Novi Sad, Serbia;Department of Mathematics and Informatics, Faculty of Sciences, University of Novi Sad, Trg D. Obradovića 4, 21000 Novi Sad, Serbia
Venue:
Knowledge-Based Systems
Year:
2014

Citing 37
Cited 0

Fast subsequence matching in time-series databases

SIGMOD '94 Proceedings of the 1994 ACM SIGMOD international conference on Management of data
Locally adaptive dimensionality reduction for indexing large time series databases

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Efficient Similarity Search In Sequence Databases

FODO '93 Proceedings of the 4th International Conference on Foundations of Data Organization and Algorithms
Efficient Retrieval of Similar Time Sequences Under Time Warping

ICDE '98 Proceedings of the Fourteenth International Conference on Data Engineering
Finding Similar Time Series

PKDD '97 Proceedings of the First European Symposium on Principles of Data Mining and Knowledge Discovery
On Similarity Queries for Time-Series Data: Constraint Specification and Implementation

CP '95 Proceedings of the First International Conference on Principles and Practice of Constraint Programming
Inference for the Generalization Error

Machine Learning
Efficient Time Series Matching by Wavelets

ICDE '99 Proceedings of the 15th International Conference on Data Engineering
Visualizing Time-Series on Spirals

INFOVIS '01 Proceedings of the IEEE Symposium on Information Visualization 2001 (INFOVIS'01)
Discovering Similar Multidimensional Trajectories

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Visually mining and monitoring massive time series

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Exact indexing of dynamic time warping

Knowledge and Information Systems
A Multiresolution Symbolic Representation of Time Series

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Robust and fast similarity search for moving object trajectories

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Data Mining: Concepts and Techniques

Data Mining: Concepts and Techniques
Fast time series classification using numerosity reduction

ICML '06 Proceedings of the 23rd international conference on Machine learning
An efficient and accurate method for evaluating time series similarity

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Exact indexing of dynamic time warping

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
On the marriage of Lp-norms and edit distance

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Querying time-series streams

EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
Constructing High Dimensional Feature Space for Time Series Classification

PKDD 2007 Proceedings of the 11th European conference on Principles and Practice of Knowledge Discovery in Databases
TraClass: trajectory classification using hierarchical region-based and trajectory-based clustering

Proceedings of the VLDB Endowment
Querying and mining of time series data: experimental comparison of representations and distance measures

Proceedings of the VLDB Endowment
Case-based curve behaviour prediction

Software—Practice & Experience
Time Warp Edit Distance with Stiffness Adjustment for Time Series Matching

IEEE Transactions on Pattern Analysis and Machine Intelligence
A study of statistical techniques and performance measures for genetics-based machine learning: accuracy and interpretability

Soft Computing - A Fusion of Foundations, Methodologies and Applications
The WEKA data mining software: an update

ACM SIGKDD Explorations Newsletter
Clustering Trajectories of Moving Objects in an Uncertain World

ICDM '09 Proceedings of the 2009 Ninth IEEE International Conference on Data Mining
Anonymization of moving objects databases by clustering and perturbation

Information Systems
A framework for time-series analysis

AIMSA'10 Proceedings of the 14th international conference on Artificial intelligence: methodology, systems, and applications
Classification of pulse waveforms using edit distance with real penalty

EURASIP Journal on Advances in Signal Processing
Weighted dynamic time warping for time series classification

Pattern Recognition
Segment and combine approach for non-parametric time-series classification

PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
Intelligent system for time series classification using support vector machines applied to supply-chain

Expert Systems with Applications: An International Journal
Segment-Based Features for Time Series Classification

The Computer Journal
Large margin mixture of AR models for time series classification

Applied Soft Computing
Using derivatives in time series classification

Data Mining and Knowledge Discovery

Quantified Score

Hi-index	0.00

Visualization

Abstract

A time series consists of a series of values or events obtained over repeated measurements in time. Analysis of time series represents an important tool in many application areas, such as stock-market analysis, process and quality control, observation of natural phenomena, and medical diagnosis. A vital component in many types of time-series analyses is the choice of an appropriate distance/similarity measure. Numerous measures have been proposed to date, with the most successful ones based on dynamic programming. Being of quadratic time complexity, however, global constraints are often employed to limit the search space in the matrix during the dynamic programming procedure, in order to speed up computation. Furthermore, it has been reported that such constrained measures can also achieve better accuracy. In this paper, we investigate four representative time-series distance/similarity measures based on dynamic programming, namely Dynamic Time Warping (DTW), Longest Common Subsequence (LCS), Edit distance with Real Penalty (ERP) and Edit Distance on Real sequence (EDR), and the effects of global constraints on them when applied via the Sakoe-Chiba band. To better understand the influence of global constraints and provide deeper insight into their advantages and limitations we explore the change of the 1-nearest neighbor graph with respect to the change of the constraint size. Also, we examine how these changes reflect on the classes of the nearest neighbors of time series, and evaluate the performance of the 1-nearest neighbor classifier with respect to different distance measures and constraints. Since we determine that constraints introduce qualitative differences in all considered measures, and that different measures are affected by constraints in various ways, we expect our results to aid researchers and practitioners in selecting and tuning appropriate time-series similarity measures for their respective tasks.