For more than a decade, time series similarity search has received a great deal of attention from data mining researchers, and many time series representations and distance measures have been proposed as a result. However, most existing work on time series similarity search relies on shape-based similarity matching. While some of these approaches work well for short time series, they typically fail to produce satisfactory results when the sequences are long. For long sequences, it is more appropriate to measure similarity based on higher-level structures. In this work, we present a histogram-based representation for time series data, similar to the "bag of words" approach that is widely accepted in the text mining and information retrieval communities. Extensive experiments show that our approach outperforms the leading existing methods in clustering, classification, and anomaly detection on dozens of real datasets. We further demonstrate that the representation allows rotation-invariant matching in shape datasets.
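To make the idea of a histogram-based, bag-of-words-style representation concrete, the following is a minimal sketch, not the authors' implementation. It assumes that "words" are obtained by sliding a window over the series and discretizing each window with a SAX-like scheme; the abstract does not state this. The window length, word length, four-letter alphabet, the skipping of consecutive repeated words, and all function names are illustrative assumptions.

```python
from collections import Counter
from statistics import mean, pstdev
import math

# Assumption: 4-letter alphabet; these breakpoints split a standard
# normal distribution into four equiprobable regions.
BREAKPOINTS = (-0.6745, 0.0, 0.6745)
ALPHABET = "abcd"


def znorm(window):
    """Z-normalize a window; a (near-)constant window maps to all zeros."""
    mu = mean(window)
    sigma = pstdev(window)
    if sigma < 1e-8:
        return [0.0] * len(window)
    return [(x - mu) / sigma for x in window]


def sax_word(window, word_len):
    """Discretize one window into a word of `word_len` letters.

    The window is split into `word_len` segments (requires
    len(window) >= word_len); each segment mean is mapped to a letter.
    """
    z = znorm(window)
    n = len(z)
    letters = []
    for i in range(word_len):
        lo = i * n // word_len
        hi = (i + 1) * n // word_len
        seg_mean = mean(z[lo:hi])
        # Index = number of breakpoints lying below the segment mean.
        idx = sum(seg_mean > b for b in BREAKPOINTS)
        letters.append(ALPHABET[idx])
    return "".join(letters)


def bag_of_patterns(series, window=8, word_len=4):
    """Histogram of words over all sliding windows of the series."""
    bag = Counter()
    prev = None
    for start in range(len(series) - window + 1):
        word = sax_word(series[start:start + window], word_len)
        # Assumption: count a run of identical consecutive words once,
        # so slowly varying regions do not dominate the histogram.
        if word != prev:
            bag[word] += 1
        prev = word
    return bag


def euclidean(bag_a, bag_b):
    """Euclidean distance between two word histograms."""
    keys = set(bag_a) | set(bag_b)
    return math.sqrt(sum((bag_a[k] - bag_b[k]) ** 2 for k in keys))
```

Because the histogram discards where in the series each word occurred, two long series can be compared by which local patterns they contain and how often, rather than by point-to-point alignment. Under these assumptions, this is also why such a representation is largely insensitive to shifting the starting point of a series.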