Time series shapelets: a new primitive for data mining

Authors:
Lexiang Ye;Eamonn Keogh
Affiliations:
University of California, Riverside, Riverside, CA, USA;University of California, Riverside, Riverside, CA, USA
Venue:
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Year:
2009

Citing 6
Cited 29

On Comparing Classifiers: Pitfalls toAvoid and a Recommended Approach

Data Mining and Knowledge Discovery
On the need for time series data mining benchmarks: a survey and empirical demonstration

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Probabilistic discovery of time series motifs

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Interval and dynamic time warping-based decision trees

Proceedings of the 2004 ACM symposium on Applied computing
Fast time series classification using numerosity reduction

ICML '06 Proceedings of the 23rd international conference on Machine learning
LB_Keogh supports exact indexing of shapes under rotation invariance with arbitrary representations and distance measures

VLDB '06 Proceedings of the 32nd international conference on Very large data bases

Distortion-free predictive streaming time-series matching

Information Sciences: an International Journal
Approximate variable-length time series motif discovery using grammar inference

Proceedings of the Tenth International Workshop on Multimedia Data Mining
Where are you heading, metric access methods?: a provocative survey

Proceedings of the Third International Conference on SImilarity Search and APplications
A brief survey on sequence classification

ACM SIGKDD Explorations Newsletter
NDPMine: efficiently mining discriminative numerical features for pattern-based classification

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part II
Identifying predictive multi-dimensional time series motifs: an application to severe weather prediction

Data Mining and Knowledge Discovery
Time series shapelets: a novel technique that allows accurate, interpretable and fast classification

Data Mining and Knowledge Discovery
Dynamic time warping constraint learning for large margin nearest neighbor classification

Information Sciences: an International Journal
A case-study on learning from large-scale intracranial EEG data using multi-core machines and clusters

Proceedings of the Third Workshop on Large Scale Data Mining: Theory and Applications
Logical-shapelets: an expressive primitive for time series classification

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Mining significant time intervals for relationship detection

SSTD'11 Proceedings of the 12th international conference on Advances in spatial and temporal databases
Stess@Work: from measuring stress to its understanding, prediction and handling with personalized coaching

Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium
Review: Plant species identification using digital morphometrics: A review

Expert Systems with Applications: An International Journal
Similarity measure based on piecewise linear approximation and derivative dynamic time warping for time series mining

Expert Systems with Applications: An International Journal
Searching and mining trillions of time series subsequences under dynamic time warping

Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
A shapelet transform for time series classification

Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Alternative quality measures for time series shapelets

IDEAL'12 Proceedings of the 13th international conference on Intelligent Data Engineering and Automated Learning
Time-series data mining

ACM Computing Surveys (CSUR)
Rotation-invariant similarity in time series using bag-of-patterns representation

Journal of Intelligent Information Systems
Classifying plant leaves from their margins using dynamic time warping

ACIVS'12 Proceedings of the 14th international conference on Advanced Concepts for Intelligent Vision Systems
Time series visualization based on shape features

Knowledge-Based Systems
A time series forest for classification and feature extraction

Information Sciences: an International Journal
Addressing Big Data Time Series: Mining Trillions of Time Series Subsequences Under Dynamic Time Warping

ACM Transactions on Knowledge Discovery from Data (TKDD) - Special Issue on ACM SIGKDD 2012
Early prediction on imbalanced multivariate time series

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Data mining a trillion time series subsequences under dynamic time warping

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Discovering common motifs in cursor movement data for improving web search

Proceedings of the 7th ACM international conference on Web search and data mining
Silhouette-based human action recognition using SAX-Shapes

The Visual Computer: International Journal of Computer Graphics
Enhancing understanding and improving prediction of severe weather through spatiotemporal relational learning

Machine Learning
Classification of time series by shapelet transformation

Data Mining and Knowledge Discovery

Quantified Score

Hi-index	0.01

Visualization

Abstract

Classification of time series has been attracting great interest over the past decade. Recent empirical evidence has strongly suggested that the simple nearest neighbor algorithm is very difficult to beat for most time series problems. While this may be considered good news, given the simplicity of implementing the nearest neighbor algorithm, there are some negative consequences of this. First, the nearest neighbor algorithm requires storing and searching the entire dataset, resulting in a time and space complexity that limits its applicability, especially on resource-limited sensors. Second, beyond mere classification accuracy, we often wish to gain some insight into the data. In this work we introduce a new time series primitive, time series shapelets, which addresses these limitations. Informally, shapelets are time series subsequences which are in some sense maximally representative of a class. As we shall show with extensive empirical evaluations in diverse domains, algorithms based on the time series shapelet primitives can be interpretable, more accurate and significantly faster than state-of-the-art classifiers.