Continually evaluating similarity-based pattern queries on a streaming time series

Authors:
Like Gao;X. Sean Wang
Affiliations:
George Mason University, Fairfax, VA;George Mason University, Fairfax, VA
Venue:
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Year:
2002

Abstract

In many applications, local or remote sensors send in streams of data, and the system must monitor the streams to discover relevant events or patterns and react to them immediately. An important scenario is one in which the incoming stream is a continually appended time series and the patterns are time series stored in a database. Each time a new value arrives (at a point called a time position), the system needs to find, in the database, the nearest or near neighbors of the incoming time series up to that time position. This paper attacks the problem by using the Fast Fourier Transform (FFT) to efficiently compute the cross correlations of time series, which yields, in batch mode, the nearest and near neighbors of the incoming time series at many time positions. To exploit this batch processing while still achieving fast response times, the paper uses prediction methods to predict future values. FFT is used to compute the cross correlations between the predicted series (together with the values that have already arrived) and the database patterns, and thereby to obtain predicted distances between the incoming time series at many future time positions and the database patterns. When the actual data value arrives, the prediction error together with the predicted distances is used to filter out patterns that cannot be the nearest or near neighbors, which provides fast responses. Experiments show that with reasonable prediction errors, the performance gain is significant.
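
The batch step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes Euclidean distance over the most recent m values of the stream (the abstract does not name the distance measure), and the function names fft, sliding_dot, batch_distances, and can_prune are hypothetical. The sketch relies on the identity ||w - p||^2 = ||w||^2 + ||p||^2 - 2 (w . p), where the dot products w . p for all windows w are exactly the cross correlations that one FFT pass yields.

```python
import cmath
import math

def fft(a, invert=False):
    """Recursive radix-2 FFT (unnormalized); len(a) must be a power of two."""
    n = len(a)
    if n == 1:
        return list(a)
    even = fft(a[0::2], invert)
    odd = fft(a[1::2], invert)
    sign = 1 if invert else -1
    out = [0j] * n
    for k in range(n // 2):
        w = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + w
        out[k + n // 2] = even[k] - w
    return out

def sliding_dot(stream, pattern):
    """Cross correlation: dot product of pattern with every window of stream."""
    n, m = len(stream), len(pattern)
    size = 1
    while size < n + m:          # zero-pad so circular convolution is linear
        size *= 2
    fx = fft(list(stream) + [0.0] * (size - n))
    fp = fft(list(reversed(pattern)) + [0.0] * (size - m))
    prod = fft([a * b for a, b in zip(fx, fp)], invert=True)
    # Entry m-1+s of the convolution is the dot product for the window starting at s.
    return [prod[m - 1 + s].real / size for s in range(n - m + 1)]

def batch_distances(stream, pattern):
    """Euclidean distance from pattern to every length-m window of stream."""
    m = len(pattern)
    dots = sliding_dot(stream, pattern)
    pattern_energy = sum(v * v for v in pattern)
    squares = [v * v for v in stream]
    window_energy = sum(squares[:m])
    distances = []
    for s, dot in enumerate(dots):
        if s > 0:                # slide the running sum of squares by one value
            window_energy += squares[s + m - 1] - squares[s - 1]
        # Clamp at zero to absorb floating-point round-off before the square root.
        distances.append(math.sqrt(max(0.0, window_energy + pattern_energy - 2 * dot)))
    return distances

def can_prune(predicted_distance, prediction_error, eps):
    """True if the pattern cannot be within eps of the actual window.

    Assumes prediction_error is the Euclidean norm of (actual - predicted)
    over the window, so the triangle inequality gives
    actual_distance >= predicted_distance - prediction_error.
    """
    return predicted_distance - prediction_error > eps
```

In this sketch, batch_distances would be run on the arrived values followed by predicted ones, giving predicted distances for several future time positions at once. When an actual value arrives, can_prune applies the triangle-inequality lower bound to discard patterns without recomputing their distances; the patterns that survive still need an exact check.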