Parsimonious temporal aggregation

Authors:
Juozas Gordevičius;Johann Gamper;Michael Böhlen
Affiliations:
Institute of Mathematics and Informatics, Vilnius University, Vilnius, Lithuania;Free University of Bozen-Bolzano, Bolzano, Italy;Department of Informatics, University of Zurich, Zurich, Switzerland
Venue:
The VLDB Journal — The International Journal on Very Large Data Bases
Year:
2012

Citing 28
Cited 0

A temporal relational model and a query language

Information Sciences: an International Journal
A Review and Empirical Evaluation of Feature Weighting Methods for aClass of Lazy Learning Algorithms

Artificial Intelligence Review - Special issue on lazy learning
Locally adaptive dimensionality reduction for indexing large time series databases

ACM Transactions on Database Systems (TODS)
Wavelets for Computer Graphics: A Primer, Part 1

IEEE Computer Graphics and Applications
Aggregates in the Temporal Query Language TQuel

IEEE Transactions on Knowledge and Data Engineering
Efficient Similarity Search In Sequence Databases

FODO '93 Proceedings of the 4th International Conference on Foundations of Data Organization and Algorithms
HierarchyScan: A Hierarchical Similarity Search Algorithm for Databases of Long Sequences

ICDE '96 Proceedings of the Twelfth International Conference on Data Engineering
When Is ''Nearest Neighbor'' Meaningful?

ICDT '99 Proceedings of the 7th International Conference on Database Theory
Optimal Histograms with Quality Guarantees

VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Fast Time Sequence Indexing for Arbitrary Lp Norms

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Coalescing in Temporal Databases

VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
A Simple Dimensionality Reduction Technique for Fast Similarity Search in Large Time Series Databases

PADKK '00 Proceedings of the 4th Pacific-Asia Conference on Knowledge Discovery and Data Mining, Current Issues and New Applications
Computing Temporal Aggregates

ICDE '95 Proceedings of the Eleventh International Conference on Data Engineering
Efficient Algorithms for Large-Scale Temporal Aggregation

IEEE Transactions on Knowledge and Data Engineering
On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration

Data Mining and Knowledge Discovery
Incremental computation and maintenance of temporal aggregates

The VLDB Journal — The International Journal on Very Large Data Bases
Approximate Temporal Aggregation

ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Online Amnesic Approximation of Streaming Time Series

ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Indexing spatio-temporal trajectories with Chebyshev polynomials

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Spatiotemporal Aggregate Computation: A Survey

IEEE Transactions on Knowledge and Data Engineering
A time machine for text search

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Experiencing SAX: a novel symbolic representation of time series

Data Mining and Knowledge Discovery
Streaming Time Series Summarization Using User-Defined Amnesic Functions

IEEE Transactions on Knowledge and Data Engineering
iSAX: indexing and mining terabyte sized time series

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
A Greedy Approach Towards Parsimonious Temporal Aggregation

TIME '08 Proceedings of the 2008 15th International Symposium on Temporal Representation and Reasoning
Parsimonious temporal aggregation

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Online piece-wise linear approximation of numerical streams with precision guarantees

Proceedings of the VLDB Endowment
Multi-dimensional aggregation for temporal data

EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

Temporal aggregation is an important operation in temporal databases, and different variants thereof have been proposed. In this paper, we introduce a novel temporal aggregation operator, termed parsimonious temporal aggregation (PTA), that overcomes major limitations of existing approaches. PTA takes the result of instant temporal aggregation (ITA) of size n, which might be up to twice as large as the argument relation, and merges similar tuples until a given error ( $${\epsilon}$$ ) or size (c) bound is reached. The new operator is data-adaptive and allows the user to control the trade-off between the result size and the error introduced by merging. For the precise evaluation of PTA queries, we propose two dynamic programming---based algorithms for size- and error-bounded queries, respectively, with a worst-case complexity that is quadratic in n. We present two optimizations that take advantage of temporal gaps and different aggregation groups and achieve a linear runtime in experiments with real-world data. For the quick computation of an approximate PTA answer, we propose an efficient greedy merging strategy with a precision that is upper bounded by O(log n). We present two algorithms that implement this strategy and begin to merge as ITA tuples are produced. They require O(n log (c + β)) time and O(c + β) space, where β is the size of a read-ahead buffer and is typically very small. An empirical evaluation on real-world and synthetic data shows that PTA considerably reduces the size of the aggregation result, yet introducing only small errors. The greedy algorithms are scalable for large data sets and introduce less error than other approximation techniques.