Efficient temporal counting with bounded error

Authors:
Yufei Tao;Xiaokui Xiao
Affiliations:
Department of Computer Science and Engineering, Chinese University of Hong Kong, New Territories, Hong Kong;Department of Computer Science and Engineering, Chinese University of Hong Kong, New Territories, Hong Kong
Venue:
The VLDB Journal — The International Journal on Very Large Data Bases
Year:
2008

Citing 39
Cited 2

Functional approach to data structures and its use in multidimensional searching

SIAM Journal on Computing
The R*-tree: an efficient and robust access method for points and rectangles

SIGMOD '90 Proceedings of the 1990 ACM SIGMOD international conference on Management of data
Range queries in OLAP data cubes

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Fast discovery of association rules

Advances in knowledge discovery and data mining
Computational geometry: algorithms and applications

Computational geometry: algorithms and applications
Comparison of access methods for time-evolving data

ACM Computing Surveys (CSUR)
On temporal aggregate processing based on time points

Information Processing Letters
Privacy-preserving data mining

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Efficient computation of temporal aggregates with range predicates

PODS '01 Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Space-efficient online computation of quantile summaries

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Progressive approximate aggregate queries with a multi-resolution tree structure

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Temporal statement modifiers

ACM Transactions on Database Systems (TODS)
Efficient aggregation over objects with extent

Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Efficient integration and aggregation of historical information

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
R-trees: a dynamic index structure for spatial searching

SIGMOD '84 Proceedings of the 1984 ACM SIGMOD international conference on Management of data
An Efficient Multiversion Access Structure

IEEE Transactions on Knowledge and Data Engineering
Temporal Aggregation over Data Streams Using Multiple Granularities

EDBT '02 Proceedings of the 8th International Conference on Extending Database Technology: Advances in Database Technology
Aggregate Processing of Planar Points

EDBT '02 Proceedings of the 8th International Conference on Extending Database Technology: Advances in Database Technology
CRB-Tree: An Efficient Indexing Scheme for Range-Aggregate Queries

ICDT '03 Proceedings of the 9th International Conference on Database Theory
The MD-join: An Operator for Complex OLAP

Proceedings of the 17th International Conference on Data Engineering
Dynamic Update Cube for Range-sum Queries

Proceedings of the 27th International Conference on Very Large Data Bases
Efficient OLAP Operations in Spatial Data Warehouses

SSTD '01 Proceedings of the 7th International Symposium on Advances in Spatial and Temporal Databases
Computing Temporal Aggregates

ICDE '95 Proceedings of the Eleventh International Conference on Data Engineering
An asymptotically optimal multiversion B-tree

The VLDB Journal — The International Journal on Very Large Data Bases
Efficient Algorithms for Large-Scale Temporal Aggregation

IEEE Transactions on Knowledge and Data Engineering
Parallel Algorithms for Computing Temporal Aggregates

ICDE '99 Proceedings of the 15th International Conference on Data Engineering
Relative Prefix Sums: An Efficient Approach for Querying Dynamic OLAP Data Cubes

ICDE '99 Proceedings of the 15th International Conference on Data Engineering
Analyzing Range Queries on Spatial Data

ICDE '00 Proceedings of the 16th International Conference on Data Engineering
Exploring Spatial Datasets with Histograms

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Incremental computation and maintenance of temporal aggregates

The VLDB Journal — The International Journal on Very Large Data Bases
Approximate Temporal Aggregation

ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Continuously Maintaining Quantile Summaries of the Most Recent N Elements over a Data Stream

ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Online maintenance of very large random samples

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
I/O-efficient dynamic planar point location

Computational Geometry: Theory and Applications
Spatiotemporal Aggregate Computation: A Survey

IEEE Transactions on Knowledge and Data Engineering
Approximate counts and quantiles over sliding windows

PODS '04 Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Optimizing spatial Min/Max aggregations

The VLDB Journal — The International Journal on Very Large Data Bases
Approximate Processing of Massive Continuous Quantile Queries over High-Speed Data Streams

IEEE Transactions on Knowledge and Data Engineering
Multi-dimensional aggregation for temporal data

EDBT'06 Proceedings of the 10th international conference on Advances in Database Technology

On computing temporal aggregates with range predicates

ACM Transactions on Database Systems (TODS)
Ranking large temporal data

Proceedings of the VLDB Endowment

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper studies aggregate search in transaction time databases. Specifically, each object in such a database can be modeled as a horizontal segment, whose y-projection is its search key, and its x-projection represents the period when the key was valid in history. Given a query timestamp q t and a key range $$\vec{q_k}$$ , a count-query retrieves the number of objects that are alive at q t , and their keys fall in $$\vec{q_k}$$ . We provide a method that accurately answers such queries, with error less than $$\frac{1}{\varepsilon} + \varepsilon \cdot N_{\rm alive}(q_t)$$ , where N alive(q t ) is the number of objects alive at time q t , and 驴 is any constant in (0, 1]. Denoting the disk page size as B, and n = N / B, our technique requires O(n) space, processes any query in O(log B n) time, and supports each update in O(log B n) amortized I/Os. As demonstrated by extensive experiments, the proposed solutions guarantee query results with extremely high precision (median relative error below 5%), while consuming only a fraction of the space occupied by the existing approaches that promise precise results.