An efficient algorithm for computing range-groupby queries

Authors:
Young-Koo Lee;Woong-Kee Loh;Yang-Sae Moon;Kyu-Young Whang;Il-Yeol Song
Affiliations:
Department of Computer Science &, Advanced Information Technology Research Center (AITrc), Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Korea;Department of Computer Science &, Advanced Information Technology Research Center (AITrc), Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Korea;Department of Computer Science, Kangwon National University, Chunchon, Kangwon, Korea;Department of Computer Science &, Advanced Information Technology Research Center (AITrc), Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Korea;College of Information Science and Technology, Drexel University, Philadelphia, Pennsylvania
Venue:
DASFAA'06 Proceedings of the 11th international conference on Database Systems for Advanced Applications
Year:
2006

Citing 12
Cited 0

An overview of data warehousing and OLAP technology

ACM SIGMOD Record
Range queries in OLAP data cubes

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Approximating multi-dimensional aggregate range queries over real attributes

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Optimal and approximate computation of summary statistics for range aggregates

PODS '01 Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Progressive approximate aggregate queries with a multi-resolution tree structure

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
The Dynamic Data Cube

EDBT '00 Proceedings of the 7th International Conference on Extending Database Technology: Advances in Database Technology
ProPolyne: A Fast Wavelet-Based Algorithm for Progressive Evaluation of Polynomial Range-Sum Queries

EDBT '02 Proceedings of the 8th International Conference on Extending Database Technology: Advances in Database Technology
Modeling Multidimensional Databases

ICDE '97 Proceedings of the Thirteenth International Conference on Data Engineering
Hierarchical Prefix Cubes for Range-Sum Queries

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Relative Prefix Sums: An Efficient Approach for Querying Dynamic OLAP Data Cubes

ICDE '99 Proceedings of the 15th International Conference on Data Engineering
Incremental computation and maintenance of temporal aggregates

The VLDB Journal — The International Journal on Very Large Data Bases
Approximate Temporal Aggregation

ICDE '04 Proceedings of the 20th International Conference on Data Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

Aggregation queries for arbitrary regions in an n-dimensional space are powerful tools for data analysis in OLAP. A GROUP BY query in OLAP is very important since it allows us to summarize various trends along with any combination of dimensions. In this paper, we extend the previous aggregation queries by including the GROUP BY clause for arbitrary regions. We call the extension range-groupby queries and present an efficient algorithm for processing them. A typical method of achieving fast response time for aggregation queries is using the prefix-sum array, which stores precomputed partial aggregation values. A naive method for range-groupby queries maintains a prefix-sum array for each combination of the grouping dimensions in an n-dimensional cube, which incurs enormous storage overhead. Our algorithm maintains only one prefix-sum array and still effectively processes range-groupby queries for all possible combinations of multiple grouping dimensions. Compared with the naive method, our algorithm reduces the space overhead by $O(\frac{1}{2^n})$, while accessing almost the identical number of cells.