Processing star queries on hierarchically-clustered fact tables

Authors:
Nikos Karayannidis;Aris Tsois;Timos Sellis;Roland Pieringer;Volker Markl;Frank Ramsak;Robert Fenk;Klaus Elhardt;Rudolf Bayer
Affiliations:
Institute of Communication and Computer Systems and Department of Electrical and Computer Engineering, National Technical University of Athens, Zographou, Athens, Hellas;Institute of Communication and Computer Systems and Department of Electrical and Computer Engineering, National Technical University of Athens, Zographou, Athens, Hellas;Institute of Communication and Computer Systems and Department of Electrical and Computer Engineering, National Technical University of Athens, Zographou, Athens, Hellas;TransAction Software GmbH Gustav-Heinemann-Ring, München, Germany;IBM Almaden Research Center, San Jose, CA;Bayerisches Forschungszentrum für Wissensbasierte Systeme, Orleansstrá, München, Germany;Bayerisches Forschungszentrum für Wissensbasierte Systeme, Orleansstrá, München, Germany;TransAction Software GmbH Gustav-Heinemann-Ring, München, Germany;Institut förmatik, TU-München, Orleansstraße, München, Germany
Venue:
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Year:
2002

Citing 25
Cited 14

The design and analysis of spatial data structures

The design and analysis of spatial data structures
Multi-table joins through bitmapped join indices

ACM SIGMOD Record
An overview of data warehousing and OLAP technology

ACM SIGMOD Record
Improved query performance with variant indexes

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Data warehousing and OLAP for decision support

SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Materialized views and data warehouses

ACM SIGMOD Record
An alternative storage organization for ROLAP aggregate views based on cubetrees

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Caching multidimensional queries using chunks

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Bitmap index design and evaluation

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Multidimensional access methods

ACM Computing Surveys (CSUR)
Query optimization for selections using bitmaps

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
The Grid File: An Adaptable, Symmetric Multikey File Structure

ACM Transactions on Database Systems (TODS)
Orthogonal optimization of subqueries and aggregation

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
A performance comparison of bitmap indexes

Proceedings of the tenth international conference on Information and knowledge management
Encoded Bitmap Indexing for Data Warehouses

ICDE '98 Proceedings of the Fourteenth International Conference on Data Engineering
Performing Group-By before Join

Proceedings of the Tenth International Conference on Data Engineering
Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Total

ICDE '96 Proceedings of the Twelfth International Conference on Data Engineering
Back to the Future: Dynamic Hierarchical Clustering

ICDE '98 Proceedings of the Fourteenth International Conference on Data Engineering
Including Group-By in Query Optimization

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Query Optimization by Predicate Move-Around

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Aggregate-Query Processing in Data Warehousing Environments

VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Eager Aggregation and Lazy Aggregation

VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Answering Queries with Aggregation Using Views

VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases
The Universal B-Tree for Multidimensional Indexing: general Concepts

WWCA '97 Proceedings of the International Conference on Worldwide Computing and Its Applications
Improving OLAP Performance by Multidimensional Hierarchical Clustering

IDEAS '99 Proceedings of the 1999 International Symposium on Database Engineering & Applications

SISYPHUS: the implementation of a chunk-based storage manager for OLAP data cubes

Data & Knowledge Engineering - Special issue: Advances in OLAP
Processing OLAP queries in hierarchically clustered databases

Data & Knowledge Engineering - Special issue: Advances in OLAP
Exploiting hierarchical clustering in evaluating multidimensional aggregation queries

DOLAP '03 Proceedings of the 6th ACM international workshop on Data warehousing and OLAP
Hierarchies in a multidimensional model: from conceptual modeling to logical representation

Data & Knowledge Engineering - Special issue: WIDM 2004
Star join revisited: Performance internals for cluster architectures

Data & Knowledge Engineering
The generalized pre-grouping transformation: aggregate-query optimization in the presence of dependencies

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Hierarchical clustering for OLAP: the CUBE File approach

The VLDB Journal — The International Journal on Very Large Data Bases
Design of the ERATOSTHENES OLAP server

PCI'01 Proceedings of the 8th Panhellenic conference on Informatics
An improved OLAP join and aggregate algorithm based on dimension hierarchy

FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 5
CORADD: correlation aware database designer for materialized views and indexes

Proceedings of the VLDB Endowment
Data warehouse design on the basis of Hierarchical Degenerate Snowflake (HDS)

International Journal of Business Intelligence and Data Mining
LinearDB: a relational approach to make data warehouse scale like MapReduce

DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications: Part II
Scatter-Gather-Merge: An efficient star-join query processing algorithm for data-parallel frameworks

Cluster Computing
Integrating Star and Snowflake Schemas in Data Warehouses

International Journal of Data Warehousing and Mining

Quantified Score

Hi-index	0.00

Visualization

Abstract

Star queries are the most prevalent kind of queries in data warehousing, OLAP and business intelligence applications. Thus, there is an imperative need for efficiently processing star queries. To this end, a new class of fact table organizations has emerged that exploits path-based surrogate keys in order to hierarchically cluster the fact table data of a star schema [DRSN98, MRB99, KS01]. In the context of these new organizations, star query processing changes radically. In this paper, we present a complete abstract processing plan that captures all the necessary steps in evaluating such queries over hierarchically clustered fact tables. Furthermore, we present optimizations for surrogate key processing and a novel early grouping transformation for grouping on the dimension hierarchies. Our algorithms have been already implemented in a commercial relational database management system (RDBMS) and the experimental evaluation, as well as customer feedback, indicates speedups of orders of magnitude for typical star queries in real world applications.