An array-based algorithm for simultaneous multidimensional aggregates

Authors:
Yihong Zhao;Prasad M. Deshpande;Jeffrey F. Naughton
Affiliations:
Computer Sciences Department, University of Wisconsin-Madison;Computer Sciences Department, University of Wisconsin-Madison;Computer Sciences Department, University of Wisconsin-Madison
Venue:
SIGMOD '97 Proceedings of the 1997 ACM SIGMOD international conference on Management of data
Year:
1997

Citing 4
Cited 152

OLAP, relational, and multidimensional database systems

ACM SIGMOD Record
Implementation techniques for main memory database systems

SIGMOD '84 Proceedings of the 1984 ACM SIGMOD international conference on Management of data
Efficient Organization of Large Multidimensional Arrays

Proceedings of the Tenth International Conference on Data Engineering
On the Computation of Multidimensional Aggregates

VLDB '96 Proceedings of the 22th International Conference on Very Large Data Bases

OLAP and statistical databases: similarities and differences

PODS '97 Proceedings of the sixteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
A lower bound theorem for indexing schemes and its application to multidimensional range queries

PODS '98 Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Dynamic assembly of views in data cubes

PODS '98 Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
An alternative storage organization for ROLAP aggregate views based on cubetrees

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Caching multidimensional queries using chunks

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Efficient support of parallel sparse computation for array intrinsic functions of Fortran 90

ICS '98 Proceedings of the 12th international conference on Supercomputing
Data cube approximation and histograms via wavelets

Proceedings of the seventh international conference on Information and knowledge management
Improving main memory utilization for array-based datacube computation

Proceedings of the 1st ACM international workshop on Data warehousing and OLAP
High performance multidimensional analysis of large datasets

Proceedings of the 1st ACM international workshop on Data warehousing and OLAP
Dynamic maintenance of multidimensional range data partitioning for parallel data processing

Proceedings of the 1st ACM international workshop on Data warehousing and OLAP
Approximate computation of multidimensional aggregates of sparse data using wavelets

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Bottom-up computation of sparse and Iceberg CUBE

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
DynaMat: a dynamic view management system for data warehouses

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Compressed data cubes for OLAP aggregate query approximation on continuous dimensions

KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
A dynamic load balancing strategy for parallel datacube computation

Proceedings of the 2nd ACM international workshop on Data warehousing and OLAP
Requirement-based data cube schema design

Proceedings of the eighth international conference on Information and knowledge management
Using wavelet decomposition to support progressive and approximate range-sum queries over data cubes

Proceedings of the ninth international conference on Information and knowledge management
CubiST: a new algorithm for improving the performance of ad-hoc OLAP queries

Proceedings of the 3rd ACM international workshop on Data warehousing and OLAP
Efficient computation of Iceberg cubes with complex measures

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Iceberg-cube computation with PC clusters

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Proxy-server architectures for OLAP

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
A case for dynamic view management

ACM Transactions on Database Systems (TODS)
High performance multidimensional analysis and data mining

SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Loglinear-Based Quasi Cubes

Journal of Intelligent Information Systems
Efficient aggregation over objects with extent

Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
An adaptive peer-to-peer network for distributed caching of OLAP results

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Dwarf: shrinking the PetaCube

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Parallelizing the Data Cube

Distributed and Parallel Databases - Special issue: Parallel and distributed data mining
Fully Dynamic Partitioning: Handling Data Skew in Parallel Data Cube Computation

Distributed and Parallel Databases
Object-Based Selective Materialization for Efficient Implementation of Spatial Data Cubes

IEEE Transactions on Knowledge and Data Engineering
Efficient Aggregation Algorithms for Compressed Data Warehouses

IEEE Transactions on Knowledge and Data Engineering
Coarse Grained Parallel On-Line Analytical Processing (OLAP) for Data Mining

ICCS '01 Proceedings of the International Conference on Computational Science-Part II
Parallelizing the Data Cube

ICDT '01 Proceedings of the 8th International Conference on Database Theory
Fast Computation of Sparse Datacubes

VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
nD-SQL: A Multi-Dimensional Language for Interoperability and OLAP

VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Materialized View Selection for Multidimensional Datasets

VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Aggregation Algorithms for Very Large Compressed Data Warehouses

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Using Loglinear Models to Compress Datacube

WAIM '00 Proceedings of the First International Conference on Web-Age Information Management
Online Dynamic Reordering for Interactive Data Processing

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
What can Hierarchies do for Data Warehouses?

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Decision Tables: Scalable Classification Exploring RDBMS Capabilities

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Analysis of Accuracy of Data Reduction Techniques

DaWaK '99 Proceedings of the First International Conference on Data Warehousing and Knowledge Discovery
Elimination of Redundant Views in Multidimensional Aggregates

DaWaK 2000 Proceedings of the Second International Conference on Data Warehousing and Knowledge Discovery
Computing Partial Data Cubes for Parallel Data Warehousing Applications

Proceedings of the 8th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
DROLAP - A Dense-Region Based Approach to On-Line Analytical Processing

DEXA '99 Proceedings of the 10th International Conference on Database and Expert Systems Applications
Optimizing multiple dimensional queries simultaneously in multidimensional databases

The VLDB Journal — The International Journal on Very Large Data Bases
Online dynamic reordering

The VLDB Journal — The International Journal on Very Large Data Bases
Managing and analyzing massive data sets with data cubes

Handbook of massive data sets
Data warehousing

Handbook of massive data sets
Aggregate view management in data warehouses

Handbook of massive data sets
Implementing data cube construction using a cluster middleware: algorithms, implementation experience, and performance evaluation

Future Generation Computer Systems - Selected papers from CCGRID 2002
An aggregation algorithm using a multidimensional file in multidimensional OLAP

Information Sciences: an International Journal
DBMiner: a system for data mining in relational databases and data warehouses

CASCON '97 Proceedings of the 1997 conference of the Centre for Advanced Studies on Collaborative research
Optimizing Selections over Datacubes

SSDBM '00 Proceedings of the 12th International Conference on Scientific and Statistical Database Management
Serving Datacube Tuples from Main Memory

SSDBM '00 Proceedings of the 12th International Conference on Scientific and Statistical Database Management
QC-trees: an efficient summary structure for semantic OLAP

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
Efficient OLAP operations for spatial data using peano trees

DMKD '03 Proceedings of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery
Operators for multidimensional aggregate data

Multidimensional databases
Querying multidimensional data

Multidimensional databases
CubiST++: Evaluating Ad-Hoc CUBE Queries Using Statistics Trees

Distributed and Parallel Databases
Multidimensional data model and query language for informetrics

Journal of the American Society for Information Science and Technology
Attribute value reordering for efficient hybrid OLAP

DOLAP '03 Proceedings of the 6th ACM international workshop on Data warehousing and OLAP
Hierarchical dwarfs for the rollup cube

DOLAP '03 Proceedings of the 6th ACM international workshop on Data warehousing and OLAP
Parallel ROLAP Data Cube Construction on Shared-Nothing Multiprocessors

Distributed and Parallel Databases
A Wavelet Framework for Adapting Data Cube Views for OLAP

IEEE Transactions on Knowledge and Data Engineering
Range CUBE: Efficient Cube Computation by Exploiting Data Correlation

ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Mining Constrained Gradients in Large Databases

IEEE Transactions on Knowledge and Data Engineering
Incremental maintenance of quotient cube for median

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Incremental maintenance of quotient cube based on Galois lattice

Journal of Computer Science and Technology
Array algorithms

Journal of Computing Sciences in Colleges
Divide-and-Approximate: A Novel Constraint Push Strategy for Iceberg Cube Mining

IEEE Transactions on Knowledge and Data Engineering
Multidimensional models: constructing data CUBE

CompSysTech '04 Proceedings of the 5th international conference on Computer systems and technologies
Compressing arrays by ordering attribute values

Information Processing Letters
Efficient computation of the skyline cube

VLDB '05 Proceedings of the 31st international conference on Very large data bases
MDL summarization with holes

VLDB '05 Proceedings of the 31st international conference on Very large data bases
C-store: a column-oriented DBMS

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Parallel querying of ROLAP cubes in the presence of hierarchies

Proceedings of the 8th ACM international workshop on Data warehousing and OLAP
Communication and Memory Optimal Parallel Data Cube Construction

IEEE Transactions on Parallel and Distributed Systems
Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams

Distributed and Parallel Databases
The cgmCUBE project: Optimizing parallel data cube generation for ROLAP

Distributed and Parallel Databases
An extendible multidimensional array system for MOLAP

Proceedings of the 2006 ACM symposium on Applied computing
Supporting ad-hoc ranking aggregates

Proceedings of the 2006 ACM SIGMOD international conference on Management of data
CURE for cubes: cubing using a ROLAP engine

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Regression Cubes with Lossless Compression and Aggregation

IEEE Transactions on Knowledge and Data Engineering
Towards multidimensional subspace skyline analysis

ACM Transactions on Database Systems (TODS)
Computing Iceberg Cubes by Top-Down and Bottom-Up Integration: The StarCubing Approach

IEEE Transactions on Knowledge and Data Engineering
Efficient Computation of Iceberg Cubes by Bounding Aggregate Functions

IEEE Transactions on Knowledge and Data Engineering
Answering ad hoc aggregate queries from data streams using prefix aggregate trees

Knowledge and Information Systems
Multi-dimensional regression analysis of time-series data streams

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Quotient cube: how to summarize the semantics of a data cube

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
A one-pass aggregation algorithm with the optimal buffer size in multidimensional OLAP

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
ROLAP implementations of the data cube

ACM Computing Surveys (CSUR)
Stasis: flexible transactional storage

OSDI '06 Proceedings of the 7th symposium on Operating systems design and implementation
COMBI-operator - database support for data mining applications

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Star-cubing: computing iceberg cubes by top-down and bottom-up integration

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
High-dimensional OLAP: a minimal cubing approach

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
The polynomial complexity of fully materialized coalesced cubes

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
GridDB: a data-centric overlay for scientific grids

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Optimal chunking of large multidimensional arrays for data warehousing

Proceedings of the ACM tenth international workshop on Data warehousing and OLAP
Efficient computation of view subsets

Proceedings of the ACM tenth international workshop on Data warehousing and OLAP
Mapgraph: efficient methods for complex olap hierarchies

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
PnP: sequential, external memory, and parallel iceberg cube computation

Distributed and Parallel Databases
History offset implementation scheme for large scale multidimensional data sets

Proceedings of the 2008 ACM symposium on Applied computing
ARCube: supporting ranking aggregate queries in partially materialized data cubes

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Supporting the data cube lifecycle: the power of ROLAP

The VLDB Journal — The International Journal on Very Large Data Bases
Online mining of fuzzy multidimensional weighted association rules

Applied Intelligence
A Summary Structure of Data Cube Preserving Semantics

RSEISP '07 Proceedings of the international conference on Rough Sets and Intelligent Systems Paradigms
A Probabilistic Approach for Computing Approximate Iceberg Cubes

DEXA '08 Proceedings of the 19th international conference on Database and Expert Systems Applications
Computing data cubes without redundant aggregated nodes and single graph paths: the sequential MCG approach

SBBD '08 Proceedings of the 23rd Brazilian symposium on Databases
Efficient Storage and Querying of Horizontal Tables Using a PIVOT Operation in Commercial Relational DBMSs

IEICE - Transactions on Information and Systems
Computing data cubes using exact sub-graph matching: the sequential MCG approach

Proceedings of the 2009 ACM symposium on Applied Computing
The Multi-Tree Cubing algorithm for computing iceberg cubes

Journal of Intelligent Information Systems
Efficient Online Aggregates in Dense-Region-Based Data Cube Representations

DaWaK '09 Proceedings of the 11th International Conference on Data Warehousing and Knowledge Discovery
BitCube: A Bottom-Up Cubing Engineering

DaWaK '09 Proceedings of the 11th International Conference on Data Warehousing and Knowledge Discovery
Compressing multidimensional structures: a case study

ECC'09 Proceedings of the 3rd international conference on European computing conference
Parallel OLAP with the Sidera server

Future Generation Computer Systems
Strategies for complex data cube queries

Applied Intelligence
Graph OLAP: a multi-dimensional framework for graph data analysis

Knowledge and Information Systems
Compressing arrays by ordering attribute values

Information Processing Letters
Knowledge grid support for treatment of traumatic brain injury victims

ICCSA'03 Proceedings of the 2003 international conference on Computational science and its applications: PartI
Integrating fuzziness with OLAP association rules mining

MLDM'03 Proceedings of the 3rd international conference on Machine learning and data mining in pattern recognition
Revisiting the cube lifecycle in the presence of hierarchies

The VLDB Journal — The International Journal on Very Large Data Bases
An efficient implementation for MOLAP basic data structure and its evaluation

DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Sidera: a cluster-based server for online analytical processing

OTM'07 Proceedings of the 2007 OTM confederated international conference on On the move to meaningful internet systems: CoopIS, DOA, ODBASE, GADA, and IS - Volume Part II
What-if analysis in MOLAP environments

FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 2
An incremental maintenance scheme of data cubes

DASFAA'08 Proceedings of the 13th international conference on Database systems for advanced applications
Linear programming approach for performance-driven data aggregation in networks of embedded sensors

Proceedings of the Conference on Design, Automation and Test in Europe
Visual cube and on-line analytical processing of images

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Massive structured data management solution

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Multidimensional arrays for warehousing data on clouds

Globe'10 Proceedings of the Third international conference on Data management in grid and peer-to-peer systems
Double table switch: an efficient partitioning algorithm for bottom-up computation of data cubes

ADMA'10 Proceedings of the 6th international conference on Advanced data mining and applications - Volume Part II
Extracting semantics in OLAP databases using emerging cubes

Information Sciences: an International Journal
Multidimensional cyclic graph approach: Representing a data cube without common sub-graphs

Information Sciences: an International Journal
Efficient online aggregates in dense-region-based data cube representations

Transactions on large-scale data- and knowledge-centered systems II
Efficient online aggregates in dense-region-based data cube representations

Transactions on large-scale data- and knowledge-centered systems II
Implementing vertical splitting for large scale multidimensional datasets and its evaluations

DaWaK'11 Proceedings of the 13th international conference on Data warehousing and knowledge discovery
EaCRS: an extendible array based compression scheme for high dimensional data

Proceedings of the Second Symposium on Information and Communication Technology
Parallel data cubes on multi-core processors with multiple disks

Proceedings of the 2011 Conference of the Center for Advanced Studies on Collaborative Research
Task scheduling for GPU accelerated OLAP systems

Proceedings of the 2011 Conference of the Center for Advanced Studies on Collaborative Research
Computing iceberg quotient cubes with bounding

DaWaK'06 Proceedings of the 8th international conference on Data Warehousing and Knowledge Discovery
Efficient computation of multi-feature data cubes

KSEM'06 Proceedings of the First international conference on Knowledge Science, Engineering and Management
PMC: select materialized cells in data cubes

DaWaK'05 Proceedings of the 7th international conference on Data Warehousing and Knowledge Discovery
An extendible array based implementation of relational tables for multi dimensional databases

DaWaK'05 Proceedings of the 7th international conference on Data Warehousing and Knowledge Discovery
HQC: an efficient method for ROLAP with hierarchical dimensions

RSFDGrC'05 Proceedings of the 10th international conference on Rough Sets, Fuzzy Sets, Data Mining, and Granular Computing - Volume Part II
An efficient indexing technique for computing high dimensional data cubes

WAIM '06 Proceedings of the 7th international conference on Advances in Web-Age Information Management
Exploiting temporal correlation in temporal data warehouses

DASFAA'05 Proceedings of the 10th international conference on Database Systems for Advanced Applications
Attribute value reordering for efficient hybrid OLAP

Information Sciences: an International Journal
An OLAM-based framework for complex knowledge pattern discovery in distributed-and-heterogeneous-data-sources and cooperative information systems

DaWaK'07 Proceedings of the 9th international conference on Data Warehousing and Knowledge Discovery
Multi-level relationship outlier detection

International Journal of Business Intelligence and Data Mining
Normalised LCS-based method for indexing multidimensional data cube

International Journal of Intelligent Information and Database Systems
Permuting data on random-access block storage

Proceedings of the VLDB Endowment
An Adaptive Hybrid OLAP Architecture with optimized memory access patterns

Cluster Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Computing multiple related group-bys and aggregates is one of the core operations of On-Line Analytical Processing (OLAP) applications. Recently, Gray et al. [GBLP95] proposed the “Cube” operator, which computes group-by aggregations over all possible subsets of the specified dimensions. The rapid acceptance of the importance of this operator has led to a variant of the Cube being proposed for the SQL standard. Several efficient algorithms for Relational OLAP (ROLAP) have been developed to compute the Cube. However, to our knowledge there is nothing in the literature on how to compute the Cube for Multidimensional OLAP (MOLAP) systems, which store their data in sparse arrays rather than in tables. In this paper, we present a MOLAP algorithm to compute the Cube, and compare it to a leading ROLAP algorithm. The comparison between the two is interesting, since although they are computing the same function, one is value-based (the ROLAP algorithm) whereas the other is position-based (the MOLAP algorithm). Our tests show that, given appropriate compression techniques, the MOLAP algorithm is significantly faster than the ROLAP algorithm. In fact, the difference is so pronounced that this MOLAP algorithm may be useful for ROLAP systems as well as MOLAP systems, since in many cases, instead of cubing a table directly, it is faster to first convert the table to an array, cube the array, then convert the result back to a table.