Quotient cube: how to summarize the semantics of a data cube

Authors:
Laks V. S. Lakshmanan;Jian Pei;Jiawei Han
Affiliations:
U. of British Columbia;Simon Fraser U.;U. of Illinois
Venue:
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Year:
2002



Abstract

Partitioning a data cube into sets of cells with "similar behavior" often better exposes the semantics in the cube. For example, if we find that average boot sales at the West 10th store of Walmart were the same for winter as for the whole year, this signifies something interesting about the trend of boot sales at that location in that year. In this paper, we are interested in finding succinct summaries of the data cube that exploit regularities present in the cube and have a clear basis. We would like the summary: (i) to be as concise as possible, (ii) to itself form a lattice preserving the rollup/drilldown semantics of the cube, and (iii) to allow the original cube to be fully recovered. We illustrate the utility of solving this problem and discuss the inherent challenges. We develop techniques for partitioning cube cells to obtain succinct summaries, and introduce the quotient cube. We give efficient algorithms for computing it from a base table. For monotone aggregate functions (e.g., COUNT, MIN, MAX, and SUM on non-negative measures), our solution is optimal, i.e., it yields the quotient cube of the least size. For nonmonotone functions (e.g., AVG), we obtain a locally optimal solution. We experimentally demonstrate the efficacy of our ideas and techniques and the scalability of our algorithms.
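
To make the idea of partitioning cube cells into classes with "similar behavior" concrete, here is a minimal sketch in Python. It assumes one particular notion of similarity that the abstract does not spell out: two cells fall in the same class when they cover exactly the same base-table rows, so any aggregate over them is necessarily equal. The tiny base table, the `ALL` marker, and the function names `cube_cells`, `covers`, and `cover_partition` are all hypothetical, and the brute-force enumeration is for illustration only, not the efficient algorithm the abstract refers to.

```python
from collections import defaultdict
from itertools import product

ALL = "*"  # hypothetical marker for an aggregated ("all values") dimension


def cube_cells(rows, n_dims):
    """Enumerate every non-empty cube cell, i.e. every generalization of a base row."""
    cells = set()
    for row in rows:
        dims = row[:n_dims]
        # Each mask decides, per dimension, whether to roll it up to ALL.
        for mask in product([False, True], repeat=n_dims):
            cells.add(tuple(ALL if rolled else v for rolled, v in zip(mask, dims)))
    return cells


def covers(cell, row):
    """A cell covers a row if it agrees with the row on every non-ALL dimension."""
    return all(c == ALL or c == v for c, v in zip(cell, row))


def cover_partition(rows, n_dims):
    """Group cells by the set of base-row indices they cover (assumed equivalence)."""
    classes = defaultdict(list)
    for cell in cube_cells(rows, n_dims):
        covered = frozenset(i for i, row in enumerate(rows) if covers(cell, row))
        classes[covered].append(cell)
    return dict(classes)


# Hypothetical base table: (store, product, season, sales).
rows = [
    ("S1", "P1", "s", 6),
    ("S1", "P2", "s", 12),
    ("S2", "P1", "f", 9),
]

classes = cover_partition(rows, n_dims=3)
total_cells = sum(len(cells) for cells in classes.values())

for covered, cells in sorted(classes.items(), key=lambda kv: sorted(kv[0])):
    # Every cell in a class shares this SUM, because it covers the same rows.
    class_sum = sum(rows[i][3] for i in covered)
    print(sorted(covered), class_sum, sorted(cells))

print(len(classes), "classes summarize", total_cells, "cells")
# prints "6 classes summarize 18 cells"
```

In this toy table, 18 non-empty cells collapse into 6 classes, and storing one aggregate per class is enough to recover the value of every cell. The lattice ordering between classes, which the abstract lists as a requirement, is not modeled here.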