Automatic Aggregation Using Explicit Metadata

Authors:
Stéphane Grumbach;Leonardo Tininini
Affiliations:
-;-
Venue:
SSDBM '00 Proceedings of the 12th International Conference on Scientific and Statistical Database Management
Year:
2000

Citing 19
Cited 1

Statistical and Scientific Database Issues

IEEE Transactions on Software Engineering
Statistical relational tables for statistical database management

IEEE Transactions on Software Engineering
Extending relational algebra and relational calculus with set-valued attributes and aggregate functions

ACM Transactions on Database Systems (TODS)
Conceptual language for statistical data modeling

Data & Knowledge Engineering
Implementing data cubes efficiently

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Statistical inference and data mining

Communications of the ACM
The aggregate data problem: a system for their definition and management

ACM SIGMOD Record
An overview of data warehousing and OLAP technology

ACM SIGMOD Record
OLAP and statistical databases: similarities and differences

PODS '97 Proceedings of the sixteenth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
Towards on-line analytical mining in large databases

ACM SIGMOD Record
Querying aggregate data

PODS '99 Proceedings of the eighteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
A language and a physical organization technique for summary tables

SIGMOD '85 Proceedings of the 1985 ACM SIGMOD international conference on Management of data
On the content of materialized aggregate views

PODS '00 Proceedings of the nineteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Total

ICDE '96 Proceedings of the Twelfth International Conference on Data Engineering
Metadata Management for Large Statistical Databases

VLDB '82 Proceedings of the 8th International Conference on Very Large Data Bases
Statistical Databases: Characteristics, Problems, and some Solutions

VLDB '82 Proceedings of the 8th International Conference on Very Large Data Bases
Summarizability in OLAP and Statistical Data Bases

SSDBM '97 Proceedings of the Ninth International Conference on Scientific and Statistical Database Management
Proposal of a logical model for statistical data base

SSDBM'83 Proceedings of the 2nd international workshop on Proceedings of the Second International Workshop on Statistical Database Management
SUBJECT: a directory driven system for organizing and accessing large statistical databases

VLDB '81 Proceedings of the seventh international conference on Very Large Data Bases - Volume 7

Querying multidimensional data

Multidimensional databases

Quantified Score

Hi-index	0.00

Visualization

Abstract

The paper presents a logical data model for statistical data with an aggregation. The data are stored in standard relations from the relational model, while the metadata, defining the semantics of the relations, are represented by numerical dependencies, which specify the way the summary values are defined in terms of micro-data, as well as the interrelationships among summary values. The present model supports standard relational languages such as SQL. Relations with numerical dependencies are then seen as {\it statistical views} over initial relations of micro-data. Queries can be asked either against the views or directly against the initial relations, and in this later case answered, when possible, using the views. The numerical dependencies of the views are run together with the query to compute the answer to the query. This is handled in a completely automatic manner, with no need for the user to deal with the intricacy of metadata. The mechanism has been tested by an implementation in Prolog of meaningful examples of queries and dependencies. It is shown in particular that various classical problems in the realm of statistical and multidimensional databases can be easily modeled and solved in the present framework. Finally, the proposed formalism is shown to be useful for statistical database schema design.