Fast algorithms for universal quantification in large databases

Authors:
Goetz Graefe; Richard L. Cole
Affiliations:
Microsoft Corp., One Microsoft Way, Redmond, WA; Redbrick Systems
Venue:
ACM Transactions on Database Systems (TODS)
Year:
1995

Abstract

Universal quantification is not supported directly in most database systems despite the fact that it adds significant power to a system's query processing and inference capabilities, in particular for the analysis of many-to-many relationships and of set-valued attributes. One of the main reasons for this omission has been that universal quantification algorithms and their performance have not been explored for large databases. In this article, we describe and compare three known algorithms and one recently proposed algorithm for relational division, the algebra operator that embodies universal quantification. For each algorithm, we investigate the performance effects of explicit duplicate removal and referential integrity enforcement, variants for inputs larger than memory, and parallel execution strategies. Analytical and experimental performance comparisons illustrate the substantial differences among the algorithms. Moreover, comparisons demonstrate that the recently proposed division algorithm evaluates a universal quantification predicate over two relations as fast as hash (semi-)join evaluates an existential quantification predicate over the same relations. Thus, existential and universal quantification can be supported with equal efficiency by adding the recently proposed algorithm to a query evaluation system. A second result of our study is that universal quantification should be expressed directly in a database query language, because most query optimizers do not recognize the rather indirect formulations available in SQL as relational division and therefore produce very poor evaluation plans for many universal quantification queries.
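
To make the operator concrete, the following is a minimal sketch of relational division, not the article's implementation. It assumes both inputs fit in memory and uses a hypothetical schema: a dividend of (student, course) pairs and a divisor of required courses. The function names `divide` and `divide_sql`, the table names, and the sample data are all illustrative assumptions. `divide` is a simple hash-based approach that keeps one bitmap per quotient candidate; `divide_sql` runs the indirect double NOT EXISTS formulation in SQLite to show the same query expressed without a division operator.

```python
import sqlite3


def divide(dividend, divisor):
    """Return every q such that (q, d) is in dividend for all d in divisor."""
    # Number the distinct divisor values; duplicate divisor rows collapse here.
    divisor_index = {}
    for d in divisor:
        divisor_index.setdefault(d, len(divisor_index))
    full_mask = (1 << len(divisor_index)) - 1

    # One bitmap per quotient candidate: bit i is set once divisor value i
    # has been seen together with that candidate. Duplicate dividend rows
    # simply set the same bit again.
    seen = {}
    for q, d in dividend:
        mask = seen.setdefault(q, 0)
        i = divisor_index.get(d)
        if i is not None:  # rows whose value is not in the divisor are ignored
            seen[q] = mask | (1 << i)

    return {q for q, mask in seen.items() if mask == full_mask}


# The indirect SQL formulation: "students for whom no required course
# exists that they are not enrolled in".
DOUBLE_NEGATION = """
    SELECT DISTINCT e.student
    FROM enrolled e
    WHERE NOT EXISTS (
        SELECT 1 FROM required r
        WHERE NOT EXISTS (
            SELECT 1 FROM enrolled e2
            WHERE e2.student = e.student AND e2.course = r.course))
"""


def divide_sql(dividend, divisor):
    """Evaluate the same division with the double NOT EXISTS query in SQLite."""
    con = sqlite3.connect(":memory:")
    con.execute("CREATE TABLE enrolled (student TEXT, course TEXT)")
    con.execute("CREATE TABLE required (course TEXT)")
    con.executemany("INSERT INTO enrolled VALUES (?, ?)", dividend)
    con.executemany("INSERT INTO required VALUES (?)", [(d,) for d in divisor])
    rows = con.execute(DOUBLE_NEGATION).fetchall()
    con.close()
    return {row[0] for row in rows}


# Illustrative data (an assumption, not taken from the article).
enrolled = [
    ("ann", "db"), ("ann", "os"), ("ann", "db"),  # duplicate row on purpose
    ("bob", "db"),
    ("cat", "db"), ("cat", "os"), ("cat", "ai"),
]
required = ["db", "os"]

if __name__ == "__main__":
    print(sorted(divide(enrolled, required)))
    print(sorted(divide_sql(enrolled, required)))
```

In this sketch `divide` makes one pass over each input, whereas the SQL form depends on the optimizer recognizing the nested subqueries as a division. With an empty divisor, both versions return every student that appears in the dividend, which matches the usual reading of a universally quantified predicate over an empty set.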