Multiple join size estimation by virtual domains (extended abstract)

  • Authors: Allen Van Gelder

  • Venue: PODS '93 Proceedings of the twelfth ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems
  • Year: 1993

Abstract

A model is described to estimate the size of intermediate relations produced by large relational algebra expressions, in particular, those containing several equi-joins. The intended application is within query optimization searches, where fast estimates are needed as many alternative plans are examined. It is shown that previous methods, which use an independence assumption when several attributes are joined, can lead to unrealistically low size estimates. This method attempts to overcome that problem by the introduction of “virtual domains”, which avoid the independence assumption. The method does not require extensive statistics about the database. After describing an “exact” version, an approximation that is simpler and faster is presented.
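
The underestimation problem mentioned in the abstract can be illustrated with a small sketch. It is not the virtual-domain method from the paper, whose details are not given here. It only assumes the textbook estimate for a single equi-join attribute, |R|·|S| / max(d_R, d_S), where d is the number of distinct values, and multiplies the per-attribute factors as an independence assumption would. The relations, attribute positions, and function names are hypothetical.

```python
from collections import Counter


def independence_estimate(r, s, attrs):
    """Estimate |R join S| on several attributes, treating them as independent.

    Assumed textbook formula (not taken from the paper): start from |R| * |S|
    and divide by max(distinct values in R, distinct values in S) per attribute.
    """
    est = len(r) * len(s)
    for i in attrs:
        d_r = len({t[i] for t in r})
        d_s = len({t[i] for t in s})
        est /= max(d_r, d_s)
    return est


def actual_join_size(r, s, attrs):
    """Count the tuples an equi-join on `attrs` would really produce."""
    def key(t):
        return tuple(t[i] for i in attrs)

    r_counts = Counter(key(t) for t in r)
    s_counts = Counter(key(t) for t in s)
    return sum(c * s_counts[k] for k, c in r_counts.items())


# Hypothetical data: both attributes always carry the same value, so they are
# perfectly correlated and the second join condition filters nothing extra.
n, d = 100, 10
r = [(i % d, i % d) for i in range(n)]
s = [(i % d, i % d) for i in range(n)]

estimate = independence_estimate(r, s, (0, 1))
actual = actual_join_size(r, s, (0, 1))
print(estimate, actual)  # 100.0 1000
```

In this constructed case the independence-based estimate is too low by a factor of d, because the second join attribute is counted as an extra filter even though it adds no selectivity. Estimates of this kind compound across several joins in a large expression.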