Using statistical sampling for query optimization in heterogeneous library information systems

  • Authors:
  • Gregory D. Speegle; Michael J. Donahoo

  • Affiliations:
  • Baylor University, Department of Engineering and Computer Science, Waco, TX; Baylor University, Department of Engineering and Computer Science, Waco, TX

  • Venue:
  • CSC '93 Proceedings of the 1993 ACM conference on Computer science
  • Year:
  • 1993

Abstract

Query optimization in heterogeneous database systems is not always possible because the component DBMSs may be unable to transmit the necessary information. Such systems nevertheless need query optimization, because transmitting large quantities of data across diverse databases is very costly. We propose a query strategy that uses hypothesis testing to determine which of two sets of data is larger. Our experiments show that this strategy is very likely to select the smaller set when the sampling results fall outside a region of uncertainty we call the “grey zone.” This provides query optimization without transmission of database statistics.
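
The abstract does not specify the test statistic, so the following is only a minimal sketch of the general idea, not the authors' procedure. It assumes a two-sample z-test on estimated result sizes, that each source can report how many sampled records match the query, and that an approximate collection size is available for each source (total_a and total_b); all of these, along with the function name, parameter names, and the 1.96 threshold, are illustrative assumptions.

```python
import math


def pick_smaller_set(hits_a, n_a, total_a, hits_b, n_b, total_b, z_crit=1.96):
    """Guess which source yields the smaller result set from samples.

    hits_x  : sampled records at source X that match the query
    n_x     : number of records sampled at source X
    total_x : assumed (approximate) number of records at source X
    z_crit  : assumed critical value; |z| below it counts as the grey zone

    Returns "A" or "B" for the source estimated to be smaller,
    or "grey" when the samples do not separate the two sizes.
    """
    p_a = hits_a / n_a
    p_b = hits_b / n_b

    # Scale the sample proportions up to estimated result-set sizes.
    est_a = total_a * p_a
    est_b = total_b * p_b

    # Variance of the difference between the two size estimates
    # (binomial approximation, samples assumed independent).
    var = (total_a ** 2) * p_a * (1 - p_a) / n_a \
        + (total_b ** 2) * p_b * (1 - p_b) / n_b

    if var == 0:
        # Degenerate samples (all hits or no hits): compare estimates directly.
        if est_a == est_b:
            return "grey"
        return "A" if est_a < est_b else "B"

    z = (est_a - est_b) / math.sqrt(var)

    if abs(z) < z_crit:
        return "grey"  # inside the region of uncertainty
    return "A" if z < 0 else "B"


if __name__ == "__main__":
    # Clearly different match rates: the test can commit to a choice.
    print(pick_smaller_set(10, 100, 1000, 60, 100, 1000))
    # Nearly identical match rates: the result falls in the grey zone.
    print(pick_smaller_set(30, 100, 1000, 33, 100, 1000))
```

In this sketch a caller would transmit the set reported as smaller, and fall back to some default plan (for example, a fixed ordering or a larger sample) when the result is "grey".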