Divide-and-Conquer Algorithm for Computing Set Containment Joins

  • Authors:
  • Sergey Melnik; Hector Garcia-Molina

  • Venue:
  • EDBT '02 Proceedings of the 8th International Conference on Extending Database Technology: Advances in Database Technology
  • Year:
  • 2002

Abstract

A set containment join is a join between set-valued attributes of two relations, whose join condition is specified using the subset (⊆) operator. Set containment joins are used in a variety of database applications. In this paper, we propose a novel partitioning algorithm called Divide-and-Conquer Set Join (DCJ) for computing set containment joins efficiently. We show that the divide-and-conquer approach outperforms previously suggested algorithms over a wide range of data sets. We present a detailed analysis of DCJ and previously known algorithms and describe their behavior in an implemented testbed.
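
To make the join condition concrete, the following is a minimal sketch of a naive nested-loop set containment join. It is not the DCJ algorithm from the paper, only an illustration of the definition above. The relation layout (lists of `(id, set)` pairs), the function name `naive_set_containment_join`, and the sample data are all assumptions made for this example.

```python
def naive_set_containment_join(r_rows, s_rows):
    """Return (r_id, s_id) pairs whose R-side set is a subset of the S-side set.

    Illustrative baseline only: it compares every pair of tuples,
    so it performs len(r_rows) * len(s_rows) subset tests.
    """
    return [
        (r_id, s_id)
        for r_id, r_set in r_rows
        for s_id, s_set in s_rows
        if r_set <= s_set  # Python's subset test on sets
    ]


# Hypothetical sample relations with one set-valued attribute each.
R = [("r1", frozenset({1, 2})), ("r2", frozenset({3})), ("r3", frozenset({2, 5}))]
S = [("s1", frozenset({1, 2, 3})), ("s2", frozenset({2, 5, 7}))]

result = naive_set_containment_join(R, S)
print(result)  # [('r1', 's1'), ('r2', 's1'), ('r3', 's2')]
```

The quadratic number of subset tests in this baseline is the cost that partitioning approaches such as the one proposed in the paper aim to reduce.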