Hash-based structural join algorithms

  • Authors:
  • Christian Mathis;Theo Härder

  • Affiliations:
  • Database and Information Systems, University of Kaiserslautern, Kaiserslautern, Germany;Database and Information Systems, University of Kaiserslautern, Kaiserslautern, Germany

  • Venue:
  • EDBT'06 Proceedings of the 2006 international conference on Current Trends in Database Technology
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Algorithms for processing Structural Joins embody essential building blocks for XML query evaluation. Their design is a difficult task, because they have to satisfy many requirements, e. g., guarantee linear worst-case runtime; generate sorted, duplicate-free output; adapt to fiercely varying input sizes and element distributions; enable pipelining; and (probably) more. Therefore, it is not possible to design the structural join algorithm. Rather, the provision of different specialized operators, from which the query optimizer can choose, is beneficial for query efficiency. We propose new hash-based structural joins that can process unordered input sequences possibly containing duplicates. We also show that these algorithms can substantially reduce the number of sort operations on intermediate results for (complex) tree structured queries (twigs).