On the Efficiency of a Prefix Path Holistic Algorithm

  • Authors:
  • Radim Bača;Michal Krátký

  • Affiliations:
  • Department of Computer Science, Technical University of Ostrava, Czech Republic;Department of Computer Science, Technical University of Ostrava, Czech Republic

  • Venue:
  • XSym '09 Proceedings of the 6th International XML Database Symposium on Database and XML Technologies
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In recent years, many approaches to XML twig pattern searching have been developed. Holistic approaches such as TwigStack are particularly significant in that they provide a powerful theoretical model for optimal processing of some query types. Holistic algorithms use various partitionings of an XML document called streaming schemes and they prove algorithm optimality depending on query characteristics. In this article, we introduce a variant of the TwigStack algorithm which can work with various streaming schemes. Its efficiency does not deteriorate when the number of streams per query node is increased, as it does in the case of the iTwigJoin algorithm. Since the indices utilized by the iTwigJoin and our algorithm are exactly the same, we can use heuristics to select the appropriate algorithm. The aim of this paper is to show that the prefix path streaming scheme algorithms can be efficient even for documents with many labeled paths.