Index Structures for Matching XML Twigs Using Relational Query Processors

  • Authors:
  • Zhiyuan Chen;Johannes Gehrke;Flip Korn;Nick Koudas;Jayavel Shanmugasundaram;Divesh Srivastava

  • Affiliations:
  • University of Maryland at Baltimore County;Cornell University;AT&T Labs-Research;AT&T Labs-Research;Cornell University;AT&T Labs-Research

  • Venue:
  • ICDEW '05 Proceedings of the 21st International Conference on Data Engineering Workshops
  • Year:
  • 2005

Abstract

Various index structures have been proposed to speed up the evaluation of XML path expressions. However, existing XML path indices suffer from at least one of three limitations: they focus only on indexing the structure (relying on a separate index for node content), they are useful only for simple path expressions such as root-to-leaf paths, or they cannot be tightly integrated with a relational query processor. Moreover, there is no unified framework to compare these index structures. In this paper, we present a framework defining a family of index structures that includes most existing XML path indices. We also propose two novel index structures in this family, with different space-time tradeoffs, that are effective for the evaluation of XML branching path expressions (i.e., twigs) with value conditions. We also show how this family of index structures can be implemented using the access methods of the underlying database system. Finally, we present an experimental evaluation that shows the performance tradeoff between index space and matching time. The experimental results show that our novel indices achieve orders of magnitude improvement in performance for evaluating twig queries, albeit at a higher space cost, over the use of previously proposed XML path indices that can be tightly integrated with a relational query processor.
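
To make the notion of a twig with value conditions concrete, the sketch below may help. It is a toy illustration only and is not one of the index structures proposed in the paper. It assumes a simple index keyed on (root-to-node label path, text value) that stores the node ids along each path, and it matches a two-branch twig by intersecting ids at the branching node. The sample document and the names `build_path_value_index` and `match_twig` are hypothetical, and a Python dictionary stands in for what a relational system would keep as an indexed table.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical sample document, used only for this illustration.
DOC = """
<bib>
  <book><title>XML</title><year>2000</year></book>
  <book><title>XML</title><year>2005</year></book>
  <book><title>SQL</title><year>2000</year></book>
</bib>
"""


def build_path_value_index(root):
    """Map (label path, text value) -> list of node-id tuples along that path.

    Node ids are assigned in document (preorder) order, starting at 1.
    """
    index = defaultdict(list)
    counter = 0

    def visit(elem, labels, ids):
        nonlocal counter
        counter += 1
        labels = labels + (elem.tag,)
        ids = ids + (counter,)
        value = (elem.text or "").strip()
        if value:  # only nodes that carry a text value are indexed
            index[(labels, value)].append(ids)
        for child in elem:
            visit(child, labels, ids)

    visit(root, (), ())
    return index


def match_twig(index, branch_depth, branches):
    """Return ids of branching nodes that satisfy every (path, value) branch.

    branch_depth is the position of the branching node within each path.
    """
    result = None
    for path, value in branches:
        ids_here = {ids[branch_depth] for ids in index.get((path, value), [])}
        result = ids_here if result is None else result & ids_here
    return sorted(result or [])


root = ET.fromstring(DOC.strip())
index = build_path_value_index(root)

# Twig: /bib/book[title = 'XML'][year = '2000'], branching at "book".
matches = match_twig(
    index,
    branch_depth=1,
    branches=[
        (("bib", "book", "title"), "XML"),
        (("bib", "book", "year"), "2000"),
    ],
)
print(matches)
```

Each branch is answered with a single lookup that checks structure and value together, and the branches are then combined with a join on the branching node's id. Storing ids for every node along a path costs extra space, which is the kind of space-time tradeoff the abstract refers to.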