Efficient Creation and Incremental Maintenance of the HOPI Index for Complex XML Document Collections

  • Authors:
  • Ralf Schenkel;Anja Theobald;Gerhard Weikum

  • Affiliations:
  • Max-Planck-Institut für Informatik;Max-Planck-Institut für Informatik;Max-Planck-Institut für Informatik

  • Venue:
  • ICDE '05 Proceedings of the 21st International Conference on Data Engineering
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

The HOPI index, a connection index for XML documents based on the concept of a 2-hop cover, provides space- and time-efficient reachability tests along the ancestor, descendant, and link axes to support path expressions with wildcards in XML search engines. This paper presents enhanced algorithms for building HOPI, shows how to augment the index with distance information, and discusses incremental index maintenance. Our experiments show substantial improvements over the existing divide-and-conquer algorithm for index creation, low space overhead for including distance information in the index, and efficient updates.