The Geometric Framework for Exact and Similarity Querying XML Data

  • Authors:
  • Michal Krátký;Jaroslav Pokorný;Tomáš Skopal;Václav Snásel

  • Affiliations:
  • -;-;-;-

  • Venue:
  • EurAsia-ICT '02 Proceedings of the First EurAsian Conference on Information and Communication Technology
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

Using the terminology usual in databases, it is possible to view XML as a language for data modeling. To retrieve XML data from XML databases, several query languages have been proposed. The common feature of such languages is the use of regular path expressions. They enable the user to navigate through arbitrary long paths in XML data. If we considered a path content as a vector of path elements, we would be able to model XML paths as points within a multidimensional vector space. This paper introduces a geometric framework for indexing and querying XML data conceived in this way. In consequence, we can use certain data structures for indexing multidimensional points (objects). We use the UB-tree for indexing the vector spaces and the M-tree for indexing the metric spaces. The data structures for indexing the vector spaces lead rather to exact matching queries while the structures for indexing the metric spaces allow us to provide the similarity queries.