Enabling XPath Optional Axes Cardinality Estimation Using Path Synopses

  • Authors:
  • Yury Soldak;Maxim Lukichev

  • Affiliations:
  • Department of Computer Science, University of Saint-Petersburg, Russian Federation;Department of Computer Science, University of Saint-Petersburg, Russian Federation

  • Venue:
  • ADBIS '08 Proceedings of the 12th East European conference on Advances in Databases and Information Systems
  • Year:
  • 2008

Abstract

Effective support for XML query languages is becoming increasingly important with the emergence of new applications that access large volumes of XML data. Efficient query execution, especially in the distributed case, requires estimating the cardinalities of path expressions. In this paper, we propose two novel techniques for estimating the cardinality of simple path expressions with optional axes (following/preceding): document order grouping (DG) and neighborhood grouping (NG). Both techniques summarize the structure of the source XML data in compact graph structures (path synopses) and use these summaries for cardinality estimation. We experimentally evaluated the accuracy of the techniques and the size of the summaries, and studied the performance of the prototypes. A wide range of source data was used in order to study the behavior of the structures and the area of application of the techniques.
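
To make the notion of a path synopsis concrete, the following is a minimal sketch of a plain path-count summary for child-axis paths only. It is not the DG or NG technique from the paper, and it does not handle the following/preceding axes that those techniques target. The function names `build_path_synopsis` and `estimate_cardinality` and the sample document are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def build_path_synopsis(xml_text):
    """Summarize a document as counts per root-to-node label path.

    Hypothetical helper: a basic path-count summary, not the DG/NG
    structures proposed in the paper.
    """
    synopsis = Counter()

    def walk(node, prefix):
        # Each element contributes one occurrence to its own label path.
        path = prefix + "/" + node.tag
        synopsis[path] += 1
        for child in node:
            walk(child, path)

    walk(ET.fromstring(xml_text), "")
    return synopsis


def estimate_cardinality(synopsis, path):
    """Look up the number of elements reached by a simple child-axis path."""
    return synopsis.get(path, 0)


# Illustrative document (an assumption, not data from the paper).
doc = "<lib><book><title/><author/><author/></book><book><title/></book></lib>"
synopsis = build_path_synopsis(doc)
print(estimate_cardinality(synopsis, "/lib/book/author"))
```

A summary like this discards document order, which is presumably why axes such as following and preceding need additional grouping information beyond per-path counts.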