Bulk data in main memory-based XQuery evaluation

  • Authors:
  • Stefanie Scherzinger

  • Affiliations:
  • Saarland University Database Group, Saarbrücken, Germany

  • Venue:
  • XIME-P '07 Proceedings of the 4th international workshop on XQuery implementation, experience and perspectives
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

XQuery processors that load the input into main memory suffer from huge memory demands. Yet for the evaluation of many queries, large parts of the input are actually irrelevant. In XML document projection, this data is recognized and not loaded in the first place. However, there are also queries where little can be gained by projection. We have observed that these queries tend to require large parts of the input only for generating output. This suggests that such "bulk" data may be stored and treated differently from data that is actually traversed in query evaluation. In this paper, we present a technique to recognize bulk data while loading XML documents for the evaluation of composition-free XQuery. Our approach is coupled with XML document projection, and utilizes a finite automaton that is expressly suited for matching path expressions. We show in an exploratory analysis that bulk data arises in practice, and discuss ongoing work along the line of bulk-bypassing in main memory-based XQuery engines.