Efficient support for ordered XPath processing in tree-unaware commercial relational databases

  • Authors:
  • Boon-Siew Seah;Klarinda G. Widjanarko;Sourav S. Bhowmick;Byron Choi;Erwin Leonardi

  • Affiliations:
  • School of Computer Engineering, Nanyang Technological University, Singapore and Singapore-MIT Alliance, Nanyang Technological University, Singapore;School of Computer Engineering, Nanyang Technological University, Singapore and Singapore-MIT Alliance, Nanyang Technological University, Singapore;School of Computer Engineering, Nanyang Technological University, Singapore and Singapore-MIT Alliance, Nanyang Technological University, Singapore;School of Computer Engineering, Nanyang Technological University, Singapore;School of Computer Engineering, Nanyang Technological University, Singapore and Singapore-MIT Alliance, Nanyang Technological University, Singapore

  • Venue:
  • DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
  • Year:
  • 2007

Abstract

In this paper, we present a novel approach to ordered XPath evaluation in tree-unaware RDBMS. The novelty of our approach lies in the following. (a) We propose a novel XML storage scheme that comprises only leaf nodes, their corresponding data values, order encodings, and their root-to-leaf paths. (b) We propose an algorithm for mapping ordered XPath queries into SQL queries over the storage scheme. (c) We propose an optimization technique that forces all mapped SQL queries to be evaluated in a "left-to-right" join order. By employing these techniques, we show, through comprehensive experiments, that our approach not only scales well but also performs better than some representative tree-unaware approaches on more than 65% of our benchmark queries, with the highest observed gain factor being 1939. In addition, our approach significantly reduces the performance gap between tree-aware and tree-unaware approaches and even outperforms a state-of-the-art tree-aware approach for certain benchmark queries.
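To make the idea of a leaf-only storage scheme concrete, the following minimal sketch shreds a small XML document into one relational row per leaf element and answers a path query in SQL. It is an illustration under stated assumptions, not the scheme from the paper: the table name `leaf`, the columns `leaf_order`, `path`, and `value`, the sample document, and the use of a single document-order integer as the "order encoding" are all hypothetical, since the abstract does not specify the actual schema or encodings. Mixed content (text on elements that also have children) is ignored for simplicity.

```python
import sqlite3
import xml.etree.ElementTree as ET


def shred_leaves(xml_text):
    """Return (leaf_order, path, value) rows, one per leaf element, in document order.

    Assumption: a plain document-order counter stands in for the order encoding.
    """
    root = ET.fromstring(xml_text)
    rows = []

    def walk(node, parent_path):
        path = f"{parent_path}/{node.tag}"
        children = list(node)
        if not children:
            # Leaf element: record its position, root-to-leaf path, and text value.
            rows.append((len(rows) + 1, path, (node.text or "").strip()))
            return
        for child in children:
            walk(child, path)

    walk(root, "")
    return rows


def load(rows):
    """Load the shredded rows into an in-memory SQLite table (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE leaf (leaf_order INTEGER PRIMARY KEY, path TEXT, value TEXT)"
    )
    conn.executemany("INSERT INTO leaf VALUES (?, ?, ?)", rows)
    return conn


# Hypothetical sample document.
DOC = (
    "<catalog>"
    "<book><title>A</title><price>10</price></book>"
    "<book><title>B</title><price>20</price></book>"
    "</catalog>"
)

ROWS = shred_leaves(DOC)
CONN = load(ROWS)

# Hand-written SQL for the XPath /catalog/book/title, results in document order.
TITLES = [
    value
    for (value,) in CONN.execute(
        "SELECT value FROM leaf WHERE path = ? ORDER BY leaf_order",
        ("/catalog/book/title",),
    )
]

# An order-sensitive variant: titles that occur after the first price leaf.
LATER_TITLES = [
    value
    for (value,) in CONN.execute(
        "SELECT value FROM leaf WHERE path = ? AND leaf_order > "
        "(SELECT MIN(leaf_order) FROM leaf WHERE path = ?) ORDER BY leaf_order",
        ("/catalog/book/title", "/catalog/book/price"),
    )
]

print(ROWS)
print(TITLES, LATER_TITLES)
```

Because only leaves are stored, a simple path step becomes an equality test on the `path` column, and order-sensitive conditions become comparisons on the order column. A real system would need a richer encoding to recover ancestor and sibling relationships, which this sketch does not attempt.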