A compact XML storage scheme supporting efficient path querying

  • Authors:
  • Xiangyu Hu;Haiwei Zhang;Xiaojie Yuan

  • Affiliations:
  • Department of Computer Science, Nankai University, Tianjin, China;Department of Computer Science, Nankai University, Tianjin, China;Department of Computer Science, Nankai University, Tianjin, China

  • Venue:
  • APWeb'12 Proceedings of the 14th Asia-Pacific international conference on Web Technologies and Applications
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

XML is becoming the de facto standard to store, exchange and publish information over the web, the need to develop efficient techniques for storing and querying XML documents has emerged. Two challenges in handling XML are its inherent verbosity and the complexity of its structure. A number of labeling schemes have been proposed to support fast XPath query processing, though they require even more storage cost. Compaction of XML documents has become an increasing important research issue. In this paper, we propose a new XML storage scheme called RRZip. In our approach the structure and content data of an XML document are stored separately: relative region labeling scheme are used for both compacting and querying the structure while the content data can be compressed by general-purpose compressors. Experiments show that proposed storage scheme is space and time efficient.