A framework for using materialized XPath views in XML query processing

  • Authors:
  • Andrey Balmin;Fatma Özcan;Kevin S. Beyer;Roberta J. Cochrane;Hamid Pirahesh

  • Affiliations:
  • IBM Almaden Research Center, San Jose CA;IBM Almaden Research Center, San Jose CA;IBM Almaden Research Center, San Jose CA;IBM Almaden Research Center, San Jose CA;IBM Almaden Research Center, San Jose CA

  • Venue:
  • VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

XML languages, such as XQuery, XSLT and SQL/XML, employ XPath as the search and extraction language. XPath expressions often define complicated navigation, resulting in expensive query processing, especially when executed over large collections of documents. In this paper, we propose a framework for exploiting materialized XPath views to expedite processing of XML queries. We explore a class of materialized XPath views, which may contain XML fragments, typed data values, full paths, node references or any combination thereof. We develop an XPath matching algorithm to determine when such views can be used to answer a user query containing XPath expressions. We use the match information to identify the portion of an XPath expression in the user query which is not covered by the XPath view. Finally, we construct, possibly multiple, compensation expressions which need to be applied to the view to produce the query result. Experimental evaluation, using our prototype implementation, shows that the matching algorithm is very efficient and usually accounts for a small fraction of the total query compilation time.