Selectivity-based XML query processing in structured peer-to-peer networks

  • Authors:
  • Carmela Comito;Domenico Talia;Paolo Trunfio

  • Affiliations:
  • DEIS, University of Calabria, Rende (CS), Italy;DEIS, University of Calabria, ICAR-CNR, Rende (CS), Italy;DEIS, University of Calabria, Rende (CS), Italy

  • Venue:
  • Proceedings of the Fourteenth International Database Engineering & Applications Symposium
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

DHT-based structured P2P systems have been proposed to index and retrieve many types of contents, including distributed collections of XML documents. During the query processing, a DHT can be used to efficiently identify all nodes storing relevant documents. Most of the DHT-based systems in literature break a complex XML tree pattern query into a set of simple path queries. Each of such queries is separately looked up in the P2P network to locate relevant nodes. The intersection of the results over each single path produces the set of nodes storing documents matching the original query. The network traffic generated by this strategy increases with the number of paths in the query. To overcome this issue, an alternative strategy consists in resolving only the sub-query associated to the most selective path; then all nodes in the result set are queried with the original query. The main goal of this paper is providing an analytical and experimental study of the two approaches to assess their relative merits in different scenarios. On the basis of this study, we also introduce an Adaptive Path Selection (APS) search technique which resolves a multi-path XML query by querying either the most selective path or the whole path set, based on the selectivity of the paths in that query. Experimental results confirm that APS saves a significant amount of network traffic as compared to the two strategies from which it derives.