WebContent: efficient P2P Warehousing of web data

  • Authors:
  • S. Abiteboul; T. Allard; P. Chatalic; G. Gardarin; A. Ghitescu; F. Goasdoué; I. Manolescu; B. Nguyen; M. Ouazara; A. Somani; N. Travers; G. Vasile; S. Zoupanos

  • Affiliations:
  • Univ. Paris-Sud, France; Univ. Versailles-Saint-Quentin, France; Univ. Paris-Sud, France; Univ. Versailles-Saint-Quentin, France; Univ. Paris-Sud, France; Univ. Paris-Sud, France; Univ. Paris-Sud, France; Univ. Versailles-Saint-Quentin, France; Univ. Paris-Sud, France; Univ. Paris-Sud, France and IIT Bombay, India; CNAM, France; Univ. Paris-Sud, France; Univ. Paris-Sud, France

  • Venue:
  • Proceedings of the VLDB Endowment
  • Year:
  • 2008

Abstract

We present the WebContent platform for managing distributed repositories of XML and semantic Web data. The platform integrates various data processing building blocks (crawling, translation, semantic annotation, full-text search, structured XML querying, and semantic querying), exposed as Web services, into a large-scale, efficient system. Calls to these services are combined inside ActiveXML [8] documents, which are XML documents that include service calls. An ActiveXML optimizer is used to: (i) efficiently distribute computations among sites; (ii) perform XQuery-specific optimizations by leveraging an algebraic XQuery optimizer; and (iii) given an XML query, choose, among several distributed indices, the most appropriate one for answering the query.
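
To make the idea of "XML documents including service calls" concrete, here is a minimal Python sketch, not the WebContent or ActiveXML implementation. It assumes a hypothetical `<call>` element with a `service` attribute and a local `SERVICES` registry standing in for remote Web services; the real ActiveXML syntax and its optimizer are not reproduced here. The sketch only shows how embedded calls could be replaced by their results.

```python
import xml.etree.ElementTree as ET


def full_text_search(params):
    """Hypothetical stand-in for a remote full-text search Web service."""
    hit = ET.Element("hit")
    hit.text = f"document mentioning {params['query']}"
    return [hit]


# Hypothetical registry mapping service names to local callables.
SERVICES = {"fullTextSearch": full_text_search}

# A document mixing ordinary XML with an embedded service call.
# The <call> element and its attributes are assumptions for illustration.
DOC = """
<report>
  <title>XML warehousing</title>
  <call service="fullTextSearch" query="P2P"/>
</report>
"""


def materialize(element, services):
    """Recursively replace each <call> child with its service's results."""
    new_children = []
    for child in list(element):
        if child.tag == "call":
            handler = services[child.get("service")]
            # Pass the call's attributes to the service as parameters.
            new_children.extend(handler(dict(child.attrib)))
        else:
            materialize(child, services)
            new_children.append(child)
    element[:] = new_children
    return element


if __name__ == "__main__":
    root = materialize(ET.fromstring(DOC), SERVICES)
    print(ET.tostring(root, encoding="unicode"))
```

In this sketch every call is evaluated eagerly and locally. An optimizer of the kind the abstract describes would instead decide where and when each call is evaluated.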