Highly heterogeneous XML collections: how to retrieve precise results?

  • Authors: Ismael Sanz; Marco Mesiti; Giovanna Guerrini; Rafael Berlanga Llavori
  • Affiliations: Universitat Jaume I, Castellón, Spain; Università di Milano, Italy; Università di Genova, Italy; Universitat Jaume I, Castellón, Spain
  • Venue: FQAS'06, Proceedings of the 7th International Conference on Flexible Query Answering Systems
  • Year: 2006

Abstract

Highly heterogeneous XML collections are thematic collections whose documents use different structures: parent-child and ancestor-descendant relationships are not preserved across documents, and element names can differ in vocabulary. In this setting, current approaches return answers with low precision. We present an approach, based on similarity measures and semantic inverted indices, that improves the precision of query answers without compromising performance.
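
To make the two ingredients named in the abstract more concrete, the following is a minimal sketch and not the authors' method. It assumes a hypothetical synonym table (SYNONYMS) that maps element names to a canonical term, builds an inverted index from canonical terms to the documents and paths where they occur, and scores each document by the fraction of query terms it contains, ignoring hierarchy. All names, the sample documents, and the scoring rule are illustrative assumptions.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical synonym table: element name -> canonical term.
# A real system would presumably derive this from a thesaurus or ontology.
SYNONYMS = {"writer": "author", "volume": "book", "name": "title"}


def canonical(tag):
    """Map an element name to its canonical term (case-insensitive)."""
    tag = tag.lower()
    return SYNONYMS.get(tag, tag)


def build_index(docs):
    """Build an inverted index: canonical term -> list of (doc_id, path)."""
    index = defaultdict(list)

    def walk(doc_id, node, path):
        path = path + (canonical(node.tag),)
        index[path[-1]].append((doc_id, path))
        for child in node:
            walk(doc_id, child, path)

    for doc_id, xml_text in docs.items():
        walk(doc_id, ET.fromstring(xml_text), ())
    return index


def score(query_tags, index):
    """Score each document by the fraction of query terms it contains.

    Hierarchy is deliberately ignored, so documents that nest the same
    concepts in a different order still match.
    """
    wanted = {canonical(t) for t in query_tags}
    hits = defaultdict(set)
    for term in wanted:
        for doc_id, _path in index.get(term, []):
            hits[doc_id].add(term)
    return {doc_id: len(terms) / len(wanted) for doc_id, terms in hits.items()}


# Illustrative documents: d2 inverts the nesting and uses other element names.
docs = {
    "d1": "<book><title>XML</title><author>Ann</author></book>",
    "d2": "<writer><volume><name>XML</name></volume></writer>",
    "d3": "<article><title>Web</title></article>",
}
index = build_index(docs)
scores = score(["book", "title", "author"], index)
```

In this sketch d1 and d2 both receive the full score even though d2 reverses the nesting and uses different element names, while d3 matches only one of the three query terms. The stored paths are not used by the scoring rule here; they are kept so that a structure-aware similarity measure could be layered on top.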