The Simplest XML Retrieval Baseline That Could Possibly Work

  • Authors:
  • Philipp Dopichaj

  • Affiliations:
  • University of Kaiserslautern, Kaiserslautern, Germany 67663

  • Venue:
  • Focused Access to XML Documents
  • Year:
  • 2008

Abstract

Five years of INEX have produced many competing XML element retrieval methods that make use of the document structure. So far, no clearly best method has been identified, and there is not even clear evidence of which parts of the document structure can be used to improve retrieval quality. Little research has been done on simply using standard information retrieval techniques for XML retrieval. This paper aims to address that gap; it contains a detailed analysis of the BM25 similarity measure in this context, revealing that it can form a viable baseline method.
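
For readers unfamiliar with BM25, the following is a minimal sketch of the standard Okapi BM25 scoring function applied to a small set of text units. It is not taken from the paper: the abstract does not state which BM25 variant or parameter values were analyzed, so the smoothed idf form, the defaults k1 = 1.2 and b = 0.75, the function name bm25_scores, and the sample element texts are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.2, b=0.75):
    """Score each tokenized text in `docs` against `query` with Okapi BM25.

    k1 and b are common default values, assumed here for illustration.
    """
    n = len(docs)
    avg_len = sum(len(d) for d in docs) / n
    # Document frequency: number of texts that contain each term.
    df = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            # Smoothed idf variant (assumed) that stays non-negative.
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            # Term frequency saturation with length normalization.
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


# Hypothetical element texts, e.g. the flattened content of XML elements.
elements = [
    "xml retrieval with bm25".split(),
    "standard information retrieval techniques".split(),
    "document structure and element retrieval".split(),
]
scores = bm25_scores("xml retrieval".split(), elements)
```

In an element retrieval setting, each entry in the input list would correspond to the text of one XML element rather than a whole document. How element length and collection statistics should be computed in that case is a design decision that this sketch leaves open.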