Best entry points for structured document retrieval: part I: characteristics
Information Processing and Management: an International Journal - Special issue: Formal methods for information retrieval
ACM SIGIR Forum
Construction of a test collection for the focussed retrieval of structured documents
ECIR'03 Proceedings of the 25th European conference on IR research
User behaviour in the context of structured documents
ECIR'03 Proceedings of the 25th European conference on IR research
Sound and complete relevance assessment for XML retrieval
ACM Transactions on Information Systems (TOIS)
University of Waterloo at INEX2007: Adhoc and Link-the-Wiki Tracks
Focused Access to XML Documents
Wikipedia Ad Hoc Passage Retrieval and Wikipedia Document Linking
Focused Access to XML Documents
Hi-index | 0.00 |
In structured information retrieval, the aim is to exploit document structure to retrieve relevant components, allowing the user to go straight to the relevant material. This paper looks at the so-called best entry points (BEPs), which are intended to give the user the best starting point to access the relevant information in the document. We examine the relationship between BEPs and relevant components in the INEX 2006 ad hoc assessments. Our main findings are the following: First, although documents are short, assessors often choose the best entry point some distance from the start of the document. Second, many of the best entry points coincide with the first relevant character in relevant documents, showing a strong relation between the BEP and relevant text. Third, we find browsing BEPs in articles with a single relevant passages, and container BEPs or context BEPs in articles with more relevant passages.