Experimental Evaluation of Passage-Based Document Retrieval

  • Authors:
  • Affiliations:
  • Venue:
  • ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

Abstract: Retrieval of electronic documents is a fundamental component for intelligent access to the contents of documents. Although the history of its research is long, it is still not a trivial task, in particular, when we retrieve long documents with short queries. For the retrieval of long documents, a method called passage-based document retrieval has proven to be effective. In this paper, we experimentally show that the passage-based retrieval is also advantageous for dealing with short queries on condition that documents are long. We employ a passage-based method based on density distributions of query terms in documents, and compare it with three conventional methods: the vector space model, pseudo-feedback and latent semantic indexing.