An empirical study on retrieval models for different document genres: patents and newspaper articles

  • Authors:
  • Makoto Iwayama;Atsushi Fujii;Noriko Kando;Yuzo Marukawa

  • Affiliations:
  • Hitachi, Ltd., Kokubunji, Japan;University of Tsukuba, Tsukuba, Japan and CREST, Japan Science and Technology Corporation;National Institute of Informatics, Chiyoda-ku, Japan;National Institute of Informatics, Chiyoda-ku, Japan

  • Venue:
  • Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

Reflecting the rapid growth in the utilization of large test collections for information retrieval since the 1990s, extensive comparative experiments have been performed to explore the effectiveness of various retrieval models. However, most collections were intended for retrieving newspaper articles and technical abstracts. In this paper, we describe the process of producing a test collection for patent retrieval, the NTCIR-3 Patent Retrieval Collection, which includes two years of Japanese patent applications and 31 topics produced by professional patent searchers. We also report experimental results obtained by using this collection to re-examine the effectiveness of existing retrieval models in the context of patent retrieval. The relative superiority among existing retrieval models did not significantly differ depending on the document genre, that is, patents and newspaper articles. Issues related to patent retrieval are also discussed.