Term distillation in patent retrieval

  • Authors:
  • Hideo Itoh;Hiroko Mano;Yasushi Ogawa

  • Affiliations:
  • RICOH Co., Ltd., Bunkyo-ku, Tokyo, Japan;RICOH Co., Ltd., Bunkyo-ku, Tokyo, Japan;RICOH Co., Ltd., Bunkyo-ku, Tokyo, Japan

  • Venue:
  • PATENT '03 Proceedings of the ACL-2003 workshop on Patent corpus processing - Volume 20
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

In cross-database retrieval, the domain of queries differs from that of the retrieval target in the distribution of term occurrences. This causes incorrect term weighting in the retrieval system which assigns to each term a retrieval weight based on the distribution of term occurrences. To resolve the problem, we propose "term distillation", a framework for query term selection in cross-database retrieval. The experiments using the NTCIR-3 patent retrieval test collection demonstrate that term distillation is effective for cross-database retrieval.