NTCIR-2 as a Rosetta stone in laboratory experiments of IR systems

  • Authors: Sumio Fujita
  • Affiliations: PATOLIS Corporation, Sumitomo Fudosan Kiba Building, 2-4-29, Shiohama, Koto-Ku, Tokyo, 135-0043, Japan
  • Venue: Information Processing and Management: an International Journal - Special issue: Cross-language information retrieval
  • Year: 2005

Abstract

This paper presents a laboratory-based evaluation study of cross-language information retrieval technologies. It uses the partially parallel test collections of NTCIR-2 (used together with NTCIR-1), for which Japanese-English parallel document collections, parallel topic sets, and their relevance judgments are available. These resources enable us to observe and compare monolingual retrieval processes in the two languages as well as retrieval across languages. Our experiments focused on (1) the Rosetta stone question, that is, whether a partially parallel collection helps cross-language information access, and (2) two aspects of retrieval difficulty, namely "collection discrepancy" and "query discrepancy". Japanese and English monolingual retrieval systems are combined through dictionary-based query translation modules to implement a symmetrical bilingual evaluation environment.