A Combined Statistical Query Term Disambiguation in Cross-Language Information Retrieval

  • Authors:
  • Fatiha Sadat;Akira Maeda;Masatoshi Yoshikawa;Shunsuke Uemura

  • Affiliations:
  • -;-;-;-

  • Venue:
  • DEXA '02 Proceedings of the 13th International Workshop on Database and Expert Systems Applications
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

The diversity of information sources and the explosive growth of the Internet worldwide are compelling evidence of a need for information retrieval that can cross language boundaries. Ambiguity from failure to translate queries is one of the major causes for large drops ineffectiveness below monolingual performance, for the dictionary-based method in Cross-Language Information Retrieval. In this paper, we focus on the query translation and disambiguation, to improve the effectiveness of an information retrieval and to dramatically reduce errors such an approach normally makes. A combined statistical disambiguation method both before and after translation is proposed, to avoid the problem of wrong selection of target translations. We tested the effectiveness of the proposed disambiguation method, by an application to French-English Information Retrieval. Evaluations using TREC data collection proved a great effectiveness of the proposed disambiguation method.