Query term disambiguation for Web cross-language information retrieval using a search engine

  • Authors:
  • Akira Maeda;Fatiha Sadat;Masatoshi Yoshikawa;Shunsuke Uemura

  • Affiliations:
  • Graduate School of Information Science, Nara Institute of Science and Technology (NAIST), Japan;Graduate School of Information Science, Nara Institute of Science and Technology (NAIST), Japan;Graduate School of Information Science, Nara Institute of Science and Technology (NAIST), Japan;Graduate School of Information Science, Nara Institute of Science and Technology (NAIST), Japan

  • Venue:
  • IRAL '00 Proceedings of the fifth international workshop on Information retrieval with Asian languages
  • Year:
  • 2000

Abstract

With the worldwide growth of the Internet, research on Cross-Language Information Retrieval (CLIR) is receiving considerable attention. Existing CLIR approaches based on query translation require parallel or comparable corpora to disambiguate translated query terms. However, such language resources are not readily available. In this paper, we propose a disambiguation method for dictionary-based query translation that does not depend on such scarce language resources, yet achieves adequate retrieval effectiveness by using Web documents as a corpus and exploiting co-occurrence information between terms within that corpus. In our experiments, the method achieved 97% of the average precision obtained with manual translation.
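
The abstract does not spell out how co-occurrence information is scored, so the following is only a minimal sketch of the general idea, not the authors' actual method. It assumes a Dice-style association measure and replaces the search engine with a hypothetical `toy_hit_count` function over a handful of invented documents. All names, data, and the scoring formula are illustrative assumptions.

```python
from itertools import combinations, product
from typing import Callable, Sequence

# Invented toy documents standing in for Web pages (assumption for illustration).
TOY_DOCS = [
    "bank interest rate loan",
    "bank loan interest money",
    "river shore water",
    "shore river fishing",
    "interest hobby fishing",
]


def toy_hit_count(terms: Sequence[str]) -> int:
    """Count toy documents containing every term.

    Hypothetical stand-in for the hit count a search engine would report.
    """
    return sum(all(t in doc.split() for t in terms) for doc in TOY_DOCS)


def cooccurrence_score(a: str, b: str,
                       hit_count: Callable[[Sequence[str]], int]) -> float:
    """Dice-style association between two terms (an assumed measure)."""
    denom = hit_count([a]) + hit_count([b])
    return 2 * hit_count([a, b]) / denom if denom else 0.0


def disambiguate(candidates: Sequence[Sequence[str]],
                 hit_count: Callable[[Sequence[str]], int] = toy_hit_count):
    """Pick one dictionary translation per query term.

    Chooses the combination whose pairwise co-occurrence scores sum highest.
    """
    def total(combo):
        return sum(cooccurrence_score(a, b, hit_count)
                   for a, b in combinations(combo, 2))

    return max(product(*candidates), key=total)


# Each inner list holds the dictionary translations of one source query term.
best = disambiguate([["bank", "shore"], ["interest", "hobby"]])
print(best)
```

In this toy setting, "bank" and "interest" co-occur in two documents while the other pairings never co-occur, so that combination is selected. Enumerating every combination grows exponentially with query length; a real system would likely need a greedy or pruned search.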