Bilingual terminology acquisition from comparable corpora and phrasal translation to cross-language information retrieval

  • Authors:
  • Fatiha Sadat;Masatoshi Yoshikawa;Shunsuke Uemura

  • Affiliations:
  • Nara Institute of Science and Technology, Nara, Japan;Nagoya University, Nagoya, Japan;Nara Institute of Science and Technology, Nara, Japan

  • Venue:
  • ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 2
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

The present paper will seek to present an approach to bilingual lexicon extraction from non-aligned comparable corpora, phrasal translation as well as evaluations on Cross-Language Information Retrieval. A two-stages translation model is proposed for the acquisition of bilingual terminology from comparable corpora, disambiguation and selection of best translation alternatives according to their linguistics-based knowledge. Different rescoring techniques are proposed and evaluated in order to select best phrasal translation alternatives. Results demonstrate that the proposed translation model yields better translations and retrieval effectiveness could be achieved across Japanese-English language pair.