Integrating cross-lingually relevant news articles and monolingual web documents in bilingual lexicon acquisition

  • Authors:
  • Takehito Utsuro;Kohei Hino;Mitsuhiro Kida;Seiichi Nakagawa;Satoshi Sato

  • Affiliations:
  • Kyoto University, Kyoto, Japan;Toyohashi University of Technology, Toyohashi, Japan;Kyoto University, Kyoto, Japan;Toyohashi University of Technology, Toyohashi, Japan;Kyoto University, Kyoto, Japan

  • Venue:
  • COLING '04 Proceedings of the 20th international conference on Computational Linguistics
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

In the framework of bilingual lexicon acquisition from cross-lingually relevant news articles on the Web, it is relatively harder to reliably estimate bilingual term correspondences for low frequency terms. Considering such a situation, this paper proposes to complementarily use much larger monolingual Web documents collected by search engines, as a resource for reliably re-estimating bilingual term correspondences. We experimentally show that, using a sufficient number of monolingual Web documents, it is quite possible to have reliable estimate of bilingual term correspondences for those low frequency terms.