Creating multilingual translation lexicons with regional variations using web corpora

  • Authors:
  • Pu-Jen Cheng;Yi-Cheng Pan;Wen-Hsiang Lu;Lee-Feng Chien

  • Affiliations:
  • Institute of Information Science, Taiwan;Institute of Information Science, Taiwan;National Cheng Kung Univ., Taiwan;Institute of Information Science, Taiwan and National Taiwan University, Taiwan

  • Venue:
  • ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

The purpose of this paper is to automatically create multilingual translation lexicons with regional variations. We propose a transitive translation approach to determine translation variations across languages that have insufficient corpora for translation via the mining of bilingual search-result pages and clues of geographic information obtained from Web search engines. The experimental results have shown the feasibility of the proposed approach in efficiently generating translation equivalents of various terms not covered by general translation dictionaries. It also revealed that the created translation lexicons can reflect different cultural aspects across regions such as Taiwan, Hong Kong and mainland China.