Word alignment of English-Chinese bilingual corpus based on chunks

  • Authors:
  • Le Sun; Youbing Jin; Lin Du; Yufang Sun

  • Affiliations:
  • Chinese Academy of Sciences, Beijing, P.R. China; Chinese Academy of Sciences, Beijing, P.R. China; Chinese Academy of Sciences, Beijing, P.R. China; Chinese Academy of Sciences, Beijing, P.R. China

  • Venue:
  • EMNLP '00 Proceedings of the 2000 Joint SIGDAT conference on Empirical methods in natural language processing and very large corpora: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 13
  • Year:
  • 2000

Abstract

This paper proposes a chunk-based method for word alignment of an English-Chinese bilingual corpus. First, the chunks in the English sentences are identified. The chunk boundaries in the Chinese sentences are then predicted from the translations of the English chunks together with heuristic information. Ambiguities in the Chinese chunk boundaries are resolved using the coterminous words in the English chunks. Given the resulting chunk-aligned bilingual corpus, a translation relation probability is introduced to align words. Finally, we evaluate the system on a real corpus and present the experimental results.
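
The final step, aligning words inside already-aligned chunks, can be illustrated with a minimal sketch. The abstract does not define the translation relation probability, so the sketch substitutes the Dice coefficient over chunk-level co-occurrence as a stand-in association score; it is not the authors' formula. The function names `train_dice` and `align_chunk` and the toy data are hypothetical, chosen only for illustration.

```python
from collections import Counter
from itertools import product


def train_dice(chunk_pairs):
    """Estimate a Dice association score for every (English, Chinese) word pair.

    chunk_pairs: list of (english_words, chinese_words), one per aligned chunk.
    NOTE: Dice is an assumed stand-in, not the paper's translation relation
    probability.
    """
    en_count, zh_count, pair_count = Counter(), Counter(), Counter()
    for en_words, zh_words in chunk_pairs:
        en_set, zh_set = set(en_words), set(zh_words)
        en_count.update(en_set)                      # chunks containing each English word
        zh_count.update(zh_set)                      # chunks containing each Chinese word
        pair_count.update(product(en_set, zh_set))   # chunks containing both words
    return {
        (e, z): 2.0 * c / (en_count[e] + zh_count[z])
        for (e, z), c in pair_count.items()
    }


def align_chunk(en_words, zh_words, scores):
    """Link each English word to its best-scoring Chinese word in the same chunk."""
    links = []
    for e in en_words:
        best = max(zh_words, key=lambda z: scores.get((e, z), 0.0), default=None)
        # Skip words with no candidate or no observed co-occurrence.
        if best is not None and scores.get((e, best), 0.0) > 0.0:
            links.append((e, best))
    return links


# Toy chunk-aligned data (hypothetical, for illustration only).
toy_chunks = [
    (["the", "book"], ["书"]),
    (["the", "red", "book"], ["红", "书"]),
    (["red", "car"], ["红", "车"]),
    (["the", "car"], ["车"]),
]
toy_scores = train_dice(toy_chunks)
toy_links = align_chunk(["red", "book"], ["红", "书"], toy_scores)
```

Restricting candidates to the aligned chunk is the point of the design: each English word competes only against the few Chinese words in its own chunk rather than against the whole sentence, which shrinks the search space for any association score that might be plugged in.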