Extraction of translation unit from Chinese-English parallel corpora

  • Authors:
  • Chang Baobao;Pernilla Danielsson;Wolfgang Teubert

  • Affiliations:
  • Peking University, Beijing, P. R. China;Birmingham University, Birmingham, United Kingdom;Birmingham University, Birmingham, United Kingdom

  • Venue:
  • SIGHAN '02 Proceedings of the first SIGHAN workshop on Chinese language processing - Volume 18
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

More and more researchers have recognized the potential value of the parallel corpus in the research on Machine Translation and Machine Aided Translation. This paper examines how Chinese English translation units could be extracted from parallel corpus. An iterative algorithm based on degree of word association is proposed to identify the multiword units for Chinese and English. Then the Chinese-English Translation Equivalent Pairs are extracted from the parallel corpus. We also made comparison between different statistical association measurement in this paper.