Improving function word alignment with frequency and syntactic information

  • Authors:
  • Jingyi Zhang;Hai Zhao

  • Affiliations:
  • Microsoft Key Laboratory of Intelligent Computing and Intelligent System and Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China;Microsoft Key Laboratory of Intelligent Computing and Intelligent System and Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China

  • Venue:
  • IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

In statistical word alignment for machine translation, function words usually cause poor aligning performance because they do not have clear correspondence between different languages. This paper proposes a novel approach to improve word alignment by pruning alignments of function words from an existing alignment model with high precision and recall. Based on monolingual and bilingual frequency characteristics, a language-independent function word recognition algorithm is first proposed. Then a group of carefully defined syntactic structures combined with content word alignments are used for further function word alignment pruning. The experimental results show that the proposed approach improves both the quality of word alignment and the performance of statistical machine translation on Chinese-to-English, German-to-English and French-to-English language pairs.