A Syntactic-based Word Re-ordering for English-Vietnamese Statistical Machine Translation System

  • Authors:
  • Hong-Nhung Nguyen Thi;Dien Dinh

  • Affiliations:
  • Faculty of Information Technology, University of Science - National University of Ho Chi Minh City, Ho Chi Minh City, Vietnam;Faculty of Information Technology, University of Science - National University of Ho Chi Minh City, Ho Chi Minh City, Vietnam

  • Venue:
  • PRICAI '08 Proceedings of the 10th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In machine translation, the re-ordering of word from source to target language is one of the major steps that affect mainly the performance of the system. Among many approaches for this type of problem, syntactic is an effective method for handling word-order in a statistical machine translation (SMT) system. In this paper, we introduce a word re-ordering approach that makes use the syntactic rules extracted from parse tree for the English-Vietnamese SMT system. Our word re-ordering rule set includes rules in noun phrase, verb phrase and adjective phrase. According to the experiment result, the noun phrase rules are the most significant rules of all. Compared with the MOSES phrase-based SMT system [1], these rules can improve BLEU score of 3.24 on our testing corpus. Moreover, we also conduct other experiments by using different combinations of rules to study their effectiveness. And we find that the translation performance for each corpus can be tuned by different ways of combination.