Pivot language approach for phrase-based statistical machine translation

  • Authors:
  • Hua Wu;Haifeng Wang

  • Affiliations:
  • Toshiba (China) Research and Development Center, Beijing, China 100738;Toshiba (China) Research and Development Center, Beijing, China 100738

  • Venue:
  • Machine Translation
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper proposes a novel method for phrase-based statistical machine translation based on the use of a pivot language. To translate between languages L s and L t with limited bilingual resources, we bring in a third language, L p , called the pivot language. For the language pairs L s 驴 L p and L p 驴 L t , there exist large bilingual corpora. Using only L s 驴 L p and L p 驴 L t bilingual corpora, we can build a translation model for L s 驴 L t . The advantage of this method lies in the fact that we can perform translation between L s and L t even if there is no bilingual corpus available for this language pair. Using BLEU as a metric, our pivot language approach significantly outperforms the standard model trained on a small bilingual corpus. Moreover, with a small L s 驴 L t bilingual corpus available, our method can further improve translation quality by using the additional L s 驴 L p and L p 驴 L t bilingual corpora.