Discriminative Phrase-Based Models for Arabic Machine Translation

  • Authors:
  • Cristina España-Bonet;Jesús Giménez;Lluís Màrquez

  • Affiliations:
  • TALP, Universitat Politècnica de Catalunya;TALP, Universitat Politècnica de Catalunya;TALP, Universitat Politècnica de Catalunya

  • Venue:
  • ACM Transactions on Asian Language Information Processing (TALIP)
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

A design for an Arabic-to-English translation system is presented. The core of the system implements a standard phrase-based statistical machine translation architecture, but it is extended by incorporating a local discriminative phrase selection model to address the semantic ambiguity of Arabic. Local classifiers are trained using linguistic information and context to translate a phrase, and this significantly increases the accuracy in phrase selection with respect to the most frequent translation traditionally considered. These classifiers are integrated into the translation system so that the global task gets benefits from the discriminative learning. As a result, we obtain significant improvements in the full translation task at the lexical, syntactic, and semantic levels as measured by an heterogeneous set of automatic evaluation metrics.