An Ngram-based reordering model

  • Authors:
  • Marta R. Costa-jussí;José A. R. Fonollosa

  • Affiliations:
  • Universitat Politècnica de Catalunya TALP Research Center, Department of Signal Theory and Communications, Campus Nord, Barcelona 08034, Spain;Universitat Politècnica de Catalunya TALP Research Center, Department of Signal Theory and Communications, Campus Nord, Barcelona 08034, Spain

  • Venue:
  • Computer Speech and Language
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes in detail a novel approach to the reordering challenge in statistical machine translation (SMT). This Ngram-based reordering (NbR) approach uses the powerful techniques of SMT systems to generate a weighted reordering graph. Thus, statistical criteria reordering constraints are supplied to an SMT system, and this allows an extension to the SMT decoding search. The NbR approach is capable of generalizing reorderings that have been learned during training, through the use of word classes instead of words themselves. Improvement in translation performance is demonstrated with the EPPS task (Spanish and German to English) and the BTEC task (Arabic to English).