Reordering modeling using weighted alignment matrices

  • Authors:
  • Wang Ling;Tiago Luís;João Graça;Luísa Coheur;Isabel Trancoso

  • Affiliations:
  • L2F Spoken Systems Lab, INESC-ID Lisboa;L2F Spoken Systems Lab, INESC-ID Lisboa;L2F Spoken Systems Lab, INESC-ID Lisboa;L2F Spoken Systems Lab, INESC-ID Lisboa;L2F Spoken Systems Lab, INESC-ID Lisboa

  • Venue:
  • HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
  • Year:
  • 2011

Abstract

In most statistical machine translation systems, the phrase/rule extraction algorithm uses alignments in 1-best form, which may contain spurious alignment points. Using weighted alignment matrices that encode all possible alignments has been shown to generate better phrase tables for phrase-based systems. We propose two algorithms to generate the well-known MSD (monotone, swap, discontinuous) reordering model using weighted alignment matrices. Experiments on the IWSLT 2010 evaluation datasets for two language pairs with different alignment algorithms show that our methods produce more accurate reordering models, as shown by gains over the regular MSD models of 0.4 BLEU points on the BTEC French-to-English test set and 1.5 BLEU points on the DIALOG Chinese-to-English test set.
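
To make the contrast between 1-best alignments and weighted alignment matrices concrete, here is a minimal sketch. The abstract does not describe the paper's two algorithms, so none of this should be read as the authors' method. `msd_hard` follows the usual word-based MSD definition (monotone if the cell to the top left of the phrase pair is aligned, swap if the cell to the top right is, discontinuous otherwise). `msd_soft` is a hypothetical soft variant that treats link probabilities as independent, which is an assumption made for illustration. All function and variable names are invented here, and sentence-boundary handling is omitted.

```python
from typing import Dict, Set, Tuple

Link = Tuple[int, int]  # (source index, target index)


def msd_hard(links: Set[Link], src_start: int, src_end: int, tgt_start: int) -> str:
    """Word-based MSD orientation from a 1-best alignment (a set of links)."""
    if (src_start - 1, tgt_start - 1) in links:  # top-left neighbour aligned
        return "M"
    if (src_end + 1, tgt_start - 1) in links:    # top-right neighbour aligned
        return "S"
    return "D"


def msd_soft(matrix: Dict[Link, float], src_start: int, src_end: int,
             tgt_start: int) -> Dict[str, float]:
    """Hypothetical soft MSD distribution from link probabilities.

    Assumption (not from the paper): links are independent, and swap is
    only considered when the monotone link is absent.
    """
    p_top_left = matrix.get((src_start - 1, tgt_start - 1), 0.0)
    p_top_right = matrix.get((src_end + 1, tgt_start - 1), 0.0)
    p_mono = p_top_left
    p_swap = (1.0 - p_top_left) * p_top_right
    p_disc = 1.0 - p_mono - p_swap
    return {"M": p_mono, "S": p_swap, "D": p_disc}
```

With a 1-best alignment, a single spurious link can flip the orientation outright; the soft variant instead spreads fractional counts over the three orientations, which is the general motivation for working from the full matrix.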