Searching for alignments in SMT. A novel approach based on an estimation of distribution algorithm

  • Authors:
  • Luis Rodríguez;Ismael García-Varea;José A. Gàmez

  • Affiliations:
  • Universidad de Castilla-La Mancha;Universidad de Castilla-La Mancha;Universidad de Castilla-La Mancha

  • Venue:
  • StatMT '06 Proceedings of the Workshop on Statistical Machine Translation
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

In statistical machine translation, an alignment defines a mapping between the words in the source and in the target sentence. Alignments are used, on the one hand, to train the statistical models and, on the other, during the decoding process to link the words in the source sentence to the words in the partial hypotheses generated. In both cases, the quality of the alignments is crucial for the success of the translation process. In this paper, we propose an algorithm based on an Estimation of Distribution Algorithm for computing alignments between two sentences in a parallel corpus. This algorithm has been tested on different tasks involving different pair of languages. In the different experiments presented here for the two word-alignment shared tasks proposed in the HLT-NAACL 2003 and in the ACL 2005, the EDA-based algorithm outperforms the best participant systems.