Review of hypothesis alignment algorithms for MT system combination via confusion network decoding

  • Authors:
  • Antti-Veikko I. Rosti; Xiaodong He; Damianos Karakos; Gregor Leusch; Yuan Cao; Markus Freitag; Spyros Matsoukas; Hermann Ney; Jason R. Smith; Bing Zhang

  • Affiliations:
  • Apple Inc., Cupertino, CA; Microsoft Research, Redmond, WA; Johns Hopkins University, Baltimore, MD; SAIC, Monheimsallee, Aachen, Germany; Johns Hopkins University, Baltimore, MD; RWTH Aachen University, Aachen, Germany; Raytheon BBN Technologies, Cambridge, MA; RWTH Aachen University, Aachen, Germany; Johns Hopkins University, Baltimore, MD; Raytheon BBN Technologies, Cambridge, MA

  • Venue:
  • WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
  • Year:
  • 2012

Abstract

Confusion network decoding has proven to be one of the most successful approaches to machine translation system combination. The hypothesis alignment algorithm is a crucial part of building the confusion networks, and many alternatives have been proposed in the literature. This paper describes a systematic comparison of five well-known hypothesis alignment algorithms for MT system combination via confusion network decoding. Controlled experiments using identical pre-processing, decoding, and weight tuning methods on standard system combination evaluation sets are presented. Translation quality is assessed using case-insensitive BLEU scores, and bootstrapping is used to establish the statistical significance of the score differences. All aligners yield significant BLEU score gains over the best individual system included in the combination. The incremental indirect hidden Markov model and a novel incremental inversion transduction grammar with flexible matching consistently yield the best translation quality, though with all else held equal, the differences between aligners are relatively small.
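
The abstract does not say which bootstrap procedure was used. As an illustration only, the sketch below shows one common way to test whether a score difference between two systems is significant: paired bootstrap resampling over test-set sentences. The function names, the `(matched, total)` per-sentence statistics, and the use of plain unigram precision as a stand-in for BLEU are all assumptions made for this example, not details taken from the paper.

```python
import random


def corpus_score(stats):
    """Corpus-level score from per-sentence (matched, total) counts.

    Assumption: plain unigram precision stands in for BLEU here,
    purely to keep the sketch short.
    """
    matched = sum(m for m, _ in stats)
    total = sum(t for _, t in stats)
    return matched / total if total else 0.0


def paired_bootstrap(stats_a, stats_b, n_samples=1000, seed=0):
    """Return the fraction of resampled test sets on which A outscores B.

    Both systems are scored on the same resampled sentence indices,
    which is what makes the test "paired".
    """
    if len(stats_a) != len(stats_b):
        raise ValueError("both systems must cover the same sentences")
    rng = random.Random(seed)
    n = len(stats_a)
    wins = 0
    for _ in range(n_samples):
        # Draw n sentence indices with replacement.
        idx = [rng.randrange(n) for _ in range(n)]
        score_a = corpus_score([stats_a[i] for i in idx])
        score_b = corpus_score([stats_b[i] for i in idx])
        if score_a > score_b:
            wins += 1
    return wins / n_samples
```

A win fraction near 1.0 suggests that system A's advantage is unlikely to be an artifact of the particular test sentences, while a value near 0.5 suggests the two systems are not clearly distinguishable on this set.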