Review of hypothesis alignment algorithms for MT system combination via confusion network decoding

Authors:
Antti-Veikko I. Rosti;Xiaodong He;Damianos Karakos;Gregor Leusch;Yuan Cao;Markus Freitag;Spyros Matsoukas;Hermann Ney;Jason R. Smith;Bing Zhang
Affiliations:
Apple Inc., Cupertino, CA;Microsoft Research, Redmond, WA;Johns Hopkins University, Baltimore, MD;SAIC, Monheimsallee, Aachen, Germany;Johns Hopkins University, Baltimore, MD;RWTH Aachen University, Aachen, Germany;Raytheon BBN Technologies, Cambridge, MA;RWTH Aachen University, Aachen, Germany;Johns Hopkins University, Baltimore, MD;Raytheon BBN Technologies, Cambridge, MA
Venue:
WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
Year:
2012

Abstract

Confusion network decoding has proven to be one of the most successful approaches to machine translation system combination. The hypothesis alignment algorithm is a crucial part of building the confusion networks, and many alternatives have been proposed in the literature. This paper describes a systematic comparison of five well-known hypothesis alignment algorithms for MT system combination via confusion network decoding. Controlled experiments using identical pre-processing, decoding, and weight tuning methods on standard system combination evaluation sets are presented. Translation quality is assessed using case-insensitive BLEU scores, and bootstrapping is used to establish the statistical significance of the score differences. All aligners yield significant BLEU score gains over the best individual system included in the combination. The incremental indirect hidden Markov model and a novel incremental inversion transduction grammar with flexible matching consistently yield the best translation quality, though, with all other things kept equal, the differences between aligners are relatively small.
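
To make the role of the hypothesis aligner concrete, here is a minimal Python sketch of the general idea behind confusion network decoding. It is not one of the five aligners compared in the paper: it substitutes a plain Levenshtein (edit-distance) alignment, takes the first hypothesis as the skeleton, gives every system one equal vote, and keeps at most one inserted word per gap. The names `align`, `build_network`, `decode`, and `EPS` are hypothetical and chosen only for this illustration.

```python
from collections import Counter

EPS = ""  # empty arc: the hypothesis contributes no word in this slot


def align(skeleton, hyp):
    """Levenshtein-align hyp to skeleton.

    Returns (skeleton_index or None, hyp_word or None) pairs in sentence order.
    """
    n, m = len(skeleton), len(hyp)
    # cost[i][j] = edit distance between skeleton[:i] and hyp[:j]
    cost = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0] = i
    for j in range(1, m + 1):
        cost[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = cost[i - 1][j - 1] + (skeleton[i - 1] != hyp[j - 1])
            cost[i][j] = min(sub, cost[i - 1][j] + 1, cost[i][j - 1] + 1)
    # Backtrace from the bottom-right corner, preferring match/substitution.
    pairs = []
    i, j = n, m
    while i > 0 or j > 0:
        if (i > 0 and j > 0 and
                cost[i][j] == cost[i - 1][j - 1] + (skeleton[i - 1] != hyp[j - 1])):
            pairs.append((i - 1, hyp[j - 1]))  # match or substitution
            i, j = i - 1, j - 1
        elif i > 0 and cost[i][j] == cost[i - 1][j] + 1:
            pairs.append((i - 1, None))        # skeleton word missing in hyp
            i -= 1
        else:
            pairs.append((None, hyp[j - 1]))   # extra word inserted by hyp
            j -= 1
    pairs.reverse()
    return pairs


def build_network(hyps):
    """Build a list of slots (word -> votes) around the first hypothesis."""
    skeleton = hyps[0]
    n = len(skeleton)
    # Odd slots hold skeleton positions; even slots hold insertions in the gaps.
    slots = [Counter() for _ in range(2 * n + 1)]
    for hyp in hyps:
        votes = [EPS] * (2 * n + 1)
        seen = 0  # number of skeleton words consumed so far
        for idx, word in align(skeleton, hyp):
            if idx is None:
                # Simplification: keep only the first inserted word per gap.
                if votes[2 * seen] == EPS:
                    votes[2 * seen] = word
            else:
                votes[2 * idx + 1] = EPS if word is None else word
                seen = idx + 1
        for slot, word in zip(slots, votes):
            slot[word] += 1
    return slots


def decode(slots):
    """Pick the highest-voted arc in each slot and drop empty arcs."""
    best = (slot.most_common(1)[0][0] for slot in slots)
    return [word for word in best if word != EPS]


hypotheses = [
    "the cat sit on the mat".split(),  # skeleton
    "the cat sat on the mat".split(),
    "a cat sat on the mat".split(),
]
print(" ".join(decode(build_network(hypotheses))))  # the cat sat on the mat
```

In this toy example the combined output differs from every single input except the second, because each slot is decided independently by majority vote. A real system would typically weight the votes per system, add language model scores, and tune the weights, and the choice of aligner determines which words end up competing in the same slot.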