Using N-gram based features for machine translation system combination

Authors:
Yong Zhao;Xiaodong He
Affiliations:
Georgia Institute of Technology, Atlanta, GA;Microsoft Research, Redmond, WA
Venue:
NAACL-Short '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
Year:
2009

Citing 3
Cited 2

BLEU: a method for automatic evaluation of machine translation

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Indirect-HMM-based hypothesis alignment for combining outputs from machine translation systems

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
System Combination for Machine Translation of Spoken and Written Language

IEEE Transactions on Audio, Speech, and Language Processing

Joint optimization for machine translation system combination

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
Model combination for machine translation

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Conventional confusion network based system combination for machine translation (MT) heavily relies on features that are based on the measure of agreement of words in different translation hypotheses. This paper presents two new features that consider agreement of n-grams in different hypotheses to improve the performance of system combination. The first one is based on a sentence specific online n-gram language model, and the second one is based on n-gram voting. Experiments on a large scale Chinese-to-English MT task show that both features yield significant improvements on the translation performance, and a combination of them produces even better translation results.