Dynamic model interpolation for statistical machine translation

Authors:
Andrew Finch;Eiichiro Sumita
Affiliations:
National Institute for Science and Technology-Advanced Telecommunications Research Laboratories, Kyoto, Japan;National Institute for Science and Technology-Advanced Telecommunications Research Laboratories, Kyoto, Japan
Venue:
StatMT '08 Proceedings of the Third Workshop on Statistical Machine Translation
Year:
2008

Citing 8
Cited 8

A systematic comparison of various statistical alignment models

Computational Linguistics
Improving language models by clustering training sentences

ANLC '94 Proceedings of the fourth conference on Applied natural language processing
Statistical phrase-based translation

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Minimum error rate training in statistical machine translation

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Automatic evaluation of machine translation quality using n-gram co-occurrence statistics

HLT '02 Proceedings of the second international conference on Human Language Technology Research
Moses: open source toolkit for statistical machine translation

ACL '07 Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions
Mixture-model adaptation for SMT

StatMT '07 Proceedings of the Second Workshop on Statistical Machine Translation
Domain adaptation in statistical machine translation with mixture modelling

StatMT '07 Proceedings of the Second Workshop on Statistical Machine Translation

NICT@WMT09: model adaptation and transliteration for Spanish-English SMT

StatMT '09 Proceedings of the Fourth Workshop on Statistical Machine Translation
Domain adaptation for statistical machine translation with monolingual resources

StatMT '09 Proceedings of the Fourth Workshop on Statistical Machine Translation
Filtering syntactic constraints for statistical machine translation

ACLShort '10 Proceedings of the ACL 2010 Conference Short Papers
Context adaptation in statistical machine translation using models with exponentially decaying cache

DANLP 2010 Proceedings of the 2010 Workshop on Domain Adaptation for Natural Language Processing
Discriminative instance weighting for domain adaptation in statistical machine translation

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Phrasal syntactic category sequence model for phrase-based MT

CICLing'12 Proceedings of the 13th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part II
Distributed speech translation technologies for multiparty multilingual communication

ACM Transactions on Speech and Language Processing (TSLP)
Perplexity minimization for translation model domain adaptation in statistical machine translation

EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents a technique for class-dependent decoding for statistical machine translation (SMT). The approach differs from previous methods of class-dependent translation in that the class-dependent forms of all models are integrated directly into the decoding process. We employ probabilistic mixture weights between models that can change dynamically on a segment-by-segment basis depending on the characteristics of the source segment. The effectiveness of this approach is demonstrated by evaluating its performance on travel conversation data. We used the approach to tackle the translation of questions and declarative sentences using class-dependent models. To achieve this, our system integrated two sets of models specifically built to deal with sentences that fall into one of two classes of dialog sentence: questions and declarations, with a third set of models built to handle the general class. The technique was thoroughly evaluated on data from 17 language pairs using 6 machine translation evaluation metrics. We found the results were corpus-dependent, but in most cases our system was able to improve translation performance, and for some languages the improvements were substantial.