A comparative study on reordering constraints in statistical machine translation

Authors:
Richard Zens;Hermann Ney
Affiliations:
RWTH Aachen - University of Technology;RWTH Aachen - University of Technology
Venue:
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Year:
2003

Citing 11
Cited 36

A statistical approach to machine translation

Computational Linguistics
Bootstrap percolation, the Schro¨der numbers, and the N-kings problem

SIAM Journal on Discrete Mathematics
Generating trees and the Catalan and Schro¨der numbers

Discrete Mathematics
The art of computer programming, volume 1 (3rd ed.): fundamental algorithms

The art of computer programming, volume 1 (3rd ed.): fundamental algorithms
Stochastic inversion transduction grammars and bilingual parsing of parallel corpora

Computational Linguistics
Decoding complexity in word-replacement translation models

Computational Linguistics
A polynomial-time algorithm for statistical machine translation

ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
Discriminative training and maximum entropy models for statistical machine translation

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Improved statistical alignment models

ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Generation of word graphs in statistical machine translation

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Stochastic inversion transduction grammars with application to segmentation, bracketing, and alignment of parallel corpora

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2

Stochastic lexicalized inversion transduction grammar for alignment

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Distortion models for statistical machine translation

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Left-to-right target generation for hierarchical phrase-based translation

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Empirical lower bounds on the complexity of translational equivalence

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Reordering constraints for phrase-based statistical machine translation

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Syntax-based alignment: supervised or unsupervised?

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Local phrase reordering models for statistical machine translation

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Do we need phrases?: challenging the conventional wisdom in statistical machine translation

HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Transfer-based statistical translation of Taiwanese sign language using PCFG

ACM Transactions on Asian Language Information Processing (TALIP)
Inducing word alignments with bilexical synchronous trees

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Statistical machine translation

ACM Computing Surveys (CSUR)
Lexicalized Syntactic Reordering Framework for Word Alignment and Machine Translation

ICCPOL '09 Proceedings of the 22nd International Conference on Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy
Efficient search for Inversion Transduction Grammar

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Coarse-to-fine syntactic machine translation using language projections

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
A unigram orientation model for statistical machine translation

HLT-NAACL-Short '04 Proceedings of HLT-NAACL 2004: Short Papers
Imposing constraints from the source tree on ITG constraints for SMT

SSST '08 Proceedings of the Second Workshop on Syntax and Structure in Statistical Translation
Comparing reordering constraints for SMT using efficient Bleu oracle computation

SSST '07 Proceedings of the NAACL-HLT 2007/AMTA Workshop on Syntax and Structure in Statistical Translation
Empirical lower bounds on alignment error rates in syntax-based machine translation

SSST '09 Proceedings of the Third Workshop on Syntax and Structure in Statistical Translation
A quantitative analysis of reordering phenomena

StatMT '09 Proceedings of the Fourth Workshop on Statistical Machine Translation
A POS-based model for long-range reorderings in SMT

StatMT '09 Proceedings of the Fourth Workshop on Statistical Machine Translation
Novel reordering approaches in phrase-based statistical machine translation

ParaText '05 Proceedings of the ACL Workshop on Building and Using Parallel Texts
A recursive statistical translation model

ParaText '05 Proceedings of the ACL Workshop on Building and Using Parallel Texts
Better word alignments with supervised ITG models

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Empirical lower bounds on translation unit error rate for the full class of inversion transduction grammars

IWPT '09 Proceedings of the 11th International Conference on Parsing Technologies
Learning linear ordering problems for better translation

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Bilingual segmentation for alignment and translation

CICLing'08 Proceedings of the 9th international conference on Computational linguistics and intelligent text processing
Improved models of distortion cost for statistical machine translation

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
An efficient shift-reduce decoding algorithm for phrased-based machine translation

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Linguistically annotated reordering: Evaluation and analysis

Computational Linguistics
Reordering with source language collocations

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
A $${\mathcal{O}(|G|n^6)}$$ time extension of inversion transduction grammars

Machine Translation
Inversion transduction grammar constraints for mining parallel sentences from quasi-comparable corpora

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Cross-lingual language modeling with syntactic reordering for low-resource speech recognition

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
On hierarchical re-ordering and permutation parsing for phrase-based decoding

WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
Statistical translation after source reordering: Oracles, context-aware models, and empirical analysis

Natural Language Engineering
Oracle decoding as a new way to analyze phrase-based machine translation

Machine Translation

Quantified Score

Hi-index	0.00

Visualization

Abstract

In statistical machine translation, the generation of a translation hypothesis is computationally expensive. If arbitrary word-reorderings are permitted, the search problem is NP-hard. On the other hand, if we restrict the possible word-reorderings in an appropriate way, we obtain a polynomial-time search algorithm.In this paper, we compare two different reordering constraints, namely the ITG constraints and the IBM constraints. This comparison includes a theoretical discussion on the permitted number of reorderings for each of these constraints. We show a connection between the ITG constraints and the since 1870 known Schröder numbers.We evaluate these constraints on two tasks: the Verbmobil task and the Canadian Hansards task. The evaluation consists of two parts: First, we check how many of the Viterbi alignments of the training corpus satisfy each of these constraints. Second, we restrict the search to each of these constraints and compare the resulting translation hypotheses.The experiments will show that the baseline ITG constraints are not sufficient on the Canadian Hansards task. Therefore, we present an extension to the ITG constraints. These extended ITG constraints increase the alignment coverage from about 87% to 96%.