Hierarchical phrase-based translation grammars extracted from alignment posterior probabilities

Authors:
Adrià de Gispert;Juan Pino;William Byrne
Affiliations:
University of Cambridge, U.K.;University of Cambridge, U.K.;University of Cambridge, U.K.
Venue:
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Year:
2010

Citing 21
Cited 4

A systematic comparison of various statistical alignment models

Computational Linguistics
The mathematics of statistical machine translation: parameter estimation

Computational Linguistics - Special issue on using large corpora: II
Statistical phrase-based translation

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Minimum error rate training in statistical machine translation

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Effective phrase translation extraction from alignment models

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
A phrase-based, joint probability model for statistical machine translation

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Bitext alignment for statistical machine translation

Bitext alignment for statistical machine translation
Hierarchical Phrase-Based Translation

Computational Linguistics
The complexity of phrase alignment problems

HLT-Short '08 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers
Tera-scale translation models via pattern matching

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
A systematic comparison of phrase-based, hierarchical and syntax-augmented statistical MT

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Rule filtering by pattern for efficient hierarchical translation

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Lattice Minimum Bayes-Risk decoding for statistical machine translation

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Improving phrase-based translation via word alignments from stochastic inversion transduction grammars

SSST '09 Proceedings of the Third Workshop on Syntax and Structure in Statistical Translation
A Gibbs sampler for phrasal synchronous grammar induction

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Weighted alignment matrices for statistical machine translation

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Statistical Machine Translation

Statistical Machine Translation
Unsupervised syntactic alignment with inversion transduction grammars

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Efficient path counting transducers for minimum bayes-risk decoding of statistical machine translation lattices

ACLShort '10 Proceedings of the ACL 2010 Conference Short Papers
Hierarchical phrase-based translation with weighted finite-state transducers and shallow-n grammars

Computational Linguistics
HMM Word and Phrase Alignment for Statistical Machine Translation

IEEE Transactions on Audio, Speech, and Language Processing

Bayesian extraction of minimal SCFG rules for hierarchical phrase-based translation

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Leave-one-out phrase model training for large-scale deployment

WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
Unsupervised sub-tree alignment for tree-to-tree translation

Journal of Artificial Intelligence Research
Maximum-entropy word alignment and posterior-based phrase extraction for machine translation

Machine Translation

Quantified Score

Hi-index	0.00

Visualization

Abstract

We report on investigations into hierarchical phrase-based translation grammars based on rules extracted from posterior distributions over alignments of the parallel text. Rather than restrict rule extraction to a single alignment, such as Viterbi, we instead extract rules based on posterior distributions provided by the HMM word-to-word alignment model. We define translation grammars progressively by adding classes of rules to a basic phrase-based system. We assess these grammars in terms of their expressive power, measured by their ability to align the parallel text from which their rules are extracted, and the quality of the translations they yield. In Chinese-to-English translation, we find that rule extraction from posteriors gives translation improvements. We also find that grammars with rules with only one nonterminal, when extracted from posteriors, can outperform more complex grammars extracted from Viterbi alignments. Finally, we show that the best way to exploit source-to-target and target-to-source alignment models is to build two separate systems and combine their output translation lattices.