Symmetric word alignments for statistical machine translation

Authors:
Evgeny Matusov; Richard Zens; Hermann Ney
Affiliations:
RWTH Aachen University, Aachen, Germany; RWTH Aachen University, Aachen, Germany; RWTH Aachen University, Aachen, Germany
Venue:
COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Year:
2004

Abstract

In this paper, we address the word alignment problem for statistical machine translation. We aim to create a symmetric word alignment that allows for reliable one-to-many and many-to-one word relationships. We perform iterative alignment training in both the source-to-target and the target-to-source directions with the well-known IBM and HMM alignment models. Using these models, we robustly estimate the local cost of aligning a source word and a target word in each sentence pair. We then use efficient graph algorithms to determine the symmetric alignment with minimal total cost (i.e., maximal alignment probability). We evaluate the automatic alignments created in this way on the German-English Verbmobil task and the French-English Canadian Hansards task. We show statistically significant improvements in alignment quality compared to the best results reported so far. On the Verbmobil task, we achieve an improvement of more than 1% absolute over the baseline error rate of 4.7%.
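
The idea of combining directional costs and then searching for a minimum-cost symmetric alignment can be illustrated with a small sketch. It is not the authors' implementation and rests on several assumptions: the two directional cost matrices are merged by linear interpolation with a hypothetical weight `alpha`, costs are non-negative, and the symmetric alignment is modeled as a minimum-cost edge cover of the bipartite word graph (every source and target word gets at least one link, so one-to-many and many-to-one links are possible). The edge cover is found by reducing it to an assignment problem solved with a standard Hungarian-style routine. All function names are illustrative.

```python
from typing import List, Tuple

Matrix = List[List[float]]

def combine_costs(s2t: Matrix, t2s: Matrix, alpha: float = 0.5) -> Matrix:
    """Interpolate two directional cost matrices (assumed combination rule)."""
    return [[alpha * a + (1.0 - alpha) * b for a, b in zip(ra, rb)]
            for ra, rb in zip(s2t, t2s)]

def hungarian(a: Matrix) -> List[int]:
    """Min-cost assignment for a square matrix; returns the column of each row."""
    n = len(a)
    inf = float("inf")
    u, v = [0.0] * (n + 1), [0.0] * (n + 1)   # row / column potentials
    p, way = [0] * (n + 1), [0] * (n + 1)     # p[j] = row matched to column j
    for i in range(1, n + 1):
        p[0], j0 = i, 0
        minv, used = [inf] * (n + 1), [False] * (n + 1)
        while True:
            used[j0] = True
            i0, delta, j1 = p[j0], inf, 0
            for j in range(1, n + 1):
                if not used[j]:
                    cur = a[i0 - 1][j - 1] - u[i0] - v[j]
                    if cur < minv[j]:
                        minv[j], way[j] = cur, j0
                    if minv[j] < delta:
                        delta, j1 = minv[j], j
            for j in range(n + 1):
                if used[j]:
                    u[p[j]] += delta
                    v[j] -= delta
                else:
                    minv[j] -= delta
            j0 = j1
            if p[j0] == 0:
                break
        while j0 != 0:                        # flip the augmenting path
            j1 = way[j0]
            p[j0] = p[j1]
            j0 = j1
    assign = [0] * n
    for j in range(1, n + 1):
        assign[p[j] - 1] = j - 1
    return assign

def min_cost_edge_cover(cost: Matrix) -> List[Tuple[int, int]]:
    """Minimum-cost edge cover of a complete bipartite graph (costs >= 0)."""
    I, J = len(cost), len(cost[0])
    row_best = [min(range(J), key=lambda j: cost[i][j]) for i in range(I)]
    col_best = [min(range(I), key=lambda i: cost[i][j]) for j in range(J)]
    big = 2.0 * sum(map(sum, cost)) + 1.0     # larger than any optimal total
    n = I + J
    a = [[0.0] * n for _ in range(n)]
    for i in range(I):
        a[i][:J] = cost[i]                    # link source i to target j
        for k in range(I):                    # or let i fall back to its cheapest link
            a[i][J + k] = cost[i][row_best[i]] if k == i else big
    for r in range(J):                        # helper rows: target j uses its cheapest link
        a[I + r][:J] = [cost[col_best[j]][j] for j in range(J)]
    assign = hungarian(a)
    links = {(i, assign[i]) if assign[i] < J else (i, row_best[i]) for i in range(I)}
    matched = {assign[i] for i in range(I) if assign[i] < J}
    links |= {(col_best[j], j) for j in range(J) if j not in matched}
    return sorted(links)

# Toy sentence pair: 2 source words, 3 target words (made-up costs).
s2t = [[0.1, 2.0, 3.0], [2.5, 0.2, 0.3]]
t2s = [[0.3, 2.0, 3.0], [2.5, 0.4, 0.1]]
print(min_cost_edge_cover(combine_costs(s2t, t2s)))  # [(0, 0), (1, 1), (1, 2)]
```

In the toy example, the second source word is linked to two target words, which a strictly one-to-one assignment could not express. In practice, the cost entries would come from the trained alignment models rather than being set by hand.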