A new objective function for word alignment

Authors:
Tugba Bodrumlu;Kevin Knight;Sujith Ravi
Affiliations:
University of Southern California;University of Southern California;University of Southern California
Venue:
ILP '09 Proceedings of the Workshop on Integer Linear Programming for Natural Language Processing
Year:
2009

Abstract

We develop a new objective function for word alignment that measures the size of the bilingual dictionary induced by an alignment. A word alignment that results in a small dictionary is preferred over one that results in a large dictionary. In order to search for the alignment that minimizes this objective, we cast the problem as an integer linear program. We then extend our objective function to align corpora at the sub-word level, which we demonstrate on a small Turkish-English corpus.
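
The dictionary-size objective described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the paper searches with an integer linear program, whereas this example swaps in plain brute-force enumeration so that it runs with the standard library only. The toy corpus, the function names, and the restriction to one-to-one alignments between equal-length sentences are all assumptions made for illustration.

```python
from itertools import permutations, product

# Hypothetical toy corpus: (source sentence, target sentence) pairs of equal length.
TOY_CORPUS = [
    (["a", "b"], ["x", "y"]),
    (["b", "c"], ["y", "z"]),
    (["a", "c"], ["x", "z"]),
]


def dictionary_size(corpus, alignments):
    """Count the distinct (source word, target word) pairs induced by the alignments.

    alignments[k][j] is the source position linked to target position j
    in sentence pair k.
    """
    entries = set()
    for (src, tgt), links in zip(corpus, alignments):
        for j, i in enumerate(links):
            entries.add((src[i], tgt[j]))
    return len(entries)


def min_dictionary_alignment(corpus):
    """Exhaustively search one-to-one alignments for the smallest dictionary.

    Assumption: every sentence pair has equal length and is aligned by a
    permutation. The search is exponential, so it only suits toy inputs.
    """
    per_sentence = [list(permutations(range(len(src)))) for src, _ in corpus]
    best_size, best_links = None, None
    for candidate in product(*per_sentence):
        size = dictionary_size(corpus, candidate)
        if best_size is None or size < best_size:
            best_size, best_links = size, candidate
    return best_size, best_links


if __name__ == "__main__":
    size, links = min_dictionary_alignment(TOY_CORPUS)
    print(size, links)
```

On this toy corpus the only alignment that reaches the minimum links a-x, b-y and c-z consistently in every sentence; any alignment that links a word differently in different sentences adds dictionary entries. An ILP formulation would express the same preference with binary link and dictionary-entry variables instead of enumerating every candidate.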