A unigram orientation model for statistical machine translation

Authors:
Christoph Tillmann
Affiliations:
IBM T.J. Watson Research Center, Yorktown Heights, NY
Venue:
HLT-NAACL-Short '04 Proceedings of HLT-NAACL 2004: Short Papers
Year:
2004

Citing 6
Cited 66

The mathematics of statistical machine translation: parameter estimation

Computational Linguistics - Special issue on using large corpora: II
A polynomial-time algorithm for statistical machine translation

ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
Statistical phrase-based translation

NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
A phrase-based unigram model for statistical machine translation

NAACL-Short '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume of the Proceedings of HLT-NAACL 2003--short papers - Volume 2
A comparative study on reordering constraints in statistical machine translation

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Effective phrase translation extraction from alignment models

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1

A localized prediction model for statistical machine translation

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Distortion models for statistical machine translation

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Left-to-right target generation for hierarchical phrase-based translation

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Segment choice models: feature-rich models for global distortion in statistical machine translation

HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
A block bigram prediction model for statistical machine translation

ACM Transactions on Speech and Language Processing (TSLP)
Hierarchical Phrase-Based Translation

Computational Linguistics
Improving statistical MT by coupling reordering and decoding

Machine Translation
Introducing a Translation Dictionary into Phrase-Based SMT

IEICE - Transactions on Information and Systems
Large-Scale Statistical Machine Translation with Weighted Finite State Transducers

Proceedings of the 2009 conference on Finite-State Methods and Natural Language Processing: Post-proceedings of the 7th International Workshop FSMNLP 2008
Tera-scale translation models via pattern matching

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Improving mid-range reordering using templates of factors

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
SPMT: statistical machine translation with syntactified target language phrases

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
A simple and effective hierarchical phrase reordering model

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Kernel regression based machine translation

NAACL-Short '07 Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
Imposing constraints from the source tree on ITG constraints for SMT

SSST '08 Proceedings of the Second Workshop on Syntax and Structure in Statistical Translation
Discriminative reordering with Chinese grammatical relations features

SSST '09 Proceedings of the Third Workshop on Syntax and Structure in Statistical Translation
Reordering model using syntactic information of a source tree for statistical machine translation

SSST '09 Proceedings of the Third Workshop on Syntax and Structure in Statistical Translation
Coupling hierarchical word reordering and decoding in phrase-based statistical machine translation

SSST '09 Proceedings of the Third Workshop on Syntax and Structure in Statistical Translation
CCG supertags in factored statistical machine translation

StatMT '07 Proceedings of the Second Workshop on Statistical Machine Translation
Regularization and search for minimum error rate training

StatMT '08 Proceedings of the Third Workshop on Statistical Machine Translation
Shared task: statistical machine translation between European languages

ParaText '05 Proceedings of the ACL Workshop on Building and Using Parallel Texts
Tree kernel-based SVM with structured syntactic knowledge for BTG-based phrase reordering

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Learning linear ordering problems for better translation

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
A direct syntax-driven reordering model for phrase-based machine translation

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Improved models of distortion cost for statistical machine translation

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Accurate non-hierarchical phrase-based translation

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Learning lexicalized reordering models from reordering graphs

ACLShort '10 Proceedings of the ACL 2010 Conference Short Papers
SMT of Latvian, Lithuanian and Estonian Languages: a Comparative Study

Proceedings of the 2010 conference on Human Language Technologies -- The Baltic Perspective: Proceedings of the Fourth International Conference Baltic HLT 2010
LIMSI's statistical translation systems for WMT'10

WMT '10 Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR
Using collocation segmentation to augment the phrase table

WMT '10 Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR
Divide and translate: improving long distance reordering in statistical machine translation

WMT '10 Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR
Local lexical adaptation in machine translation through triangulation: SMT helping SMT

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Hierarchical phrase-based machine translation with word-based reordering model

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
A novel reordering model based on multi-layer phrase for statistical machine translation

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Syntax based reordering with automatically derived rules for improved statistical machine translation

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Improving reordering with linguistically informed bilingual n-grams

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Linguistically annotated reordering: Evaluation and analysis

Computational Linguistics
Exploitation of Machine Learning Techniques in Modelling Phrase Movements for Machine Translation

The Journal of Machine Learning Research
Syntax-based reordering for statistical machine translation

Computer Speech and Language
Reordering with source language collocations

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Improving reordering for statistical machine translation with smoothed priors and syntactic features

SSST-5 Proceedings of the Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation
LIMSI @ WMT11

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Joint WMT submission of the QUAERO project

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
ILLC-UvA translation system for EMNLP-WMT 2011

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
From n-gram-based to CRF-based translation models

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
A word reordering model for improved machine translation

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Soft dependency constraints for reordering in hierarchical phrase-based translation

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Statistical machine translation with local language models

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Chunk-lattices for verb reordering in Arabic---English statistical machine translation

Machine Translation
Toward statistical machine translation without parallel corpora

EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Cutting the long tail: hybrid language models for translation style adaptation

EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
Continuous space translation models with neural networks

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
A class-based agreement model for generating accurately inflected translations

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Modified distortion matrices for phrase-based statistical machine translation

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Head finalization reordering for Chinese-to-Japanese machine translation

SSST-6 '12 Proceedings of the Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation
On hierarchical re-ordering and permutation parsing for phrase-based decoding

WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
CCG syntactic reordering models for phrase-based machine translation

WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
Joint WMT 2012 submission of the QUAERO project

WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
LIMSI @ WMT'12

WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
Statistical translation after source reordering: Oracles, context-aware models, and empirical analysis

Natural Language Engineering
A model based transformation paradigm for cross-language collaborations

Advanced Engineering Informatics
Evaluating indirect strategies for Chinese-Spanish statistical machine translation

Journal of Artificial Intelligence Research
Automatic normalization of short texts by combining statistical and rule-based techniques

Language Resources and Evaluation
Syntax-Based Post-Ordering for Efficient Japanese-to-English Translation

ACM Transactions on Asian Language Information Processing (TALIP)
Statistical machine translation enhancements through linguistic levels: A survey

ACM Computing Surveys (CSUR)
Distortion Model Based on Word Sequence Labeling for Statistical Machine Translation

ACM Transactions on Asian Language Information Processing (TALIP)

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we present a unigram segmentation model for statistical machine translation where the segmentation units are blocks: pairs of phrases without internal structure. The segmentation model uses a novel orientation component to handle swapping of neighbor blocks. During training, we collect block unigram counts with orientation: we count how often a block occurs to the left or to the right of some predecessor block. The orientation model is shown to improve translation performance over two models: 1) no block re-ordering is used, and 2) the block swapping is controlled only by a language model. We show experimental results on a standard Arabic-English translation task.