Findings of the 2011 Workshop on Statistical Machine Translation

Authors:
Chris Callison-Burch;Philipp Koehn;Christof Monz;Omar F. Zaidan
Affiliations:
Johns Hopkins University;University of Edinburgh;University of Amsterdam;Johns Hopkins University
Venue:
WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Year:
2011

Citing 62
Cited 33

The surprise language exercises

ACM Transactions on Asian Language Information Processing (TALIP)
BLEU: a method for automatic evaluation of machine translation

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Moses: open source toolkit for statistical machine translation

ACL '07 Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions
The ups and downs of preposition error detection in ESL writing

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
(Meta-) evaluation of machine translation

StatMT '07 Proceedings of the Second Workshop on Statistical Machine Translation
Further meta-evaluation of machine translation

StatMT '08 Proceedings of the Third Workshop on Statistical Machine Translation
Findings of the 2009 workshop on statistical machine translation

StatMT '09 Proceedings of the Fourth Workshop on Statistical Machine Translation
Deep linguistic multilingual translation and bilingual dictionaries

StatMT '09 Proceedings of the Fourth Workshop on Statistical Machine Translation
Manual and automatic evaluation of machine translation between European languages

StatMT '06 Proceedings of the Workshop on Statistical Machine Translation
Edit distances with block movements and error rate confidence estimates

Machine Translation
Machine translation evaluation versus quality estimation

Machine Translation
Findings of the 2010 Joint Workshop on Statistical Machine Translation and Metrics for Machine Translation

WMT '10 Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR
Joshua 2.0: a toolkit for parsing-based machine translation with syntax, semirings, discriminative training and other goodies

WMT '10 Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR
Evaluate with confidence estimation: machine ranking of translation outputs using grammatical features

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
AMBER: a modified BLEU, enhanced ranking metric

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
TESLA at WMT 2011: translation evaluation and tunable metric

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Meteor 1.3: automatic metric for reliable optimization and evaluation of machine translation systems

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Approximating a deep-syntactic metric for MT evaluation and tuning

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Evaluation without references: IBM1 scores as evaluation metrics

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Morphemes and POS tags for n-gram based evaluation metrics

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
e-rating machine translation

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
TINE: a metric to assess MT adequacy

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Regression and ranking based optimisation for sentence level machine translation evaluation

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
MANY improvements for WMT'11

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
The UPV-PRHLT combination system for WMT 2011

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
CMU system combination in WMT 2011

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
The RWTH system combination system for WMT 2011

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Expected BLEU training for graphs: BBN system description for WMT11 system combination task

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
The UZH system combination system for WMT 2011

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Description of the JHU system combination scheme for WMT 2011

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Personal translator at WMT2011: a rule-based MTsystem with hybrid components

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
LIMSI @ WMT11

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Shallow semantic trees for SMT

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
RegMT system for machine translation, system combination, and evaluation

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Improving translation model by monolingual data

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
The CMU-ARK German-English translation system

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Noisy SMS machine translation in low-density languages

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Stochastic parse tree selection for an existing RBMT system

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Joint WMT submission of the QUAERO project

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
CMU syntax-based machine translation at WMT 2011

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
The Uppsala-FBK systems at WMT 2011

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
The Karlsruhe Institute of Technology translation systems for the WMT 2011

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
CMU Haitian Creole-English translation system for WMT 2011

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Experiments with word alignment, normalization and clause reordering for SMT between English and German

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
The value of monolingual crowdsourcing in a real-world translation scenario: simulation using Haitian Creole emergency SMS messages

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
The RWTH Aachen machine translation system for WMT 2011

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
ILLC-UvA translation system for EMNLP-WMT 2011

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
UPM system for the translation task

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Two-step translation with grammatical post-processing

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Influence of parser choice on dependency-based MT

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
The LIGA (LIG/LIA) machine translation system for WMT 2011

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Factored translation with unsupervised word clusters

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
The BM-I2R Haitian-Créole-to-English translation system description for the WMT 2011 evaluation campaign

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
The Universitat d'Alacant hybrid machine translation system for WMT 2011

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
LIUM's SMT machine translation systems for WMT 2011

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Spell checking techniques for replacement of unknown words and data cleaning for Haitian Creole SMS translation

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Joshua 3.0: syntax-based machine translation with the Thrax grammar extractor

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
DFKI hybrid machine translation system for WMT 2011: on the integration of SMT and RBMT

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
CEU-UPV English-Spanish system for WMT11

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Hierarchical phrase-based MT at the Charles University for the WMT 2011 shared task

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Crisis MT: developing a cookbook for MT in crisis situations

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Better evaluation metrics lead to better machine translation

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing

The CMU-ARK German-English translation system

WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
Improving pronoun translation for statistical machine translation

EACL '12 Proceedings of the Student Research Workshop at the 13th Conference of the European Chapter of the Association for Computational Linguistics
HyTER: meaning-equivalent semantics for translation evaluation

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
SemEval-2012 task 1: English Lexical Simplification

SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
Linguistically-augmented Bulgarian-to-English statistical machine translation model

EACL 2012 Proceedings of the Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra)
Clustered word classes for preordering in statistical machine translation

ROBUS-UNSUP '12 Proceedings of the Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP
Learning to translate with multiple objectives

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Prediction of learning curves in machine translation

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Character-level machine translation evaluation for languages with ambiguous word boundaries

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
PORT: a precision-order-recall MT evaluation metric for tuning

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Probabilistic finite state machines for regression-based MT evaluation

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Language model rest costs and space-efficient storage

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Document-wide decoding for phrase-based statistical machine translation

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Accurate unsupervised joint named-entity extraction from unaligned parallel text

NEWS '12 Proceedings of the 4th Named Entity Workshop
Linguistically-enriched models for Bulgarian-to-English machine translation

SSST-6 '12 Proceedings of the Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation
Using parallel features in parsing of machine-translated sentences for correction of grammatical errors

SSST-6 '12 Proceedings of the Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation
Putting human assessments of machine translation systems in order

WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
Findings of the 2012 workshop on statistical machine translation

WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
Semantic textual similarity for MT evaluation

WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
Class error rates for evaluation of machine translation output

WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
SPEDE: probabilistic edit distance metrics for MT evaluation

WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
Morpheme- and POS-based IBM1 scores and language model scores for translation quality estimation

WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
Review of hypothesis alignment algorithms for MT system combination via confusion network decoding

WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
Towards effective use of training data in statistical machine translation

WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
DEPFIX: a system for automatic correction of Czech MT outputs

WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
Analysing the effect of out-of-domain data on SMT systems

WMT '12 Proceedings of the Seventh Workshop on Statistical Machine Translation
Automatic normalization of short texts by combining statistical and rule-based techniques

Language Resources and Evaluation
No free lunch in factored phrase-based machine translation

CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume 2
Crowdsourcing and the crisis-affected community

Information Retrieval
Automatically assessing machine summary content without a gold standard

Computational Linguistics
Identifying multilingual Wikipedia articles based on cross language similarity and activity

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Lattice BLEU oracles in machine translation

ACM Transactions on Speech and Language Processing (TSLP)
Sentence-level ranking with quality estimation

Machine Translation

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents the results of the WMT11 shared tasks, which included a translation task, a system combination task, and a task for machine translation evaluation metrics. We conducted a large-scale manual evaluation of 148 machine translation systems and 41 system combination entries. We used the ranking of these systems to measure how strongly automatic metrics correlate with human judgments of translation quality for 21 evaluation metrics. This year featured a Haitian Creole to English task translating SMS messages sent to an emergency response service in the aftermath of the Haitian earthquake. We also conducted a pilot 'tunable metrics' task to test whether optimizing a fixed system to different metrics would result in perceptibly different translation quality.