The CMU-EBMT machine translation system

  • Authors:
  • Ralf D. Brown

  • Affiliations:
  • Carnegie Mellon University Language Technologies Institute, Pittsburgh, USA 15213

  • Venue:
  • Machine Translation
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents an in-depth description of the features of the open-source CMU-EBMT example-based machine translation system. CMU-EBMT is a complete end-to-end system including lexicon induction, word and phrase alignment, corpus indexing and lookup, language model, decoder, and parameter tuning components. While it does not require them, it can take advantage of external alignment information and other annotations provided by GIZA++ and other systems. To illustrate a recent addition to CMU-EBMT, experiments are presented which show an improvement of 0.16 BLEU points (0.9% relative) on a cross-validated small-data English---Haitian translation task when using a new set of fine-grained log-linear feature values representing language model match lengths in addition to language model probabilities.