Dependency treelet translation: the convergence of statistical and example-based machine-translation?

  • Authors:
  • Christopher Quirk; Arul Menezes

  • Affiliations:
  • Microsoft Research, One Microsoft Way, Redmond, USA 98052; Microsoft Research, One Microsoft Way, Redmond, USA 98052

  • Venue:
  • Machine Translation
  • Year:
  • 2006

Abstract

We describe a novel approach to MT that combines the strengths of the two leading corpus-based approaches: phrasal SMT and EBMT. We use a syntactically informed decoder and reordering model based on the source dependency tree, in combination with conventional SMT models, to combine the power of phrasal SMT with the linguistic generality available in a parser. We show that this approach significantly outperforms a leading string-based phrasal SMT decoder and an EBMT system. We present results from two radically different language pairs, and investigate the sensitivity of this approach to parse quality by using two distinct parsers and oracle experiments. We also validate our automated BLEU scores with a small human evaluation.