Overcoming the customization bottleneck using example-based MT

  • Authors:
  • Stephen D. Richardson;William B. Dolan;Arul Menezes;Monica Corston-Oliver

  • Affiliations:
  • Microsoft Research, Redmond, WA;Microsoft Research, Redmond, WA;Microsoft Research, Redmond, WA;Butler Hill Group, Seattle WA

  • Venue:
  • DMMT '01 Proceedings of the workshop on Data-driven methods in machine translation - Volume 14
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

We describe MSR-MT, a large-scale hybrid machine translation system under development for several language pairs. This system's ability to acquire its primary translation knowledge automatically by parsing a bilingual corpus of hundreds of thousands of sentence pairs and aligning resulting logical forms demonstrates true promise for overcoming the so-called MT customization bottleneck. Trained on English and Spanish technical prose, a blind evaluation shows that MSR-MT's integration of rule-based parsers, example based processing, and statistical techniques produces translations whose quality exceeds that of uncustomized commercial MT systems in this domain.