AMBER: a modified BLEU, enhanced ranking metric

  • Authors:
  • Boxing Chen;Roland Kuhn

  • Affiliations:
  • National Research Council of Canada, Gatineau, Québec, Canada;National Research Council of Canada, Gatineau, Québec, Canada

  • Venue:
  • WMT '11 Proceedings of the Sixth Workshop on Statistical Machine Translation
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper proposes a new automatic machine translation evaluation metric: AMBER, which is based on the metric BLEU but incorporates recall, extra penalties, and some text processing variants. There is very little linguistic information in AMBER. We evaluate its system-level correlation and sentence-level consistency scores with human rankings from the WMT shared evaluation task; AMBER achieves state-of-the-art performance.