Automatic translation error analysis

  • Authors:
  • Mark Fishel;Ondřej Bojar;Daniel Zeman;Jan Berka

  • Affiliations:
  • Department of Computer Science, University of Tartu, Estonia;Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague, Czechia;Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague, Czechia;Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague, Czechia

  • Venue:
  • TSD'11 Proceedings of the 14th international conference on Text, speech and dialogue
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

We propose a method of automatic identification of various error types in machine translation output. The approach is mostly based on monolingual word alignment of the hypothesis and the reference translation. In addition to common lexical errors misplaced words are also detected. A comparison to manually classified MT errors is presented. Our error classification is inspired by that of Vilar (2006; [17]), although distinguishing some of their categories is beyond the reach of the current version of our system.