A Methodology for Evaluating Arabic Machine Translation Systems

Authors:
Ahmed Guessoum;Rached Zantout
Affiliations:
Department of Computer Science, The University of Sharjah, Sharjah, UAE;Faculty of Engineering, Department of Computer and Communications Engineering, Hariri Canadian University, Mechref, Lebanon 2010
Venue:
Machine Translation
Year:
2004

Citing 7
Cited 0

Toward a methodology for the evaluation of machine-assisted translation systems

Computers and Translation
A Methodology for a Semi-Automatic Evaluation of the Lexicons of Machine Translation Systems

Machine Translation
Lexical transfer: between a source rock and a hard target

COLING '88 Proceedings of the 12th conference on Computational linguistics - Volume 2
Evaluation metrics for knowledge-based machine translation

COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
The KANT system: fast, accurate, high-quality translation in practical domains

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 3
Using test suites in evaluation of machine translation systems

COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 2
BLEU: a method for automatic evaluation of machine translation

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents a methodology for evaluating Arabic Machine Translation (MT) systems. We are specifically interested in evaluating lexical coverage, grammatical coverage, semantic correctness and pronoun resolution correctness. The methodology presented is statistical and is based on earlier work on evaluating MT lexicons in which the idea of the importance of a specific word sense to a given application domain and how its presence or absence in the lexicon affects the MT system's lexical quality, which in turn will affect the overall system output quality. The same idea is used in this paper and generalized so as to apply to grammatical coverage, semantic correctness and correctness of pronoun resolution. The approach adopted in this paper has been implemented and applied to evaluating four English-Arabic commercial MT systems. The results of the evaluation of these systems are presented for the domain of the Internet and Arabization.