Toward a methodology for the evaluation of machine-assisted translation systems
Computers and Translation
Lexical transfer: between a source rock and a hard target
COLING '88 Proceedings of the 12th conference on Computational linguistics - Volume 2
Evaluation metrics for knowledge-based machine translation
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
The KANT system: fast, accurate, high-quality translation in practical domains
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 3
Using test suites in evaluation of machine translation systems
COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 2
BLEU: a method for automatic evaluation of machine translation
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Hi-index | 0.00 |
This paper presents a methodology for evaluating Arabic Machine Translation (MT) systems. We are specifically interested in evaluating lexical coverage, grammatical coverage, semantic correctness and pronoun resolution correctness. The methodology presented is statistical and is based on earlier work on evaluating MT lexicons in which the idea of the importance of a specific word sense to a given application domain and how its presence or absence in the lexicon affects the MT system's lexical quality, which in turn will affect the overall system output quality. The same idea is used in this paper and generalized so as to apply to grammatical coverage, semantic correctness and correctness of pronoun resolution. The approach adopted in this paper has been implemented and applied to evaluating four English-Arabic commercial MT systems. The results of the evaluation of these systems are presented for the domain of the Internet and Arabization.