Guidelines for Electronic Text Encoding and Interchange: Volumes 1 and 2: P4
Guidelines for Electronic Text Encoding and Interchange: Volumes 1 and 2: P4
Computational Linguistics - Special issue on using large corpora: I
Building a large annotated corpus of English: the penn treebank
Computational Linguistics - Special issue on using large corpora: II
Using test suites in evaluation of machine translation systems
COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 2
Rapid development of translation tools: application to Persian and Turkish
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 2
Hi-index | 0.00 |
This article is concerned with the building of a test data set for assisting the industrial user in machine translation evaluation. The emphasis is laid on the interest of an approach based on the study of bilingual corpus pragmatic characteristics. The study of one chapter of the maintenance manual of the Super Puma helicopter made it possible to identify the pragmatic characteristics relevant in the choice of the morpho-syntactic structures and translation processes actually used. The textual test set consists in a SGML file including the source text sequences aligned with the reference translation sequences and also including the pragmatic, formal and translational characteristics in the form of annotations (labels and formal descriptions).