A symbolic and surgical acquisition of terms through variation
Connectionist, Statistical, and Symbolic Approaches to Learning for Natural Language Processing
A comparison of alignment models for statistical machine translation
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 2
HLT '93 Proceedings of the workshop on Human Language Technology
Improving a general-purpose Statistical Translation Engine by terminological lexicons
COMPUTERM '02 COLING-02 on COMPUTERM 2002: second international workshop on computational terminology - Volume 14
Bilingual lexicon extraction from comparable corpora enhanced with parallel corpora
BUCC '11 Proceedings of the 4th Workshop on Building and Using Comparable Corpora: Comparable Corpora and the Web
French-english terminology extraction from comparable corpora
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Hi-index | 0.00 |
In a recent study Langlais (Langlais, 2002) has shown that the output of a Statistical Machine Translation (SMT) system deteriorates significantly the more the new text differs from the text the system has been trained on. Langlais shows that bilingual terminological databases are resources that can be taken into account to boost the performance of the statistical engine. This paper extends the notion of 'terminological databases' to an Intelligent Terminological Database (ITDB) capable to detect and reduce terms and their variants and to re-generate the authorized target language terms. The paper discusses the aims and the architecture of the ITDB and evaluates its integration with a SMT system.