ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Hi-index | 0.00 |
In the course of our current research on automatic information extraction from medical electronic literature, we have been facing the need to map big corpora onto the concepts of the UMLS Metathesaurus, both in French and in English. In order to meet our specific needs in terms of processing speed, we have developed a lightweight UMLS tagger, MetaCoDe, that processes large text collections at an acceptable speed, but at the cost of the sophistication of the treatments. In this paper, we describe MetaCoDe and evaluate its quality, allowing potential users to balance the gain in speed against the loss in quality.