Journal of Chemical Information & Computer Sciences
Foundations of statistical natural language processing
Foundations of statistical natural language processing
Extraction and search of chemical formulae in text documents on the web
Proceedings of the 16th international conference on World Wide Web
An Efficient Statistical Approach for Automatic Organic Chemistry Summarization
GoTAL '08 Proceedings of the 6th international conference on Advances in Natural Language Processing
Hi-index | 0.00 |
This paper investigates the problem of automatic chemical Term Recognition (TR) and proposes to tackle the problem by fusing Symbolic and statistical techniques. Unlike other solutions described in the literature, which only use complex and costly human made ruledbased matching algorithms, we show that the combination of a seven rules matching algorithm and a naïve Bayes classifier achieves high performances. Through experiments performed on different kind of available Organic Chemistry texts, we show that our hybrid approach is also consistent across different data sets.