Dictionary-based biological concept extraction is still the state-of-the-art approach to large-scale biomedical literature annotation and indexing. Exact dictionary lookup is very simple, but it tends to achieve low extraction recall because a biological term often has many variants and no dictionary can collect all of them. We propose a generic extraction approach, referred to as approximate dictionary lookup, to cope with term variations, and implement it as an extraction system called MaxMatcher. The basic idea of this approach is to match a concept on its significant words rather than on all of its words. The new approach dramatically improves extraction recall while maintaining precision. In a comparative study on the GENIA corpus, the new approach reaches 57% recall, while exact dictionary lookup achieves only 26%.
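The contrast between exact and approximate lookup can be illustrated with a minimal sketch. It is not MaxMatcher's actual algorithm, which the abstract does not specify. The toy dictionary, the function names, the significance weighting (a word shared by fewer concepts counts as more significant) and the 0.7 threshold are all assumptions made for illustration.

```python
from collections import defaultdict

# Toy dictionary (hypothetical entries): concept ID -> known term variants.
DICTIONARY = {
    "C1": ["nuclear factor kappa b"],
    "C2": ["tumor necrosis factor alpha"],
}


def tokenize(text):
    """Lowercase, treat hyphens as spaces, split on whitespace."""
    return text.lower().replace("-", " ").split()


def build_index(dictionary):
    """Map each dictionary word to the set of concepts that contain it."""
    word_to_concepts = defaultdict(set)
    for concept_id, variants in dictionary.items():
        for variant in variants:
            for word in tokenize(variant):
                word_to_concepts[word].add(concept_id)
    return word_to_concepts


def significance(word, index):
    """Assumed weighting: words shared by fewer concepts are more significant."""
    n_concepts = len(index.get(word, ()))
    return 1.0 / n_concepts if n_concepts else 0.0


def exact_lookup(phrase, dictionary):
    """Return concepts that have a variant identical to the phrase, token by token."""
    tokens = tokenize(phrase)
    return {
        concept_id
        for concept_id, variants in dictionary.items()
        if any(tokenize(variant) == tokens for variant in variants)
    }


def approximate_lookup(phrase, dictionary, index, threshold=0.7):
    """Return {concept: score} for concepts whose significant words are mostly present."""
    phrase_words = set(tokenize(phrase))
    hits = {}
    for concept_id, variants in dictionary.items():
        best = 0.0
        for variant in variants:
            variant_words = set(tokenize(variant))
            total = sum(significance(w, index) for w in variant_words)
            matched = sum(significance(w, index) for w in variant_words & phrase_words)
            if total:
                best = max(best, matched / total)
        if best >= threshold:
            hits[concept_id] = best
    return hits


INDEX = build_index(DICTIONARY)
print(exact_lookup("tumor necrosis alpha", DICTIONARY))
print(approximate_lookup("tumor necrosis alpha", DICTIONARY, INDEX))
```

In this toy setup the variant "tumor necrosis alpha" is missed by exact lookup but still matches concept C2, because the dropped word "factor" is shared with another concept and therefore carries less weight. The sketch does not penalize extra words in the phrase, so a real system would need further checks to keep precision from dropping.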