Text representation for intelligent text retrieval: a classification-oriented view
Text-based intelligent systems
Learned vector-space models for document retrieval
TREC-2 Proceedings of the second conference on Text retrieval conference
Computational Methods for Intelligent Information Access
Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
Graph Drawing: Algorithms for the Visualization of Graphs
Graph Drawing: Algorithms for the Visualization of Graphs
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
The Frame-Based Module of the SUISEKI Information Extraction System
IEEE Intelligent Systems
Extracting molecular binding relationships from biomedical text
ANLC '00 Proceedings of the sixth conference on Applied natural language processing
Protein association discovery in biomedical literature
Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries
SPIRO-V: a collaborative approach to controlled vocabularies gathering and management
Proceedings of the 10th annual joint conference on Digital libraries
Hi-index | 0.00 |
There is a large and growing body of web accessible biomedical literature. As this body of electronic literature grows, so does the possibility that document analysis techniques can be used to automatically extract useful biomedical information from them, particularly in the discovery of key concepts dealing with genes, proteins, drugs, and diseases and associations among these concepts. VCGS (Vocabulary Cluster Generating System) was designed to automatically extract and determine associations among tokens from a subset of biomedical literature namely cancer. Such information has notable potential to automate database construction in biomedicine, instead of relying on experts' analysis. This paper reports on the mechanisms for automatically generating clusters of tokens. A formal evaluation of the system, based on a subset of 5338 Pubmed titles and abstracts, has been conducted against the Swiss-Prot database in which the associations among concepts are entered by experts by hand.