Visualization of a document collection: the vibe system
Information Processing and Management: an International Journal
Web document clustering: a feasibility demonstration
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Grouper: a dynamic clustering interface to Web search results
WWW '99 Proceedings of the eighth international conference on World Wide Web
A vector space model for automatic indexing
Communications of the ACM
A reference ontology for biomedical informatics: the foundational model of anatomy
Journal of Biomedical Informatics - Special issue: Unified medical language system
Using reasoning to guide annotation with gene ontology terms in GOAT
ACM SIGMOD Record
A Concept-Driven Algorithm for Clustering Search Results
IEEE Intelligent Systems
An application of text categorization methods to gene ontology annotation
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Visual analytics: Storylines: Visual exploration and analysis in latent semantic spaces
Computers and Graphics
Promoting Insight-Based Evaluation of Visualizations: From Contest to Benchmark Repository
IEEE Transactions on Visualization and Computer Graphics
IVEA: an information visualization tool for personalized exploratory document collection analysis
ESWC'08 Proceedings of the 5th European semantic web conference on The semantic web: research and applications
A Relation Mining and Visualization Framework for Automated Text Summarization
PReMI '09 Proceedings of the 3rd International Conference on Pattern Recognition and Machine Intelligence
Journal of Biomedical Informatics
Hi-index | 0.00 |
As biomedical science progresses, ontologies play an increasingly important role in easing the understanding of biomedical information. Although much research, such as Gene Ontology annotation, has been proposed to utilize ontologies to help users understand biomedical information easily, most of the research does not focus on capturing gene-related terms and their relationships within biomedical document collections. Understanding key gene-related terms as well as their semantic relationships is essential for comprehending the conceptual structure of biomedical document collections and avoiding information overload for users. To address this issue, we propose a novel approach called `GOClonto' to automatically generate ontologies for conceptualization of biomedical document collections. Based on GO (Gene Ontology), GOClonto extracts gene-related terms from biomedical text, applies latent semantic analysis to identify key gene-related terms, allocates documents based on the key gene-related terms, and utilizes GO to automatically generate a corpus-related gene ontology. The experimental results show that GOClonto is able to identify key gene-related terms. For a test biomedical document collection, GOClonto shows better performance than other clustering algorithms in terms of F-measure. Moreover, the ontology generated by GOClonto shows a significant informative conceptual structure.