A graph-based recovery and decomposition of Swanson's hypothesis using semantic predications

Authors:
Delroy Cameron;Olivier Bodenreider;Hima Yalamanchili;Tu Danh;Sreeram Vallabhaneni;Krishnaprasad Thirunarayan;Amit P. Sheth;Thomas C. Rindflesch
Affiliations:
Ohio Center of Excellence in Knowledge-enabled Computing (Kno.e.sis), Wright State University, Dayton, OH 45435, USA;Lister Hill National Center for Biomedical Communications, National Library of Medicine, 8600 Rockville Pike, Bethesda, MD 20894, USA;Biomedical Sciences Division, Wright State University, Dayton, OH 45435, USA;Biomedical Sciences Division, Wright State University, Dayton, OH 45435, USA;Biomedical Sciences Division, Wright State University, Dayton, OH 45435, USA;Ohio Center of Excellence in Knowledge-enabled Computing (Kno.e.sis), Wright State University, Dayton, OH 45435, USA;Ohio Center of Excellence in Knowledge-enabled Computing (Kno.e.sis), Wright State University, Dayton, OH 45435, USA;Lister Hill National Center for Biomedical Communications, National Library of Medicine, 8600 Rockville Pike, Bethesda, MD 20894, USA
Venue:
Journal of Biomedical Informatics
Year:
2013

Citing 19
Cited 0

Toward discovery support systems: a replication, re-examination, and extension of Swanson's work on literature-based discovery of a connection between Raynaud's and fish oil

Journal of the American Society for Information Science
An interactive system for finding complementary literatures: a stimulus to scientific discovery

Artificial Intelligence - Special issue on scientific discovery
Using latent semantic indexing for literature based discovery

Journal of the American Society for Information Science
Literature-based discovery by lexical statistics

Journal of the American Society for Information Science
Using concepts in literature-based discovery: simulating Swanson's Raynaud-fish oil and migraine-magnesium discoveries

Journal of the American Society for Information Science and Technology
The ρ operator: discovering and ranking associations on the semantic web

ACM SIGMOD Record
Ρ-Queries: enabling querying for semantic associations on the semantic web

WWW '03 Proceedings of the 12th international conference on World Wide Web
LitLinker: capturing connections across the biomedical literature

Proceedings of the 2nd international conference on Knowledge capture
Text mining: generating hypotheses from MEDLINE

Journal of the American Society for Information Science and Technology
The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text

Journal of Biomedical Informatics - Special issue: Unified medical language system
SemRank: ranking complex relationship search results on the semantic web

WWW '05 Proceedings of the 14th international conference on World Wide Web
On six degrees of separation in DBLP-DB and more

ACM SIGMOD Record
Mining MEDLINE for implicit links between dietary substances and diseases

Bioinformatics
Using statistical and knowledge-based approaches for literature-based discovery

Journal of Biomedical Informatics
SPARQ2L: towards support for subgraph extraction queries in rdf databases

Proceedings of the 16th international conference on World Wide Web
Semantics-empowered text exploration for knowledge discovery

Proceedings of the 48th Annual Southeast Regional Conference
An up-to-date knowledge-based literature search and exploration framework for focused bioscience domains

Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium
Semantic Predications for Complex Information Needs in Biomedical Literature

BIBM '11 Proceedings of the 2011 IEEE International Conference on Bioinformatics and Biomedicine
Literature-based discovery: Beyond the ABCs

Journal of the American Society for Information Science and Technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

Objectives: This paper presents a methodology for recovering and decomposing Swanson's Raynaud Syndrome-Fish Oil hypothesis semi-automatically. The methodology leverages the semantics of assertions extracted from biomedical literature (called semantic predications) along with structured background knowledge and graph-based algorithms to semi-automatically capture the informative associations originally discovered manually by Swanson. Demonstrating that Swanson's manually intensive techniques can be undertaken semi-automatically, paves the way for fully automatic semantics-based hypothesis generation from scientific literature. Methods: Semantic predications obtained from biomedical literature allow the construction of labeled directed graphs which contain various associations among concepts from the literature. By aggregating such associations into informative subgraphs, some of the relevant details originally articulated by Swanson have been uncovered. However, by leveraging background knowledge to bridge important knowledge gaps in the literature, a methodology for semi-automatically capturing the detailed associations originally explicated in natural language by Swanson, has been developed. Results: Our methodology not only recovered the three associations commonly recognized as Swanson's hypothesis, but also decomposed them into an additional 16 detailed associations, formulated as chains of semantic predications. Altogether, 14 out of the 19 associations that can be attributed to Swanson were retrieved using our approach. To the best of our knowledge, such an in-depth recovery and decomposition of Swanson's hypothesis has never been attempted. Conclusion: In this work therefore, we presented a methodology to semi-automatically recover and decompose Swanson's RS-DFO hypothesis using semantic representations and graph algorithms. Our methodology provides new insights into potential prerequisites for semantics-driven Literature-Based Discovery (LBD). Based on our observations, three critical aspects of LBD include: (1) the need for more expressive representations beyond Swanson's ABC model; (2) an ability to accurately extract semantic information from text; and (3) the semantic integration of scientific literature and structured background knowledge.