Ontology-centric integration and navigation of the dengue literature

  • Authors:
  • Menaka Rajapakse;Rajaraman Kanagasabai;Wee Tiong Ang;Anitha Veeramani;Mark J. Schreiber;Christopher J. O. Baker

  • Affiliations:
  • Institute for Infocomm Research (I2R), Data Mining Department, 21, Heng Mui Keng Terrace, Singapore 119613, Singapore;Institute for Infocomm Research (I2R), Data Mining Department, 21, Heng Mui Keng Terrace, Singapore 119613, Singapore;Institute for Infocomm Research (I2R), Data Mining Department, 21, Heng Mui Keng Terrace, Singapore 119613, Singapore;Institute for Infocomm Research (I2R), Data Mining Department, 21, Heng Mui Keng Terrace, Singapore 119613, Singapore;Novartis Institute for Tropical Diseases Pte Ltd. (NITD), #05-01 Chromos, 10 Biopolis Road, Singapore 138670, Singapore;Institute for Infocomm Research (I2R), Data Mining Department, 21, Heng Mui Keng Terrace, Singapore 119613, Singapore

  • Venue:
  • Journal of Biomedical Informatics
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Uninhibited access to the unstructured information distributed across the web and in scientific literature databases continues to be beyond the reach of scientists and health professionals. To address this challenge we have developed a literature driven, ontology-centric navigation infrastructure consisting of a content acquisition engine, a domain-specific ontology (in OWL-DL) and an ontology instantiation pipeline delivering sentences derived by domain-specific text mining. A visual query tool for reasoning over A-box instances in the populated ontology is presented and used to build conceptual queries that can be issued to the knowledgebase. We have deployed this generic infrastructure to facilitate data integration and knowledge sharing in the domain of dengue, which is one of the most prevalent viral diseases that continue to infect millions of people in the tropical and subtropical regions annually. Using our unique methodology we illustrate simplified search and discovery on dengue information derived from distributed resources and aggregated according to dengue ontology. Furthermore we apply data mining to the instantiated ontology to elucidate trends in the mentions of dengue serotypes in scientific abstracts since 1974.