A lightweight approach for extracting disease-symptom relation with metamap toward automated generation of disease knowledge base

  • Authors:
  • Takashi Okumura;Yuka Tateisi

  • Affiliations:
  • National Institute of Public Health, Saitama, Japan;National Institute of Public Health, Saitama, Japan

  • Venue:
  • HIS'12 Proceedings of the First international conference on Health Information Science
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Diagnostic decision support systems necessitate disease knowledge base, and this part may occupy dominant portion in the total development cost of such systems. Accordingly, toward automated generation of disease knowledge base, we conducted a preliminary study for efficient extraction of symptomatic expressions, utilizing MetaMap, a tool for assigning UMLS (Unified Medical Language System) semantic tags onto phrases in a given medical literature text. We first utilized several tags in the MetaMap output, related to symptoms and findings, for extraction of symptomatic terms. This straightforward approach resulted in Recall 82% and Precision 64%. Then, we applied a heuristics that exploits certain patterns of tag sequences that frequently appear in typical symptomatic expressions. This simple approach achieved 7% recall gain, without sacrificing precision. Although the extracted information requires manual inspection, the study suggested that the simple approach can extract symptomatic expressions, at very low cost. Failure analysis of the output was also performed to further improve the performance.