Identifying important concepts from medical documents

  • Authors:
  • Quanzhi Li;Yi-Fang Brook Wu

  • Affiliations:
  • Information Systems Department, New Jersey Institute of Technology, Newark, NJ 07102, USA;Information Systems Department, New Jersey Institute of Technology, Newark, NJ 07102, USA

  • Venue:
  • Journal of Biomedical Informatics
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Automated medical concept recognition is important for medical informatics such as medical document retrieval and text mining research. In this paper, we present a software tool called keyphrase identification program (KIP) for identifying topical concepts from medical documents. KIP combines two functions: noun phrase extraction and keyphrase identification. The former automatically extracts noun phrases from medical literature as keyphrase candidates. The latter assigns weights to extracted noun phrases for a medical document based on how important they are to that document and how domain specific they are in the medical domain. The experimental results show that our noun phrase extractor is effective in identifying noun phrases from medical documents, so is the keyphrase extractor in identifying important medical conceptual terms. They both performed better than the systems they were compared to.