Disease mention recognition with specific features

  • Authors:
  • Md. Faisal Mahbub Chowdhury;Alberto Lavelli

  • Affiliations:
  • Human Language Technology Research Unit, Fondazione Bruno Kessler, Trento, Italy and University of Trento, Italy;University of Trento, Italy

  • Venue:
  • BioNLP '10 Proceedings of the 2010 Workshop on Biomedical Natural Language Processing
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Despite an increasing amount of research on biomedical named entity recognition, there has been not enough work done on disease mention recognition. Difficulty of obtaining adequate corpora is one of the key reasons which hindered this particular research. Previous studies argue that correct identification of disease mentions is the key issue for further improvement of the disease-centric knowledge extraction tasks. In this paper, we present a machine learning based approach that uses a feature set tailored for disease mention recognition and outperforms the state-of-the-art results. The paper also discusses why a feature set for the well studied gene/protein mention recognition task is not necessarily equally effective for other biomedical semantic types such as diseases.