Extracting information from summary of product characteristics for improving drugs prescription safety

  • Authors:
  • Stefania Rubrichi;Silvana Quaglini;Alex Spengler;Patrick Gallinari

  • Affiliations:
  • Laboratory for Biomedical Informatics "Mario Stefanelli", Department of Computers and Systems Science, University of Pavia, Pavia, Italy;Laboratory for Biomedical Informatics "Mario Stefanelli", Department of Computers and Systems Science, University of Pavia, Pavia, Italy;Laboratoire d'Informatique de Paris 6, Université Pierre et Marie Curie, Paris, France;Laboratoire d'Informatique de Paris 6, Université Pierre et Marie Curie, Paris, France

  • Venue:
  • AIME'11 Proceedings of the 13th conference on Artificial intelligence in medicine
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Information about medications is critical in supporting decision-making during the prescription process and thus in improving the safety and quality of care. The Summary of Product Characteristics (SPC) represents the basis of information for health professionals on how to use medicines. However, this information is locked in free-text and, as such, cannot be actively accessed and elaborated by computerized applications. In this work, we propose a machine learning based system for the automatic recognition of drug-related entities (active ingredient, interaction effects, etc.) in SPCs, focusing on drug interactions. Our approach learns to classify this information in a structured prediction framework, relying on conditional random fields. The classifier is trained and evaluated using a corpus of a hundred SPCs. They have been hand-annotated with thirteen semantic labels that have been derived from a previously developed domain ontology. Our evaluations show that the model exhibits high overall performance, with an average F1-measure of about 90%.