A system for the extraction and representation of summary of product characteristics content

Authors:
Stefania Rubrichi;Silvana Quaglini;Alex Spengler;Paola Russo;Patrick Gallinari
Affiliations:
Laboratory for Biomedical Informatics "Mario Stefanelli", Dipartimento di Ingegneria Industriale e dell'Informazione, University of Pavia, via Fearrata 1, 27100 Pavia, Italy;Laboratory for Biomedical Informatics "Mario Stefanelli", Dipartimento di Ingegneria Industriale e dell'Informazione, University of Pavia, via Fearrata 1, 27100 Pavia, Italy;Laboratoire d'Informatique de Paris 6, Université Pierre et Marie Curie, 4 place Jussieu, 75005 Paris, France;Amyloidosis Research and Treatment Center and Department of Biochemistry, IRCCS Policlinico San Matteo FDN and University of Pavia, piazzale Golgi 2, 27100 Pavia, Italy;Laboratoire d'Informatique de Paris 6, Université Pierre et Marie Curie, 4 place Jussieu, 75005 Paris, France
Venue:
Artificial Intelligence in Medicine
Year:
2013

Citing 9
Cited 1

S-CREAM - Semi-automatic CREAtion of Metadata

EKAW '02 Proceedings of the 13th International Conference on Knowledge Engineering and Knowledge Management. Ontologies and the Semantic Web
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Extracting human protein interactions from MEDLINE using a full-sentence parser

Bioinformatics
Ontology Learning and Population from Text: Algorithms, Evaluation and Applications

Ontology Learning and Population from Text: Algorithms, Evaluation and Applications
Semantic retrieval for the accurate identification of relational concepts in massive textbases

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Kernel-based learning for biomedical relation extraction

Journal of the American Society for Information Science and Technology
Event-based information extraction for the biomedical domain: the Caderige project

JNLPBA '04 Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications
Summary of Product Characteristics content extraction for a safe drugs usage

Journal of Biomedical Informatics
Error bounds for convolutional codes and an asymptotically optimum decoding algorithm

IEEE Transactions on Information Theory

Guest Editorial: Artificial intelligence in medicine AIME 2011

Artificial Intelligence in Medicine

Quantified Score

Hi-index	0.00

Visualization

Abstract

Objective: Information about medications is critical in supporting decision-making during the prescription process and thus in improving the safety and quality of care. In this work, we propose a methodology for the automatic recognition of drug-related entities (active ingredient, interaction effects, etc.) in textual drug descriptions, and their further location in a previously developed domain ontology. Methods and material: The summary of product characteristics (SPC) represents the basis of information for health professionals on how to use medicines. However, this information is locked in free-text and, as such, cannot be actively accessed and elaborated by computerized applications. Our approach exploits a combination of machine learning and rule-based methods. It consists of two stages. Initially it learns to classify this information in a structured prediction framework, relying on conditional random fields. The classifier is trained and evaluated using a corpus of about a hundred SPCs. They have been hand-annotated with different semantic labels that have been derived from the domain ontology. At a second stage the extracted entities are added in the domain ontology corresponding concepts as new instances, using a set of rules manually-constructed from the corpus. Results: Our evaluations show that the extraction module exhibits high overall performance, with an average F1-measure of 88% for contraindications and 90% for interactions. Conclusion: SPCs can be exploited to provide structured information for computer-based decision support systems.