Fine-Grained certainty level annotations used for coarser-grained e-health scenarios: certainty classification of diagnostic statements in swedish clinical text

Authors:
Sumithra Velupillai;Maria Kvist
Affiliations:
Dept. of Computer and Systems Sciences (DSV), Stockholm University, Kista, Sweden;Dept. of Computer and Systems Sciences (DSV), Stockholm University, Kista, Sweden and Dept. of Clinical Immunology and Transfusion Medicine, Karolinska University Hospital, Stockholm, Sweden
Venue:
CICLing'12 Proceedings of the 13th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part II
Year:
2012

Citing 3
Cited 0

Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
A factuality profiler for eventualities in text

A factuality profiler for eventualities in text
ConText: An algorithm for determining negation, experiencer, and temporal status from clinical reports

Journal of Biomedical Informatics

Quantified Score

Hi-index	0.00

Visualization

Abstract

An important task in information access methods is distinguishing factual information from speculative or negated information. Fine-grained certainty levels of diagnostic statements in Swedish clinical text are annotated in a corpus from a medical university hospital. The annotation model has two polarities (positive and negative) and three certainty levels. However, there are many e-health scenarios where such fine-grained certainty levels are not practical for information extraction. Instead, more coarse-grained groups are needed. We present three scenarios: adverse event surveillance, decision support alerts and automatic summaries and collapse the fine-grained certainty level classifications into coarser-grained groups. We build automatic classifiers for each scenario and analyze the results quantitatively. Annotation discrepancies are analyzed qualitatively through manual corpus analysis. Our main findings are that it is feasible to use a corpus of fine-grained certainty level annotations to build classifiers for coarser-grained real-world scenarios: 0.89, 0.91 and 0.8 F-score (overall average).