Creating and evaluating a consensus for negated and speculative words in a Swedish clinical corpus

  • Authors:
  • Hercules Dalianis;Maria Skeppstedt

  • Affiliations:
  • Stockholm University, Kista, Sweden;Stockholm University, Kista, Sweden

  • Venue:
  • NeSp-NLP '10 Proceedings of the Workshop on Negation and Speculation in Natural Language Processing
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we describe the creation of a consensus corpus that was obtained through combining three individual annotations of the same clinical corpus in Swedish. We used a few basic rules that were executed automatically to create the consensus. The corpus contains negation words, speculative words, uncertain expressions and certain expressions. We evaluated the consensus using it for negation and speculation cue detection. We used Stanford NER, which is based on the machine learning algorithm Conditional Random Fields for the training and detection. For comparison we also used the clinical part of the BioScope Corpus and trained it with Stanford NER. For our clinical consensus corpus in Swedish we obtained a precision of 87.9 percent and a recall of 91.7 percent for negation cues, and for English with the Bioscope Corpus we obtained a precision of 97.6 percent and a recall of 96.7 percent for negation cues.