Lexical patterns, features and knowledge resources for coreference resolution in clinical notes

Authors:
Phil Gooch; Abdul Roudsari
Affiliations:
Centre for Health Informatics, City University, London, UK; Centre for Health Informatics, City University, London, UK and School of Health Information Science, University of Victoria, BC, Canada
Venue:
Journal of Biomedical Informatics
Year:
2012

Abstract

Generation of entity coreference chains provides a means to extract linked narrative events from clinical notes, but although coreference resolution is a well-researched topic in natural language processing, general-purpose coreference tools perform poorly on clinical texts. This paper presents a knowledge-centric and pattern-based approach to resolving coreference across a wide variety of clinical records from two corpora (Ontology Development and Information Extraction (ODIE) and i2b2/VA), and describes a method for generating coreference chains using progressively pruned linked lists that reduces the search space and facilitates evaluation by a number of metrics. Independent evaluation results give an F-measure for each corpus of 79.2% and 87.5%, respectively. A baseline of blind coreference of mentions of the same class gives F-measures of 65.3% and 51.9%, respectively. For the ODIE corpus, recall is significantly improved over the baseline (p0.05). For the i2b2/VA corpus, recall, precision, and F-measure are significantly improved over the baseline (p