Exploring semi-supervised coreference resolution of medical concepts using semantic and temporal features

Authors:
Preethi Raghavan;Eric Fosler-Lussier;Albert M. Lai
Affiliations:
The Ohio State University, Columbus, Ohio;The Ohio State University, Columbus, Ohio;The Ohio State University, Columbus, Ohio
Venue:
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Year:
2012

Citing 18
Cited 0

Combining labeled and unlabeled data with co-training

COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Analyzing the effectiveness and applicability of co-training

Proceedings of the ninth international conference on Information and knowledge management
Two biomedical sublanguages: a description based on the theories of Zellig Harris

Journal of Biomedical Informatics - Special issue: Sublanguage
A machine learning approach to coreference resolution of noun phrases

Computational Linguistics - Special issue on computational anaphora resolution
Introduction: named entity recognition in biomedicine

Journal of Biomedical Informatics - Special issue: Named entity recognition in biomedicine
Applying Co-Training to reference resolution

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Task-Oriented Extraction of Temporal Information: The Case of Clinical Narratives

TIME '06 Proceedings of the Thirteenth International Symposium on Temporal Representation and Reasoning
A temporal constraint structure for extracting temporal information from clinical narrative

Journal of Biomedical Informatics
Methodological Review: Temporal reasoning with medical data-A review with emphasis on medical natural language processing

Journal of Biomedical Informatics
Using Kullback-Leibler distance for text categorization

ECIR'03 Proceedings of the 25th European conference on IR research
Alternating projections for learning with expectation constraints

UAI '09 Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence
Supervised noun phrase coreference research: the first fifteen years

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Posterior Regularization for Structured Latent Variable Models

The Journal of Machine Learning Research
A multi-pass sieve for coreference resolution

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Leveraging natural language processing of clinical narratives for phenotype modeling

PIKM '10 Proceedings of the 3rd workshop on Ph.D. students in information and knowledge management
Building timelines from narrative clinical records: initial results based-on deep natural language understanding

BioNLP '11 Proceedings of BioNLP 2011 Workshop
Methodological Review: Coreference resolution: A review of general methodologies and applications in the clinical domain

Journal of Biomedical Informatics
k-Neighborhood decentralization: A comprehensive solution to index the UMLS for large scale knowledge discovery

Journal of Biomedical Informatics

Quantified Score

Hi-index	0.00

Visualization

Abstract

We investigate the task of medical concept coreference resolution in clinical text using two semi-supervised methods, co-training and multi-view learning with posterior regularization. By extracting semantic and temporal features of medical concepts found in clinical text, we create conditionally independent data views; co-training MaxEnt classifiers on this data works almost as well as supervised learning for the task of pairwise coreference resolution of medical concepts. We also train MaxEnt models with expectation constraints, using posterior regularization, and find that posterior regularization performs comparably to or slightly better than co-training. We describe the process of semantic and temporal feature extraction and demonstrate our methods on a corpus of case reports from the New England Journal of Medicine and a corpus of patient narratives obtained from The Ohio State University Wexner Medical Center.