Deterministic coreference resolution based on entity-centric, precision-ranked rules

Authors:
Heeyoung Lee; Angel Chang; Yves Peirsman; Nathanael Chambers; Mihai Surdeanu; Dan Jurafsky
Affiliations:
Stanford University; Stanford University; University of Leuven; United States Naval Academy; University of Arizona; Stanford University
Venue:
Computational Linguistics
Year:
2013

Abstract

We propose a new deterministic approach to coreference resolution that combines the global information and precise features of modern machine-learning models with the transparency and modularity of deterministic, rule-based systems. Our sieve architecture applies a battery of deterministic coreference models one at a time from highest to lowest precision, where each model builds on the previous model's cluster output. The two stages of our sieve-based architecture, a mention detection stage that heavily favors recall, followed by coreference sieves that are precision-oriented, offer a powerful way to achieve both high precision and high recall. Further, our approach makes use of global information through an entity-centric model that encourages the sharing of features across all mentions that point to the same real-world entity. Despite its simplicity, our approach gives state-of-the-art performance on several corpora and genres, and has also been incorporated into hybrid state-of-the-art coreference systems for Chinese and Arabic. Our system thus offers a new paradigm for combining knowledge in rule-based systems that has implications throughout computational linguistics.
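
The sieve idea described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the `Mention` and `Cluster` types, the three toy sieves (`exact_match`, `head_match`, `pronoun_match`), and the gender-only compatibility check are all assumptions made for illustration. The sketch starts from singleton clusters, applies the sieves from the most to the least precise, and lets each sieve decide on whole clusters, so that attributes known for one mention (here, gender) are shared with every other mention of the same entity.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Set


@dataclass
class Mention:  # hypothetical mention record
    idx: int                      # position in document order
    text: str
    head: str
    gender: Optional[str] = None  # None means "unknown"
    is_pronoun: bool = False


@dataclass
class Cluster:  # hypothetical entity: a set of coreferent mentions
    mentions: List[Mention] = field(default_factory=list)


def texts(c: Cluster) -> Set[str]:
    return {m.text.lower() for m in c.mentions if not m.is_pronoun}


def heads(c: Cluster) -> Set[str]:
    return {m.head.lower() for m in c.mentions if not m.is_pronoun}


def genders(c: Cluster) -> Set[str]:
    # Entity-centric: pool the attribute over all mentions of the entity.
    return {m.gender for m in c.mentions if m.gender is not None}


def compatible(a: Cluster, b: Cluster) -> bool:
    ga, gb = genders(a), genders(b)
    return not ga or not gb or bool(ga & gb)


# Toy sieves, listed from (assumed) highest to lowest precision.
def exact_match(ante: Cluster, cur: Cluster) -> bool:
    return bool(texts(ante) & texts(cur))


def head_match(ante: Cluster, cur: Cluster) -> bool:
    return bool(heads(ante) & heads(cur)) and compatible(ante, cur)


def pronoun_match(ante: Cluster, cur: Cluster) -> bool:
    return all(m.is_pronoun for m in cur.mentions) and compatible(ante, cur)


Sieve = Callable[[Cluster, Cluster], bool]


def run_sieves(mentions: List[Mention], sieves: List[Sieve]) -> List[List[int]]:
    # Start from singletons; every pass builds on the previous pass's clusters.
    cluster_of: Dict[int, Cluster] = {m.idx: Cluster([m]) for m in mentions}
    for sieve in sieves:
        for pos, m in enumerate(mentions):
            current = cluster_of[m.idx]
            # Scan candidate antecedents from nearest to farthest.
            for ante in reversed(mentions[:pos]):
                candidate = cluster_of[ante.idx]
                if candidate is current:
                    continue
                if sieve(candidate, current):
                    candidate.mentions.extend(current.mentions)
                    candidate.mentions.sort(key=lambda x: x.idx)
                    for moved in current.mentions:
                        cluster_of[moved.idx] = candidate
                    break
    unique: Dict[int, Cluster] = {}
    for m in mentions:
        unique.setdefault(id(cluster_of[m.idx]), cluster_of[m.idx])
    return [[m.idx for m in c.mentions] for c in unique.values()]


DOC = [
    Mention(0, "Angela Merkel", "Merkel", gender="female"),
    Mention(1, "John Kerry", "Kerry", gender="male"),
    Mention(2, "Merkel", "Merkel"),
    Mention(3, "he", "he", gender="male", is_pronoun=True),
    Mention(4, "John Kerry", "Kerry", gender="male"),
    Mention(5, "she", "she", gender="female", is_pronoun=True),
]
SIEVES = [exact_match, head_match, pronoun_match]

print(run_sieves(DOC, SIEVES))  # [[0, 2, 5], [1, 3, 4]]
```

In this toy document the nearest antecedent of "he" is "Merkel", whose own gender is unknown. Because the head-match pass has already merged "Merkel" with "Angela Merkel", the pronoun pass sees a cluster marked female, rejects it, and moves on to "John Kerry". A mention-pair check on "Merkel" alone would have accepted the wrong antecedent.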