Coreference resolution: an empirical study based on SemEval-2010 shared Task 1

Authors:
Lluís Màrquez;Marta Recasens;Emili Sapena
Affiliations:
Departament de Llenguatges i Sistemes Informàtics, TALP Research Center, Universitat Politècnica de Catalunya, Barcelona, Spain 08034;Departament de Lingüística, CLiC Research Center, Universitat de Barcelona, Barcelona, Spain 08007;Departament de Llenguatges i Sistemes Informàtics, TALP Research Center, Universitat Politècnica de Catalunya, Barcelona, Spain 08034
Venue:
Language Resources and Evaluation
Year:
2013

Citing 33
Cited 1

On the foundations of relaxation labeling processes

Readings in computer vision: issues, problems, principles, and paradigms
C4.5: programs for machine learning

C4.5: programs for machine learning
A machine learning approach to coreference resolution of noun phrases

Computational Linguistics - Special issue on computational anaphora resolution
Reference resolution beyond coreference: a conceptual frame and its application

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
A model-theoretic coreference scoring scheme

MUC6 '95 Proceedings of the 6th conference on Message understanding
Improving machine learning approaches to coreference resolution

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
A mention-synchronous coreference resolution algorithm based on the Bell tree

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
On coreference resolution performance metrics

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
A large-scale exploration of effective global features for a joint entity detection and tracking model

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Extracting product features and opinions from reviews

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Two uses of anaphora resolution in summarization

Information Processing and Management: an International Journal
OntoNotes: A Unified Relational Semantic Representation

ICSC '07 Proceedings of the International Conference on Semantic Computing
Enforcing transitivity in coreference resolution

HLT-Short '08 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers
Using coreference chains for text summarization

CorefApp '99 Proceedings of the Workshop on Coreference and its Applications
Using coreference for question answering

CorefApp '99 Proceedings of the Workshop on Coreference and its Applications
Understanding the value of features for coreference resolution

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Using decision trees for conference resolution

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2
Conundrums in noun phrase coreference resolution: making sense of the state-of-the-art

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
A Deeper Look into Features for Coreference Resolution

DAARC '09 Proceedings of the 7th Discourse Anaphora and Anaphor Resolution Colloquium on Anaphora Processing and Applications
Supervised models for coreference resolution

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Supervised noun phrase coreference research: the first fifteen years

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Beyond NomBank: a study of implicit arguments for nominal predicates

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Coreference resolution with reconcile

ACLShort '10 Proceedings of the ACL 2010 Conference Short Papers
SemEval-2010 task 1: Coreference resolution in multiple languages

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
SemEval-2010 task 10: Linking events and their participants in discourse

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
RelaxCor: A global relaxation labeling approach to coreference resolution

SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
Machine reading at the University of Washington

FAM-LbR '10 Proceedings of the NAACL HLT 2010 First International Workshop on Formalisms and Methodology for Learning by Reading
AnCora-CO: Coreferentially annotated corpora for Spanish and Catalan

Language Resources and Evaluation
Recognising entailment within discourse

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Evaluation metrics for end-to-end coreference resolution systems

SIGDIAL '10 Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue
A global relaxation labeling approach to coreference resolution

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
CoNLL-2011 shared task: modeling unrestricted coreference in OntoNotes

CONLL Shared Task '11 Proceedings of the Fifteenth Conference on Computational Natural Language Learning: Shared Task
Blanc: Implementing the rand index for coreference evaluation

Natural Language Engineering

A constraint-based hypergraph partitioning approach to coreference resolution

Computational Linguistics

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents an empirical evaluation of coreference resolution that covers several interrelated dimensions. The main goal is to complete the comparative analysis from the SemEval-2010 task on Coreference Resolution in Multiple Languages. To do so, the study restricts the number of languages and systems involved, but extends and deepens the analysis of the system outputs, including a more qualitative discussion. The paper compares three automatic coreference resolution systems for three languages (English, Catalan and Spanish) in four evaluation settings, and using four evaluation measures. Given that our main goal is not to provide a comparison between resolution algorithms, these are merely used as tools to shed light on the different conditions under which coreference resolution is evaluated. Although the dimensions are strongly interdependent, making it very difficult to extract general principles, the study reveals a series of interesting issues in relation to coreference resolution: the portability of systems across languages, the influence of the type and quality of input annotations, and the behavior of the scoring measures.