Evaluating hybrid versus data-driven coreference resolution

Authors:
Iris Hendrickx; Veronique Hoste; Walter Daelemans
Affiliations:
University of Antwerp, CNTS, Language Technology Group, Universiteitsplein 1, Antwerp, Belgium; University College Ghent, Language and Translation Technology Team; University of Antwerp, CNTS, Language Technology Group, Universiteitsplein 1, Antwerp, Belgium
Venue:
DAARC'07 Proceedings of the 6th discourse anaphora and anaphor resolution conference on Anaphora: analysis, algorithms and applications
Year:
2007

Abstract

In this paper, we present a systematic evaluation of a hybrid approach to Dutch coreference resolution that combines rule-based filtering with machine learning. We apply a selection of linguistically motivated negative and positive filters, both in isolation and in combination, and study their effect on precision and recall with two learning techniques: memory-based learning and maximum entropy modeling. Our results show that the hybrid approach reduces the training material by up to 92% without loss of performance. The filters also improve the overall precision of the classifiers, leading to higher F-scores on the test set.
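
As a rough illustration of the filtering idea, the following sketch shows how negative and positive filters might remove candidate mention pairs before a learner sees them, and how precision, recall and F-score can be computed over pairwise decisions. Everything in it is an assumption made for illustration: the `Mention` fields, the number-agreement and string-match rules, and the pairwise scoring are hypothetical and are not taken from the paper, which does not list its filters or its scoring scheme in the abstract.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Mention:
    """Hypothetical mention representation (not from the paper)."""
    text: str
    number: str  # "sg" or "pl"


def negative_filter(anaphor: Mention, antecedent: Mention) -> bool:
    """Illustrative rule: rule out pairs that disagree in number."""
    return anaphor.number != antecedent.number


def positive_filter(anaphor: Mention, antecedent: Mention) -> bool:
    """Illustrative rule: accept pairs whose strings match exactly."""
    return anaphor.text.lower() == antecedent.text.lower()


def apply_filters(pairs):
    """Split pairs into filter-decided labels and indices left for the learner."""
    decided = {}    # pair index -> label assigned by a filter
    remaining = []  # pair indices still passed to the classifier
    for i, (anaphor, antecedent) in enumerate(pairs):
        if negative_filter(anaphor, antecedent):
            decided[i] = False
        elif positive_filter(anaphor, antecedent):
            decided[i] = True
        else:
            remaining.append(i)
    return decided, remaining


def precision_recall_f(predicted, gold):
    """Pairwise precision, recall and F-score over boolean labels."""
    tp = sum(1 for p, g in zip(predicted, gold) if p and g)
    fp = sum(1 for p, g in zip(predicted, gold) if p and not g)
    fn = sum(1 for p, g in zip(predicted, gold) if not p and g)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f_score = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f_score


# Toy Dutch candidate pairs (anaphor, antecedent); invented for this sketch.
pairs = [
    (Mention("hij", "sg"), Mention("de mannen", "pl")),
    (Mention("de minister", "sg"), Mention("De minister", "sg")),
    (Mention("hij", "sg"), Mention("de minister", "sg")),
    (Mention("zij", "pl"), Mention("de minister", "sg")),
]
decided, remaining = apply_filters(pairs)
reduction = 1 - len(remaining) / len(pairs)  # share of pairs removed by filters
```

In this toy setting only the pairs in `remaining` would be handed to a classifier such as a memory-based learner or a maximum entropy model, and `reduction` is the fraction of instances the filters removed.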