Evaluating hybrid versus data-driven coreference resolution

  • Authors:
  • Iris Hendrickx;Veronique Hoste;Walter Daelemans

  • Affiliations:
  • University of Antwerp, CNTS, Language Technology Group Universiteitsplein 1, Antwerp, Belgium;University College Ghent, Language and Translation Technology Team;University of Antwerp, CNTS, Language Technology Group Universiteitsplein 1, Antwerp, Belgium

  • Venue:
  • DAARC'07 Proceedings of the 6th discourse anaphora and anaphor resolution conference on Anaphora: analysis, algorithms and applications
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we present a systematic evaluation of a hybrid approach of combined rule-based filtering and machine learning to Dutch coreference resolution. Through the application of a selection of linguistically-motivated negative and positive filters, which we apply in isolation and combined, we study the effect of these filters on precision and recall using two different learning techniques: memory-based learning and maximum entropy modeling. Our results show that by using the hybrid approach, we can reduce up to 92% of the training material without performance loss. We also show that the filters improve the overall precision of the classifiers leading to higher F-scores on the test set.