BLANC: Implementing the Rand index for coreference evaluation

  • Authors:
  • M. Recasens; E. Hovy

  • Affiliations:
  • CLiC, University of Barcelona, Gran Via 585, Barcelona 08007, Spain. Email: mrecasens@ub.edu; USC Information Sciences Institute, 4676 Admiralty Way, Marina del Rey, CA 90292, USA. Email: hovy@isi.edu

  • Venue:
  • Natural Language Engineering
  • Year:
  • 2011

Abstract

This paper addresses the current state of coreference resolution evaluation, in which different measures (notably MUC, B³, CEAF, and ACE-value) are applied in different studies. None of them is fully adequate, and their measures are not commensurate. We enumerate the desiderata for a coreference scoring measure, discuss the strong and weak points of the existing measures, and propose the BiLateral Assessment of Noun-Phrase Coreference (BLANC), a variation of the Rand index created to suit the coreference task. BLANC rewards both coreference and non-coreference links by averaging the F-scores of the two types. It does not ignore singletons (the main problem with the MUC score), and it does not inflate the score in their presence (a problem with the B³ and CEAF scores). In addition, its fine granularity is consistent over the whole range of scores and affords better discrimination between systems.
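
The averaging described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that the gold and system partitions cover exactly the same mentions, and it scores an undefined precision, recall, or F-score as 0.0, which is a simplification chosen here; the paper's own treatment of boundary cases is not reproduced. The names `link_labels`, `f_score`, and `blanc_sketch` are hypothetical.

```python
from itertools import combinations


def link_labels(clusters):
    """Map every unordered mention pair to True (coreferent) or False."""
    cluster_of = {m: i for i, c in enumerate(clusters) for m in c}
    mentions = sorted(cluster_of)  # assumes mentions are hashable and sortable
    return {
        (a, b): cluster_of[a] == cluster_of[b]
        for a, b in combinations(mentions, 2)
    }


def f_score(right, predicted, actual):
    """Harmonic mean of precision and recall; 0.0 when undefined (assumption)."""
    p = right / predicted if predicted else 0.0
    r = right / actual if actual else 0.0
    return 2 * p * r / (p + r) if p + r else 0.0


def blanc_sketch(gold, system):
    """Average the F-scores of coreference links and non-coreference links."""
    gold_links = link_labels(gold)
    sys_links = link_labels(system)
    if gold_links.keys() != sys_links.keys():
        raise ValueError("gold and system must cover the same mentions")
    scores = []
    for label in (True, False):  # True: coreference, False: non-coreference
        right = sum(
            1 for pair, v in sys_links.items()
            if v == label and gold_links[pair] == label
        )
        predicted = sum(1 for v in sys_links.values() if v == label)
        actual = sum(1 for v in gold_links.values() if v == label)
        scores.append(f_score(right, predicted, actual))
    return sum(scores) / 2


gold = [{"a", "b", "c"}, {"d"}, {"e"}]
system = [{"a", "b"}, {"c", "d"}, {"e"}]
score = blanc_sketch(gold, system)
```

In this toy example the singletons `d` and `e` contribute non-coreference links rather than being ignored. The coreference F-score is 0.4 and the non-coreference F-score is 0.8, so `score` is 0.6.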