Is Hillary Rodham Clinton the president?: disambiguating names across documents

  • Authors:
  • Yael Ravin;Zunaid Kazi

  • Affiliations:
  • T. J. Watson Research Center, Yorktown Heights, NY;T. J. Watson Research Center, Yorktown Heights, NY

  • Venue:
  • CorefApp '99 Proceedings of the Workshop on Coreference and its Applications
  • Year:
  • 1999

Quantified Score

Hi-index 0.00

Visualization

Abstract

A number of research and software development groups have developed name identification technology, but few have addressed the issue of cross-document coreference, or identifying the same named entities across documents. In a collection of documents, where there are multiple discourse contexts, there exists a many-to-many correspondence between names and entities, making it a challenge to automatically map them correctly. Recently, Bagga and Baldwin proposed a method for determining whether two names refer to the same entity by measuring the similarity between the document contexts in which they appear. Inspired by their approach, we have revisited our current cross-document coreference heuristics that make relatively simple decisions based on matching strings and entity types. We have devised an improved and promising algorithm, which we discuss in this paper.