Structured named entities in two distinct press corpora: contemporary broadcast news and old newspapers

  • Authors:
  • Sophie Rosset; Cyril Grouin; Karën Fort; Olivier Galibert; Juliette Kahn; Pierre Zweigenbaum

  • Affiliations:
  • LIMSI-CNRS, France; LIMSI-CNRS, France; INIST-CNRS, France and LIPN, France; LNE, France; LNE, France; LIMSI-CNRS, France

  • Venue:
  • LAW VI '12 Proceedings of the Sixth Linguistic Annotation Workshop
  • Year:
  • 2012

Abstract

This paper compares the reference annotation of structured named entities in two corpora with different origins and properties. It addresses two questions raised by this comparison. First, what specific issues arose from reusing the same annotation scheme on a corpus that differs from the first in medium and predates it by more than a century? Second, what contrasts were observed in the resulting annotations across the two corpora?