Population of a knowledge base for news metadata from unstructured text and web data

  • Authors:
  • Rosa Stern;Benoît Sagot

  • Affiliations:
  • France AFP-Medialab, Paris, France;INRIA-Alpage, Paris, France

  • Venue:
  • AKBC-WEKEX '12 Proceedings of the Joint Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a practical use case of knowledge base (KB) population at the French news agency AFP. The target KB instances are entities relevant for news production and content enrichment. In order to acquire uniquely identified entities over news wires, i.e. textual data, and integrate the resulting KB in the Linked Data framework, a series of data models need to be aligned: Web data resources are harvested for creating a wide coverage entity database, which is in turn used to link entities to their mentions in French news wires. Finally, the extracted entities are selected for instantiation in the target KB. We describe our methodology along with the resources created and used for the target KB population.