Cluster-Centric Approach to News Event Extraction

  • Authors:
  • Jakub Piskorski;Hristo Tanev;Martin Atkinson;Erik Van Der Goot

  • Affiliations:
  • Joint Research Centre of the European Commission, Institute for the Protection and Security of the Citizen, Via Fermi 2749, 21027 Ispra, Italy;Joint Research Centre of the European Commission, Institute for the Protection and Security of the Citizen, Via Fermi 2749, 21027 Ispra, Italy;Joint Research Centre of the European Commission, Institute for the Protection and Security of the Citizen, Via Fermi 2749, 21027 Ispra, Italy;Joint Research Centre of the European Commission, Institute for the Protection and Security of the Citizen, Via Fermi 2749, 21027 Ispra, Italy

  • Venue:
  • Proceedings of the 2008 conference on New Trends in Multimedia and Network Information Systems
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a real-time and multilingual news event extraction system developed at the Joint Research Centre of the European Commission. It is capable of accurately and efficiently extracting violent and natural disaster events from online news. In particular, a linguistically relatively lightweight approach is deployed, in which clustered news are heavily exploited at all stages of processing. The paper focuses on the system's architecture, real-time news clustering, geolocating clusters, event extraction grammar development, adapting the system to the processing of new languages, cluster-level information fusion, visual event tracking and accuracy evaluation.