Online news event extraction for global crisis surveillance

  • Authors:
  • Jakub Piskorski;Hristo Tanev;Martin Atkinson;Eric van der Goot;Vanni Zavarella

  • Affiliations:
  • Polish Academy of Sciences, ul. J.K. Ordona, Warszawa, Poland;Joint Research of the European Commission, Ispra, Italy;Joint Research of the European Commission, Ispra, Italy;Joint Research of the European Commission, Ispra, Italy;Joint Research of the European Commission, Ispra, Italy

  • Venue:
  • Transactions on computational collective intelligence V
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

This article presents a real-time and multilingual news event extraction system developed at the Joint Research Centre of the European Commission. It is capable of accurately and efficiently extracting violent and natural disaster events from online news. In particular, a linguistically relatively lightweight approach is deployed, in which clustered news are heavily exploited at all stages of processing. Furthermore, the technique applied for event extraction assumes the inverted-pyramid style of writing news articles, i.e., the most important parts of the story are placed in the beginning and the least important facts are left toward the end. The article focuses on the system's architecture, real-time news clustering, geo-locating and geocoding clusters, event extraction grammar development, adapting the system to the processing of new languages, cluster-level information fusion, visual event tracking, event extraction accuracy evaluation, and detecting event reporting boundaries in news article streams. This article is an extended version of [20].