Building a hierarchy of events and topics for newspaper digital libraries

  • Authors:
  • Aurora Pons-Porrata;Rafael Berlanga-Llavori;José Ruiz-Shulcloper

  • Affiliations:
  • Universidad de Oriente, Santiago de Cuba, Cuba;Universitat Jaume I, Castellón, Spain;Institute of Cybernetics, Mathematics and Physics, La Habana, Cuba

  • Venue:
  • ECIR'03 Proceedings of the 25th European conference on IR research
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we propose an incremental hierarchical clustering algorithm for on-line event detection. This algorithm is applied to a set of newspaper articles in order to discover the structure of topics and events that they describe. In the first level, articles with a high temporal-semantic similarity are clustered together into events. In the next levels of the hierarchy, these events are successively clustered so that composite events and topics can be discovered. The results obtained for the F1-measure and the Detection Cost demonstrate the validity of our algorithm for on-line event detection tasks.