On-line event and topic detection by using the compact sets clustering algorithm

  • Authors:
  • Aurora Pons-Porrata;Rafael Berlanga-Llavori;José/ Ruiz-Shulcloper

  • Affiliations:
  • Universidad de Oriente, 90500 Santiago de Cuba, Cuba. E-mail: aurora@app.uo.edu.cu;Universitat Jaume I, E-12071 Castelló/n, Spain. Tel.: +34 964 269310/ Fax: +34 964 728435/ E-mail: berlanga@lsi.uji.es;Institute of Cybernetics, Mathematics and Physics, 10400 La Habana, Cuba. E-mail: recpat@cidet.icmf.inf.cu

  • Venue:
  • Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology - IBERAMIA '02
  • Year:
  • 2002

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper we propose a new incremental clustering algorithm for Event Detection, which is based on the mathematical properties of the compact sets. Additionally, this algorithm makes use of the temporal references appearing in the document texts to measure the similarity between documents according to the events that they describe. In order to discover the structure of topics and composite events, the clustering algorithm is hierarchically applied to the stream of newspaper articles. Thus, in the first level, documents with a high temporal-semantic similarity are clustered together into events. In the next levels of the hierarchy, these events are successively clustered into more complex events and topics. The evaluation results demonstrate that regarding the temporal references of documents improves the quality of the system-generated clusters, and that the overall performance of the proposed system compares favorably to other on-line detection systems of the literature.