Text segmentation by clustering cohesion

  • Authors: Raúl Abella Pérez; José Eladio Medina Pagola
  • Affiliations: Advanced Technologies Application Centre, Ciudad de la Habana, Cuba; Advanced Technologies Application Centre, Ciudad de la Habana, Cuba
  • Venue: CIARP'10 Proceedings of the 15th Iberoamerican congress conference on Progress in pattern recognition, image analysis, computer vision, and applications
  • Year: 2010

Abstract

Automatic linear text segmentation, which aims to detect the best topic boundaries, is a difficult but very useful task in many text processing systems. Several methods have addressed this problem with reasonable results, but they also have drawbacks. In this work, we propose a new method, called ClustSeg, which uses a predefined window and a clustering algorithm to decide topic cohesion. We compare our proposal against the best-known methods and find that it outperforms them.