Finding Text Boundaries and Finding Topic Boundaries: Two Different Tasks?

  • Authors:
  • Alexandre Labadié; Violaine Prince

  • Affiliations:
  • LIRMM, Montpellier Cedex 5, France 34392; LIRMM, Montpellier Cedex 5, France 34392

  • Venue:
  • GoTAL '08 Proceedings of the 6th international conference on Advances in Natural Language Processing
  • Year:
  • 2008

Abstract

The goal of this paper is to demonstrate that the usual evaluation methods for text segmentation are not suited to every task related to text segmentation. To do so, we distinguish the task of finding text boundaries in a corpus of concatenated texts from the task of finding transitions between topics within a single text. We worked on a corpus of twenty-two French political discourses, trying to find the boundaries between them when they are concatenated, and to find the topic boundaries inside them when they are not. We compared the results of our distance-based method to those of the well-known C99 algorithm.