Thematic segmentation of texts: two methods for two kinds of texts

  • Authors:
  • Olivier Ferret;Brigitte Grau;Nicolas Masson

  • Affiliations:
  • LIMSI-CNRS, Orsay, France;LIMSI-CNRS, Orsay, France;LIMSI-CNRS, Orsay, France

  • Venue:
  • COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
  • Year:
  • 1998

Quantified Score

Hi-index 0.00

Visualization

Abstract

To segment texts in thematic units, we present here how a basic principle relying on word distribution can be applied on different kind of texts. We start from an existing method well adapted for scientific texts, and we propose its adaptation to other kinds of texts by using semantic links between words. These relations are found in a lexical network, automatically built from a large corpus. We will compare their results and give criteria to choose the more suitable method according to text characteristics.