A bootstrapping approach for robust topic analysis

  • Authors:
  • Olivier Ferret;Brigitte Grau

  • Affiliations:
  • LIMSI-CNRS, BP 133, 91403 Orsay Cedex, France;LIMSI-CNRS, BP 133, 91403 Orsay Cedex, France

  • Venue:
  • Natural Language Engineering
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

Topic analysis is important for many applications dealing with texts, such as text summarization or information extraction. However, it can be done with great precision only if it relies on structured knowledge, which is difficult to produce on a large scale. In this paper, we propose using bootstrapping to solve this problem: a first topic analysis based on a weakly structured source of knowledge, a collocation network, is used for learning explicit topic representations that then support a more precise and reliable topic analysis.