A Multilevel Text Processing Model of Newsgroup Dynamics

  • Authors:
  • G. Sampath;Miroslav Martinovic

  • Affiliations:
  • -;-

  • Venue:
  • NLDB '02 Proceedings of the 6th International Conference on Applications of Natural Language to Information Systems-Revised Papers
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a multilevel model of discussions in USENET newsgroups that includes the use of statistical and linguistic methods to obtain lexical, semantic and discourse characteristics of the text. We expose constraints that make information extraction and summarization more amenable to analysis at different levels. Our model makes use of posting structure, times of posting, time spans, and length and depth of a thread in order to extract higher-level information on subject matter, interest level, topicality, and discussion trends.