A sentence level probabilistic model for evolutionary theme pattern mining from news corpora

  • Authors:
  • Shizhu Liu;Yuval Merhav;Wai Gen Yee;Nazli Goharian;Ophir Frieder

  • Affiliations:
  • Illinois institute of Technology, Chicago, IL;Illinois institute of Technology, Chicago, IL;Illinois institute of Technology, Chicago, IL;Illinois institute of Technology, Chicago, IL;Illinois institute of Technology, Chicago, IL

  • Venue:
  • Proceedings of the 2009 ACM symposium on Applied Computing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Some recent topic model-based methods have been proposed to discover and summarize the evolutionary patterns of themes in temporal text collections. However, the theme patterns extracted by these methods are hard to interpret and evaluate. To produce a more descriptive representation of the theme pattern, we not only give new representations of sentences and themes with named entities, but we also propose a sentence-level probabilistic model based on the new representation pattern. Compared with other topic model methods, our approach not only gets each topic's distribution per term, but also generates candidate summary sentences of the themes as well. Consequently, the results are easier to understand and can be evaluated using the top sentences produced by our probabilistic model. Experimentation with the proposed methods on the Tsunami dataset shows that the proposed methods are useful in the discovery of evolutionary theme patterns.