CorrRank: update summarization based on topic correlation analysis

  • Authors:
  • Lei Huang;Yanxiang He

  • Affiliations:
  • School of Computing, Wuhan University, Wuhan, China;School of Computing, Wuhan University, Wuhan, China

  • Venue:
  • ICIC'10 Proceedings of the Advanced intelligent computing theories and applications, and 6th international conference on Intelligent computing
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we propose a novel update summarization framework based on topic correlation analysis. The topics are first extracted from the two document sets provided in the task of update summarization by means of Latent Dirichlet Allocation (LDA) topic model. Then, the correlation between the new topics and the old topics are identified, based on which we further defined four categories of topic evolution patterns to capture the topic shift between the two document collections. We develop a new sentence ranking algorithm, i.e. CorrRank, which fully incorporates the topic evolution in the process of sentence ranking and sentence selection in update summarization. We choose the DUC 2008 and 2009 query-oriented multi-document update summarization tasks to examine the proposed model. Experimental results show the effectiveness of the LDA topic correlation analysis based update summarization framework.