Multi-document summarization using sentence-based topic models

  • Authors:
  • Dingding Wang;Shenghuo Zhu;Tao Li;Yihong Gong

  • Affiliations:
  • Florida International University, Miami, FL;NEC Laboratories America, Cupertino, CA;Florida International University, Miami, FL;NEC Laboratories America, Cupertino, CA

  • Venue:
  • ACLShort '09 Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
  • Year:
  • 2009

Quantified Score

Hi-index 0.01

Visualization

Abstract

Most of the existing multi-document summarization methods decompose the documents into sentences and work directly in the sentence space using a term-sentence matrix. However, the knowledge on the document side, i.e. the topics embedded in the documents, can help the context understanding and guide the sentence selection in the summarization procedure. In this paper, we propose a new Bayesian sentence-based topic model for summarization by making use of both the term-document and term-sentence associations. An efficient variational Bayesian algorithm is derived for model parameter estimation. Experimental results on benchmark data sets show the effectiveness of the proposed model for the multi-document summarization task.