A hybrid sentence ordering strategy in multi-document summarization

  • Authors:
  • Yanxiang He;Dexi Liu;Hua Yang;Donghong Ji;Chong Teng;Wenqing Qi

  • Affiliations:
  • School of Computer, Wuhan University, Wuhan, P.R. China;School of Computer, Wuhan University, Wuhan, P.R. China;School of Computer, Wuhan University, Wuhan, P.R. China;Center for Study of Language and Information, Wuhan University, Wuhan, P.R. China;School of Computer, Wuhan University, Wuhan, P.R. China;School of Computer, Wuhan University, Wuhan, P.R. China

  • Venue:
  • WISE'06 Proceedings of the 7th international conference on Web Information Systems
  • Year:
  • 2006

Abstract

In extractive summarization, the extracted sentences must be arranged properly to produce a logical, coherent and readable summary. This issue is especially pronounced in multi-document summarization. In this paper, several existing methods, each of which generates a reference relation, are combined through a linear combination of the resulting relations. We use four types of relationships between sentences (chronological, positional, topical and dependent relations) to build a graph model in which the vertices are sentences and the edges are weighted relationships of the four types. We then apply a variation of PageRank to obtain the ordering of sentences for multi-document summaries. We evaluated our hybrid model with two automatic measures: distance to a manual ordering and ROUGE score. The evaluation results show a significant improvement in ordering over strategies that omit some of the relations. The results also indicate that the hybrid model is robust across articles of different genres, as used in DUC2004 and DUC2005.
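The pipeline described in the abstract (linear combination of relation scores, a weighted sentence graph, then a PageRank-style ranking) can be illustrated with a minimal sketch. The abstract does not give the relation scores, the combination weights, or the exact PageRank variation, so everything below is an assumption made for illustration: the function names `combine_relations`, `precedence_scores` and `order_sentences` are hypothetical, the convention that `w[i][j]` is the strength of "sentence i precedes sentence j" is assumed, and the greedy pick-and-remove loop is one plausible way to turn scores into an ordering, not necessarily the authors' method.

```python
from typing import Dict, List, Sequence

Matrix = List[List[float]]


def combine_relations(relations: Sequence[Matrix], weights: Sequence[float]) -> Matrix:
    """Linear combination of relation matrices: W = sum_k weights[k] * relations[k]."""
    n = len(relations[0])
    combined = [[0.0] * n for _ in range(n)]
    for rel, weight in zip(relations, weights):
        for i in range(n):
            for j in range(n):
                combined[i][j] += weight * rel[i][j]
    return combined


def precedence_scores(w: Matrix, active: List[int],
                      damping: float = 0.85, iters: int = 50) -> Dict[int, float]:
    """PageRank-style power iteration restricted to the `active` sentences.

    Assumed convention: w[i][j] is the strength of "i precedes j". Score flows
    from j back to i, so a sentence that should precede many other sentences
    accumulates a high score.
    """
    n = len(active)
    score = {i: 1.0 / n for i in active}
    # Total incoming weight of each sentence j, used to normalise what j passes back.
    in_total = {j: sum(w[i][j] for i in active if i != j) for j in active}
    for _ in range(iters):
        updated = {}
        for i in active:
            flow = sum(score[j] * w[i][j] / in_total[j]
                       for j in active if j != i and in_total[j] > 0)
            updated[i] = (1 - damping) / n + damping * flow
        score = updated
    return score


def order_sentences(w: Matrix) -> List[int]:
    """Greedy ordering: pick the top-scoring sentence, remove it, then re-rank the rest."""
    remaining = list(range(len(w)))
    ordering = []
    while remaining:
        scores = precedence_scores(w, remaining)
        best = max(remaining, key=lambda i: scores[i])
        ordering.append(best)
        remaining.remove(best)
    return ordering


# Toy input (made up): two of the four relation types for three sentences.
chronological = [[0, 1, 0], [0, 0, 0], [1, 1, 0]]  # 2 before 0 and 1; 0 before 1
positional = [[0, 1, 0], [0, 0, 0], [1, 0, 0]]     # 2 before 0; 0 before 1
w = combine_relations([chronological, positional], [0.6, 0.4])
print(order_sentences(w))  # -> [2, 0, 1]
```

In this toy input both relations agree that sentence 2 comes first and sentence 1 last, so the combined graph yields the ordering 2, 0, 1. Adding the topical and dependent relations would only mean passing two more matrices and weights to `combine_relations`.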