Impact of document structure on hierarchical summarization

  • Authors:
  • Fu Lee Wang; Christopher C. Yang

  • Affiliations:
  • Department of Computer Science, City University of Hong Kong, Kowloon Tong, Hong Kong; Department of Systems Engineering and Engineering Management, Chinese University of Hong Kong, Shatin, Hong Kong

  • Venue:
  • ICADL'06 Proceedings of the 9th International Conference on Asian Digital Libraries: Achievements, Challenges and Opportunities
  • Year:
  • 2006

Abstract

The hierarchical summarization technique summarizes a large document based on its hierarchical structure and salient features. A previous study has shown that hierarchical summarization is a promising technique that can effectively extract the most important information from the source document. Hierarchical summarization has been extended to the summarization of multiple documents, and three hierarchical structures were proposed to organize a set of related documents. This paper investigates the impact of document structure on hierarchical summarization. The results show that hierarchical summarization of multiple documents organized in a hierarchical structure outperforms other multi-document summarization systems that do not use a hierarchical structure. Moreover, hierarchical summarization by event topics extracts a set of sentences significantly different from those extracted under the other hierarchical structures, and it performs best when the summary is highly compressed.