Query-focused multidocument summarization based on hybrid relevance analysis and surface feature salience

  • Authors:
  • Jen-Yuan Yeh;Hao-Ren Ke;Wei-Pang Yang

  • Affiliations:
  • Dept. of Computer Science, National Chiao Tung University, Hsinchu, Taiwan;Institute of Information Management, National Chiao Tung University, Hsinchu, Taiwan;Dept. of Information Management, National Dong Hwa University, Hualien, Taiwan

  • Venue:
  • SMO'06 Proceedings of the 6th WSEAS International Conference on Simulation, Modelling and Optimization
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Query-focused multidocument summarization is to synthesize from a set of topic-related documents a brief, well-organized, fluent summary for the purpose of answering an information need that cannot be met by just stating a name, date, quantity, etc. In this paper, the task is essentially treated as a sentence retrieval task. We propose a hybrid relevance analysis to evaluate the relevance of a sentence to the query. This is achieved by combining similarities computed from the vector space model and latent semantic analysis. Surface features are also examined to discern the impact of low-level features for query-focused multidocument summarization. In addition, a modified Maximal Marginal Relevance is proposed to reduce redundancy by taking into account shallow feature salience. The experimental results show the proposed method obtained competitive results when evaluated with the DUC 2005 corpus.