An Extractive Text Summarizer Based on Significant Words

  • Authors:
  • Xiaoyue Liu;Jonathan J. Webster;Chunyu Kit

  • Affiliations:
  • Department of Chinese, Translation and Linguistics, City University of Hong Kong, Hong Kong;Department of Chinese, Translation and Linguistics, City University of Hong Kong, Hong Kong;Department of Chinese, Translation and Linguistics, City University of Hong Kong, Hong Kong

  • Venue:
  • ICCPOL '09: Proceedings of the 22nd International Conference on Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy
  • Year:
  • 2009

Abstract

Document summarization can be viewed as a reductive distillation of source text through content condensation, and words that carry more information are believed to carry more content and thus greater importance. In this paper, we propose a new measure that quantifies word significance for natural language processing (NLP) tasks and apply it to extractive text summarization. In a query-based summarization setting, the correlation between user queries and the sentences to be scored is established from both the micro (i.e., word-level) and the macro (i.e., sentence-level) perspectives, resulting in an effective ranking formula. Experiments on a generic single-document summarization evaluation and on a query-based multi-document evaluation verify the effectiveness of the proposed measures and show that the proposed approach achieves state-of-the-art performance.
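
The abstract does not give the significance measure or the ranking formula, so the following is only a minimal sketch of the general idea of significance-based extractive summarization, not the authors' method. It assumes relative word frequency as a stand-in for word significance and a simple query-overlap boost as a stand-in for the query correlation; the names `word_significance`, `summarize`, and `query_weight` are hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def split_sentences(text):
    """Naive sentence splitter: break after ., ! or ? followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def word_significance(sentences):
    """ASSUMPTION: relative frequency stands in for the paper's measure."""
    counts = Counter(w for s in sentences for w in tokenize(s))
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}


def summarize(text, query="", k=1, query_weight=1.0):
    """Return the k highest-scoring sentences in their original order.

    A sentence's base score is the mean significance of its words.
    ASSUMPTION: the query effect is modeled as a multiplicative boost
    proportional to the fraction of query words found in the sentence.
    """
    sentences = split_sentences(text)
    sig = word_significance(sentences)
    query_words = set(tokenize(query))
    scored = []
    for idx, sentence in enumerate(sentences):
        words = tokenize(sentence)
        if not words:
            continue
        base = sum(sig[w] for w in words) / len(words)
        overlap = len(query_words & set(words)) / len(query_words) if query_words else 0.0
        scored.append((base * (1.0 + query_weight * overlap), idx))
    # Highest score first; ties are broken by earlier position in the text.
    top = sorted(scored, key=lambda t: (-t[0], t[1]))[:k]
    return [sentences[idx] for _, idx in sorted(top, key=lambda t: t[1])]


if __name__ == "__main__":
    doc = "Cats chase mice. Dogs chase cats and cats run. Birds sing."
    print(summarize(doc, k=1))
    print(summarize(doc, query="birds", k=1, query_weight=2.0))
```

Without a query, the sketch favors sentences dense in frequent words; supplying a query shifts the ranking toward sentences that share words with it, which loosely mirrors the generic and query-based settings mentioned in the abstract.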