A summarization system for Chinese news from multiple sources

  • Authors:
  • Hsin-Hsi Chen;June-Jei Kuo;Sheng-Jie Huang;Chuan-Jie Lin;Hung-Chia Wung

  • Affiliations:
  • Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan, ROC;Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan, ROC;Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan, ROC;Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan, ROC;Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan, ROC

  • Venue:
  • Journal of the American Society for Information Science and Technology
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

This article proposes a summarization system for multiple documents. It employs not only named entities and other signatures to cluster news from different sources, but also employs punctuation marks, linking elements, and topic chains to identify the meaningful units (MUs). Using nouns and verbs to identify the similar MUs, focusing and browsing models are applied to represent the summarization results. To reduce information loss during summarization, informative words in a document are introduced. For the evaluation, a question answering system (QA system) is proposed to substitute the human assessors. In large-scale experiments containing 140 questions to 17,877 documents, the results show that those models using informative words outperform pure heuristic voting-only strategy by news reporters. This model can be easily further applied to summarize multilingual news from multiple sources.