Heuristics based automatic text summarization of unstructured text

  • Authors:
  • M. K. Dalal; M. A. Zaveri

  • Affiliations:
  • Sarvajanik College of Engineering & Technology, Athwa-Lines, Surat, India; S. V. National Institute of Technology, Ichchhanath, Surat, India

  • Venue:
  • Proceedings of the International Conference & Workshop on Emerging Trends in Technology

  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Abstract

Automatic text summarization is a specialized text mining task that generates a summary or abstract from one or more input text documents. Researchers in this field have explored various heuristic and semi-supervised learning methods to generate both generic and user-oriented summaries. This paper examines the effectiveness of well-known summarization heuristics when applied to generating single-document summary extracts of variable length. To evaluate summary quality, different human judges scored the original text documents and their summaries on soft metrics such as topic coverage, relative coherence, novelty, and information content, and the scores were compared statistically. The experiments showed that for 65% of the documents, the scores assigned to the original texts and to their summaries varied by less than 10%.
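
As an illustration of what heuristic single-document extraction with variable length can look like, here is a minimal sketch. The abstract does not name the heuristics the paper evaluates, so the two used here (normalized term frequency and sentence position) are assumptions, as are the function names, the tiny stop-word list, the `ratio` parameter, and the `position_weight` value.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (an assumption for illustration).
STOP_WORDS = {"a", "an", "the", "of", "and", "or", "to", "in", "is", "are",
              "it", "on", "for", "with", "that", "this", "was", "were", "by", "as"}


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' when followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def tokenize(sentence):
    """Lowercase alphabetic tokens with stop words removed."""
    return [w for w in re.findall(r"[a-z']+", sentence.lower())
            if w not in STOP_WORDS]


def summarize(text, ratio=0.3, position_weight=0.5):
    """Return an extract holding roughly `ratio` of the sentences.

    Each sentence is scored with two assumed heuristics:
    - term frequency: mean document frequency of its words, scaled to [0, 1]
    - position: earlier sentences receive a higher score
    Selected sentences are returned in their original order.
    """
    sentences = split_sentences(text)
    if not sentences:
        return []

    freq = Counter(w for s in sentences for w in tokenize(s))
    max_freq = max(freq.values(), default=1)

    scores = []
    for i, sentence in enumerate(sentences):
        words = tokenize(sentence)
        tf = sum(freq[w] for w in words) / (len(words) * max_freq) if words else 0.0
        position = 1.0 - i / len(sentences)
        scores.append(tf + position_weight * position)

    # Variable-length extract: keep at least one sentence.
    k = max(1, round(ratio * len(sentences)))
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]
```

Calling `summarize(text, ratio=0.2)` keeps about a fifth of the sentences, and raising `ratio` lengthens the extract. Returning the selected sentences in document order is a design choice intended to preserve some of the coherence that the human judges were asked to rate.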