Detecting human features in summaries --- symbol sequence statistical regularity

  • Authors:
  • George Giannakopoulos;Vangelis Karkaletsis;George A. Vouros

  • Affiliations:
  • Software and Knowledge Engineering Laboratory, National Center of Scientific Research "Demokritos", Greece;Software and Knowledge Engineering Laboratory, National Center of Scientific Research "Demokritos", Greece;Department of Digital Systems, University of Pireaus, Greece

  • Venue:
  • SETN'12 Proceedings of the 7th Hellenic conference on Artificial Intelligence: theories and applications
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

The presented work studies textual summaries, aiming to detect the qualities of human multi-document summaries, in contrast to automatically extracted ones. The measured features are based on a generic statistical regularity measure, named Symbol Sequence Statistical Regularity (SSSR). The measure is calculated over both character and word n-grams of various ranks, given a set of human and automatically extracted multi-document summaries from two different corpora. The results of the experiments indicate that the proposed measure provides enough distinctive power to discriminate between the human and non-human summaries. The results hint on the qualities a human summary holds, increasing intuition related to how a good summary should be generated.