Challenges in evaluating summaries of short stories

  • Authors:
  • Anna Kazantseva;Stan Szpakowicz

  • Affiliations:
  • University of Ottawa, Ottawa, Canada;University of Ottawa, Ottawa, Canada and Polish Academy of Sciences, Warsaw, Poland

  • Venue:
  • SumQA '06 Proceedings of the Workshop on Task-Focused Summarization and Question Answering
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents experiments with the evaluation of automatically produced summaries of literary short stories. The summaries are tailored to a particular purpose of helping a reader decide whether she wants to read the story. The evaluation procedure includes extrinsic and intrinsic measures, as well as subjective and factual judgments about the summaries pronounced by human subjects. The experiments confirm the experience of summarizing more conventional genres: sentence overlap between human- and machine-made summaries is not a complete picture of the quality of a summary. In fact, in our case, sentence overlap does not correlate well with human judgment. We explain the evaluation procedures and discuss several challenges of evaluating summaries of works of fiction.