Seed and Grow: augmenting statistically generated summary sentences using schematic word patterns

  • Authors:
  • Stephen Wan;Robert Dale;Mark Dras;Cécile Paris

  • Affiliations:
  • Macquarie University, Sydney, NSW and ICT Centre, CSIRO, Sydney, Australia;Macquarie University, Sydney, NSW;Macquarie University, Sydney, NSW;ICT Centre, CSIRO, Sydney, Australia

  • Venue:
  • EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

We examine the problem of content selection in statistical novel sentence generation. Our approach models the processes performed by professional editors when incorporating material from additional sentences to support some initially chosen key summary sentence, a process we refer to as Sentence Augmentation. We propose and evaluate a method called "Seed and Grow" for selecting such auxiliary information. Additionally, we argue that this can be performed using schemata, as represented by word-pair co-occurrences, and demonstrate its use in statistical summary sentence generation. Evaluation results are supportive, indicating that a schemata model significantly improves over the baseline.