Instance-based natural language generation

  • Authors:
  • S. Varges; C. Mellish

  • Affiliations:
  • Department of Information Engineering and Computer Science, University of Trento, Via Sommarive, 14, 38050 Povo (TN), Italy, e-mail: varges@disi.unitn.it; Department of Computing Science, University of Aberdeen, King's College, Aberdeen AB24 3UE, UK, e-mail: c.mellish@abdn.ac.uk

  • Venue:
  • Natural Language Engineering
  • Year:
  • 2010

Abstract

We investigate the use of instance-based ranking methods for surface realization in natural language generation. Our approach to instance-based natural language generation (IBNLG) employs two components: a rule system that ‘overgenerates’ a number of realization candidates from a meaning representation and an instance-based ranker that scores the candidates according to their similarity to examples taken from a training corpus. We develop an efficient search technique for identifying the optimal candidate based on a novel extension of the A* algorithm. The rule system is produced automatically from a semantically annotated fragment of the Penn Treebank II containing management succession texts. We detail the annotation scheme and grammar induction algorithm and evaluate the efficiency and output of the generator. We also discuss issues such as input coverage (completeness) and fluency that are relevant to surface generation in general.
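The two-component design described in the abstract (overgenerate candidates, then rank them by similarity to corpus instances) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the candidate strings are hand-written rather than produced by an induced rule system, the similarity measure is a simple bigram Jaccard overlap and not the measure used in the paper, and candidates are scored exhaustively rather than with the A*-based search the authors develop. The names `ngrams`, `similarity`, and `rank` are hypothetical.

```python
from typing import Iterable, List, Set, Tuple


def ngrams(tokens: List[str], n: int = 2) -> Set[Tuple[str, ...]]:
    """Return the set of word n-grams, with sentence-boundary padding."""
    padded = ["<s>"] + tokens + ["</s>"]
    return {tuple(padded[i:i + n]) for i in range(len(padded) - n + 1)}


def similarity(candidate: str, instance: str) -> float:
    """Bigram Jaccard overlap (an assumed stand-in, not the paper's measure)."""
    a = ngrams(candidate.split())
    b = ngrams(instance.split())
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def rank(candidates: Iterable[str], instances: List[str]) -> List[Tuple[float, str]]:
    """Score each candidate by its closest training instance, best first."""
    scored = [
        (max(similarity(cand, inst) for inst in instances), cand)
        for cand in candidates
    ]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


# Hypothetical training instances in the management succession domain.
instances = [
    "John Smith was named president of Acme Corp .",
    "Mary Jones succeeds Paul Brown as chairman .",
]

# Hypothetical overgenerated candidates for one meaning representation.
candidates = [
    "president of Widget Inc Ann Lee was named .",
    "Ann Lee was named president of Widget Inc .",
    "named was Ann Lee Widget Inc president of .",
]

best_score, best = rank(candidates, instances)[0]
print(best)
```

Exhaustive scoring like this is linear in the number of candidates, which is why a rule system that overgenerates heavily motivates a search technique that avoids enumerating every candidate.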