Towards a surface realization-oriented corpus annotation

  • Authors:
  • Leo Wanner; Simon Mille; Bernd Bohnet

  • Affiliations:
  • ICREA and Universitat Pompeu Fabra, Barcelona, Spain; Universitat Pompeu Fabra, Barcelona, Spain; Universität Stuttgart, Stuttgart, Germany

  • Venue:
  • INLG '12 Proceedings of the Seventh International Natural Language Generation Conference
  • Year:
  • 2012

Abstract

Until recently, deep stochastic surface realization has been hindered by the lack of semantically annotated corpora. This is about to change: such corpora are increasingly available, e.g., in the context of CoNLL shared tasks. However, recent experiments with CoNLL 2009 corpora show that these popular resources, which serve well for other applications, may not do so for generation. Attempts to adapt them for generation have so far resulted in better realizer performance, but not yet in a genuinely semantic, generation-oriented annotation schema. Our goal is to initiate a debate on how a generation-suitable annotation schema should be defined. We define some general principles of semantic generation-oriented annotation and propose an annotation schema based on these principles. Experiments show that making the semantic corpora comply with the suggested principles need not have a negative impact on the quality of the stochastic generators trained on them.