What is in a text and what does it do: qualitative evaluations of an NLG system -- the BT-Nurse -- using content analysis and discourse analysis

  • Authors:
  • Rahul Sambaraju;Ehud Reiter;Robert Logie;Andy McKinlay;Chris McVittie;Albert Gatt;Cindy Sykes

  • Affiliations:
  • Queen Margaret Univ, UK;Univ of Aberdeen, UK;Univ of Edinburgh, UK;Univ of Edinburgh, UK;Queen Margaret Univ, UK;Univ of Malta, Malta;Edinburgh Royal Infirmary, U. K

  • Venue:
  • ENLG '11 Proceedings of the 13th European Workshop on Natural Language Generation
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Evaluations of NLG systems generally are quantiative, that is, based on corpus comparison statistics and/or results of experiments with people. Outcomes of such evaluations are important in demonstrating whether or not an NLG system is successful, but leave gaps in understanding why this is the case. Alternatively, qualitative evaluations carried out by experts provide knowledge on where a system needs to be improved. In this paper we describe two such evaluations carried out for the BT-Nurse system, using two different methodologies (content analysis and discourse analysis). The outcomes of such evaluations are discussed in comparison to what was learnt from a quantitiave evaluation of BT-Nurse. Implications for the role of similar evaluations in NLG are also discussed.