An investigation concerning the generation of text summarisation classifiers using secondary data

  • Authors:
  • Matias Garcia-Constantino;Frans Coenen;P.-J. Noble;Alan Radford;Christian Setzkorn;Aine Tierney

  • Affiliations:
  • Department of Computer Science, The University of Liverpool, Liverpool, UK;Department of Computer Science, The University of Liverpool, Liverpool, UK;School of Veterinary Science, University of Liverpool, Leahurst, Neston, UK;School of Veterinary Science, University of Liverpool, Leahurst, Neston, UK;School of Veterinary Science, University of Liverpool, Leahurst, Neston, UK;School of Veterinary Science, University of Liverpool, Leahurst, Neston, UK

  • Venue:
  • MLDM'11 Proceedings of the 7th international conference on Machine learning and data mining in pattern recognition
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

An investigation into the potential effectiveness of generating text classifiers from secondary data for the purpose of text summarisation is described. The application scenario assumes a questionnaire corpus where we wish to provide a summary regarding the nature of the free text element of such questionnaires, but no suitable training data is available. The advocated approach is to build the desired text summarisation classifiers using secondary data and then apply these classifiers, for the purpose of text summarisation, to the primary data. We refer to this approach using the acronym CGUSD (Classifier Generation Using Secondary Data). The approach is evaluated using real questionnaire data obtained as part of the SAVSNET (Small Animal Veterinary Surveillance Network) project.