A semi-automated approach to building text summarisation classifiers

  • Authors:
  • Matias Garcia-Constantino; Frans Coenen; P.-J. Noble; Alan Radford; Christian Setzkorn

  • Affiliations:
  • Department of Computer Science, The University of Liverpool, Liverpool, UK; Department of Computer Science, The University of Liverpool, Liverpool, UK; School of Veterinary Science, University of Liverpool, Neston, UK; School of Veterinary Science, University of Liverpool, Neston, UK; School of Veterinary Science, University of Liverpool, Neston, UK

  • Venue:
  • MLDM'12 Proceedings of the 8th international conference on Machine Learning and Data Mining in Pattern Recognition
  • Year:
  • 2012

Abstract

This paper describes an investigation into extracting useful information from the free-text element of questionnaires, using a semi-automated summarisation extraction technique to generate text summarisation classifiers. A realisation of the proposed technique, SARSET (Semi-Automated Rule Summarisation Extraction Tool), is presented and evaluated using real questionnaire data. Its results are compared against those obtained using two alternative techniques for building text summarisation classifiers: the first uses standard rule-based classifier generators, and the second is founded on the concept of building classifiers using secondary data. The results demonstrate that the proposed semi-automated approach outperforms the other two approaches considered.