A weakly-supervised approach to argumentative zoning of scientific documents

  • Authors:
  • Yufan Guo;Anna Korhonen;Thierry Poibeau

  • Affiliations:
  • University of Cambridge, UK;University of Cambridge, UK;LaTTiCe, CNRS & ENS, France

  • Venue:
  • EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Argumentative Zoning (AZ) -- analysis of the argumentative structure of a scientific paper -- has proved useful for a number of information access tasks. Current approaches to AZ rely on supervised machine learning (ML). Requiring large amounts of annotated data, these approaches are expensive to develop and port to different domains and tasks. A potential solution to this problem is to use weakly-supervised ML instead. We investigate the performance of four weakly-supervised classifiers on scientific abstract data annotated for multiple AZ classes. Our best classifier based on the combination of active learning and self-training outperforms our best supervised classifier, yielding a high accuracy of 81% when using just 10% of the labeled data. This result suggests that weakly-supervised learning could be employed to improve the practical applicability and portability of AZ across different information access tasks.