Evaluating unsupervised learning for natural language processing tasks

  • Authors:
  • Andreas Vlachos

  • Affiliations:
  • University of Wisconsin-Madison

  • Venue:
  • EMNLP '11 Proceedings of the First Workshop on Unsupervised Learning in NLP
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

The development of unsupervised learning methods for natural language processing tasks has become an important and popular area of research. The primary advantage of these methods is that they do not require annotated data to learn a model. However, this advantage makes them difficult to evaluate against a manually labeled gold standard. Using unsupervised part-of-speech tagging as our case study, we discuss the reasons that render this evaluation paradigm unsuitable for the evaluation of unsupervised learning methods. Instead, we argue that the rarely used in-context evaluation is more appropriate and more informative, as it takes into account the way these methods are likely to be applied. Finally, bearing the issue of evaluation in mind, we propose directions for future work in unsupervised natural language processing.