Perspectives on crowdsourcing annotations for natural language processing

  • Authors:
  • Aobo Wang;Cong Duy Hoang;Min-Yen Kan

  • Affiliations:
  • AS6 04-13 Computing 1, 13 Computing Drive National University of Singapore, Singapore, Singapore 117417;Human Language Technology Department, Institute for Infocomm Research (I²R), A*STAR, Singapore, Singapore 138632;AS6 05-12 Computing 1, 13 Computing Drive National University of Singapore, Singapore, Singapore 117417

  • Venue:
  • Language Resources and Evaluation
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Crowdsourcing has emerged as a new method for obtaining annotations for training models for machine learning. While many variants of this process exist, they largely differ in their methods of motivating subjects to contribute and the scale of their applications. To date, there has yet to be a study that helps the practitioner to decide what form an annotation application should take to best reach its objectives within the constraints of a project. To fill this gap, we provide a faceted analysis of crowdsourcing from a practitioner's perspective, and show how our facets apply to existing published crowdsourced annotation applications. We then summarize how the major crowdsourcing genres fill different parts of this multi-dimensional space, which leads to our recommendations on the potential opportunities crowdsourcing offers to future annotation efforts.