Optimizing plurality for human intelligence tasks

  • Authors:
  • Luyi Mo;Reynold Cheng;Ben Kao;Xuan S. Yang;Chenghui Ren;Siyu Lei;David W. Cheung;Eric Lo

  • Affiliations:
  • University of Hong Kong, Hong Kong, Hong Kong;University of Hong Kong, Hong Kong, Hong Kong;University of Hong Kong, Hong Kong, Hong Kong;University of Hong Kong, Hong Kong, Hong Kong;University of Hong Kong, Hong Kong, Hong Kong;University of Hong Kong, Hong Kong, Hong Kong;University of Hong Kong, Hong Kong, Hong Kong;Hong Kong Polytechnic University, Hong Kong, Hong Kong

  • Venue:
  • Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

In a crowdsourcing system, Human Intelligence Tasks (HITs) (e.g., translating sentences, matching photos, tagging videos with keywords) can be conveniently specified. HITs are made available to a large pool of workers, who are paid upon completing the HITs they have selected. Since workers may have different capabilities, some difficult HITs may not be satisfactorily performed by a single worker. If more workers are employed to perform a HIT, the quality of the HIT's answer could be statistically improved. Given a set of HITs and a fixed "budget", we address the important problem of determining the number of workers (or plurality) of each HIT so that the overall answer quality is optimized. We propose a dynamic programming (DP) algorithm for solving the plurality assignment problem (PAP). We identify two interesting properties, namely, monotonicity and diminishing return, which are satisfied by a HIT if the quality of the HIT's answer increases monotonically at a decreasing rate with its plurality. We show for HITs that satisfy the two properties (e.g., multiple-choice-question HITs), the PAP is approximable. We propose an efficient greedy algorithm for such case. We conduct extensive experiments on synthetic and real datasets to evaluate our algorithms. Our experiments show that our greedy algorithm provides close-to-optimal solutions in practice.