Optimizing plurality for human intelligence tasks

Authors:
Luyi Mo;Reynold Cheng;Ben Kao;Xuan S. Yang;Chenghui Ren;Siyu Lei;David W. Cheung;Eric Lo
Affiliations:
University of Hong Kong, Hong Kong, Hong Kong;University of Hong Kong, Hong Kong, Hong Kong;University of Hong Kong, Hong Kong, Hong Kong;University of Hong Kong, Hong Kong, Hong Kong;University of Hong Kong, Hong Kong, Hong Kong;University of Hong Kong, Hong Kong, Hong Kong;University of Hong Kong, Hong Kong, Hong Kong;Hong Kong Polytechnic University, Hong Kong, Hong Kong
Venue:
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Year:
2013

Citing 12
Cited 0

Knapsack problems: algorithms and computer implementations

Knapsack problems: algorithms and computer implementations
Human-assisted graph search: it's okay to ask questions

Proceedings of the VLDB Endowment
CrowdDB: answering queries with crowdsourcing

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Demonstration of Qurk: a query processor for humanoperators

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Human-powered sorts and joins

Proceedings of the VLDB Endowment
CrowdScreen: algorithms for filtering data with humans

SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
So who won?: dynamic max discovery with the crowd

SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
CDAS: a crowdsourcing data analytics system

Proceedings of the VLDB Endowment
CrowdER: crowdsourcing entity resolution

Proceedings of the VLDB Endowment
Whom to ask?: jury selection for decision making tasks on micro-blog services

Proceedings of the VLDB Endowment
AutoMan: a platform for integrating human-based and digital computation

Proceedings of the ACM international conference on Object oriented programming systems languages and applications
On incentive-based tagging

ICDE '13 Proceedings of the 2013 IEEE International Conference on Data Engineering (ICDE 2013)

Quantified Score

Hi-index	0.00

Visualization

Abstract

In a crowdsourcing system, Human Intelligence Tasks (HITs) (e.g., translating sentences, matching photos, tagging videos with keywords) can be conveniently specified. HITs are made available to a large pool of workers, who are paid upon completing the HITs they have selected. Since workers may have different capabilities, some difficult HITs may not be satisfactorily performed by a single worker. If more workers are employed to perform a HIT, the quality of the HIT's answer could be statistically improved. Given a set of HITs and a fixed "budget", we address the important problem of determining the number of workers (or plurality) of each HIT so that the overall answer quality is optimized. We propose a dynamic programming (DP) algorithm for solving the plurality assignment problem (PAP). We identify two interesting properties, namely, monotonicity and diminishing return, which are satisfied by a HIT if the quality of the HIT's answer increases monotonically at a decreasing rate with its plurality. We show for HITs that satisfy the two properties (e.g., multiple-choice-question HITs), the PAP is approximable. We propose an efficient greedy algorithm for such case. We conduct extensive experiments on synthetic and real datasets to evaluate our algorithms. Our experiments show that our greedy algorithm provides close-to-optimal solutions in practice.