Worker quality control is a crucial aspect of crowdsourcing systems, typically occupying a large fraction of the time and money invested in crowdsourcing. In this work, we devise techniques to generate confidence intervals for worker error rate estimates, thereby enabling a better evaluation of worker quality. We show that our techniques generate correct confidence intervals on a range of real-world datasets, and we demonstrate their wide applicability by using them to evict poorly performing workers and to provide confidence intervals on the accuracy of the resulting answers.
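The abstract does not spell out how the intervals are constructed, so as a concrete illustration, here is a minimal sketch assuming a standard approach: treat a worker's mistakes on gold-standard questions as Bernoulli trials and compute a Wilson score interval on the error rate, evicting a worker only when the interval's lower bound clears a threshold. The function name, threshold, and data below are hypothetical, not taken from the paper.

```python
# Minimal sketch: Wilson score confidence interval for a worker's error rate,
# used to make eviction decisions robust to small sample sizes. This is an
# illustrative stand-in for the paper's techniques, not a reproduction of them.
import math


def wilson_interval(errors: int, total: int, z: float = 1.96) -> tuple[float, float]:
    """95% Wilson score interval for a binomial error rate (z=1.96)."""
    if total == 0:
        return (0.0, 1.0)  # no evidence yet: the rate is unconstrained
    p_hat = errors / total
    denom = 1 + z**2 / total
    center = (p_hat + z**2 / (2 * total)) / denom
    half = (z / denom) * math.sqrt(
        p_hat * (1 - p_hat) / total + z**2 / (4 * total**2)
    )
    return (max(0.0, center - half), min(1.0, center + half))


# Evict only when the *lower* bound of the interval exceeds the threshold,
# so a worker with few answers is not evicted on sparse evidence alone.
ERROR_THRESHOLD = 0.3  # hypothetical threshold, not from the paper

workers = {"w1": (2, 50), "w2": (20, 40), "w3": (3, 5)}  # (errors, answers)
for worker, (errors, total) in workers.items():
    lo, hi = wilson_interval(errors, total)
    verdict = "evict" if lo > ERROR_THRESHOLD else "keep"
    print(f"{worker}: error rate in [{lo:.2f}, {hi:.2f}] -> {verdict}")
```

Note how the interval, rather than the point estimate, drives the decision: w3's observed error rate (3/5 = 0.6) is higher than w2's (20/40 = 0.5), but w3 is kept because five answers leave the lower bound below the threshold, while w2's forty answers make the high error rate statistically credible.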