One of the major bottlenecks in the development of data-driven AI systems is the cost of reliable human annotations. The recent advent of crowdsourcing platforms such as Amazon's Mechanical Turk, which give requesters affordable, rapid access to a global workforce, greatly facilitates the creation of massive training datasets. Most available studies on the effectiveness of crowdsourcing report on English data. We use Mechanical Turk annotations to train an opinion mining system that classifies Spanish consumer comments. We design three different Human Intelligence Task (HIT) strategies and report high inter-annotator agreement between non-expert and expert annotators. We evaluate the advantages and drawbacks of each HIT design and show that, in our case, non-expert annotations are a viable and cost-effective alternative to expert annotations.
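The abstract does not name the agreement measure used; Cohen's kappa is a common choice for comparing two annotation sources on the same items. Below is a minimal sketch of that computation, assuming one expert label and one aggregated (e.g., majority-voted) Turker label per comment; the data and label names are hypothetical.

```python
from collections import Counter

def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa between two annotators' label sequences over the same items."""
    assert len(labels_a) == len(labels_b) and labels_a
    n = len(labels_a)
    # Observed agreement: fraction of items with identical labels.
    p_o = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: expected overlap given each annotator's label distribution.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    p_e = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    return (p_o - p_e) / (1 - p_e)

# Hypothetical polarity labels for ten Spanish consumer comments.
expert  = ["pos", "neg", "pos", "pos", "neg", "neg", "pos", "neg", "pos", "pos"]
turkers = ["pos", "neg", "pos", "neg", "neg", "neg", "pos", "neg", "pos", "pos"]
print(f"kappa = {cohen_kappa(expert, turkers):.2f}")  # kappa = 0.80
```

Kappa corrects raw agreement for what two annotators would match on by chance alone, which matters when the label distribution is skewed, as is typical of consumer-review polarity data.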