Profanity use in online communities

Authors:
Sara Sood;Judd Antin;Elizabeth Churchill
Affiliations:
Pomona College, Claremont, CA, USA;Yahoo! Research, Santa Clara, California, United States;Yahoo! Research, Santa Clara, California, United States
Venue:
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Year:
2012

Citing 18
Cited 2

Filtering objectionable internet content

ICIS '99 Proceedings of the 20th international conference on Information Systems
Statistical color models with application to skin detection

International Journal of Computer Vision
Finding Naked People

ECCV '96 Proceedings of the 4th European Conference on Computer Vision-Volume II - Volume II
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features

ECML '98 Proceedings of the 10th European Conference on Machine Learning
WebGuard: A Web Filtering Engine Combining Textual, Structural, and Visual Content-Based Analysis

IEEE Transactions on Knowledge and Data Engineering
Internet content filtering using isotonic separation on content category ratings

ACM Transactions on Internet Technology (TOIT)
Get another label? improving data quality and data mining using multiple, noisy labelers

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Fast, cheap, and creative: evaluating translation quality using Amazon's Mechanical Turk

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 1
A collusion-resistant automation scheme for social moderation systems

CCNC'09 Proceedings of the 6th IEEE Conference on Consumer Communications and Networking Conference
How useful are your comments?: analyzing and predicting youtube comments and comment ratings

Proceedings of the 19th international conference on World wide web
Rethinking grammatical error annotation and evaluation with the Amazon Mechanical Turk

IUNLPBEA '10 Proceedings of the NAACL HLT 2010 Fifth Workshop on Innovative Use of NLP for Building Educational Applications
Smokey: automatic recognition of hostile messages

AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence
Tokenizing micro-blogging messages using a text classification approach

AND '10 Proceedings of the fourth workshop on Analytics for noisy unstructured text data
A Smart Filtering System for Newly Coined Profanities by Using Approximate String Alignment

CIT '10 Proceedings of the 2010 10th IEEE International Conference on Computer and Information Technology
Normative influences on thoughtful online participation

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Language use as a reflection of socialization in online communities

LSM '11 Proceedings of the Workshop on Languages in Social Media
System for screening objectionable images

Computer Communications
Automatic identification of personal insults on social news sites

Journal of the American Society for Information Science and Technology

Turkopticon: interrupting worker invisibility in amazon mechanical turk

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Cursing in English on twitter

Proceedings of the 17th ACM conference on Computer supported cooperative work & social computing

Quantified Score

Hi-index	0.01

Visualization

Abstract

As user-generated Web content increases, the amount of inappropriate and/or objectionable content also grows. Several scholarly communities are addressing how to detect and manage such content: research in computer vision focuses on detection of inappropriate images, natural language processing technology has advanced to recognize insults. However, profanity detection systems remain flawed. Current list-based profanity detection systems have two limitations. First, they are easy to circumvent and easily become stale - that is, they cannot adapt to misspellings, abbreviations, and the fast pace of profane slang evolution. Secondly, they offer a one-size fits all solution; they typically do not accommodate domain, community and context specific needs. However, social settings have their own normative behaviors - what is deemed acceptable in one community may not be in another. In this paper, through analysis of comments from a social news site, we provide evidence that current systems are performing poorly and evaluate the cases on which they fail. We then address community differences regarding creation/tolerance of profanity and suggest a shift to more contextually nuanced profanity detection systems.