Filtering objectionable internet content
ICIS '99 Proceedings of the 20th international conference on Information Systems
Statistical color models with application to skin detection
International Journal of Computer Vision
ECCV '96 Proceedings of the 4th European Conference on Computer Vision-Volume II - Volume II
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
WebGuard: A Web Filtering Engine Combining Textual, Structural, and Visual Content-Based Analysis
IEEE Transactions on Knowledge and Data Engineering
Internet content filtering using isotonic separation on content category ratings
ACM Transactions on Internet Technology (TOIT)
Get another label? improving data quality and data mining using multiple, noisy labelers
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Fast, cheap, and creative: evaluating translation quality using Amazon's Mechanical Turk
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 1
A collusion-resistant automation scheme for social moderation systems
CCNC'09 Proceedings of the 6th IEEE Conference on Consumer Communications and Networking Conference
How useful are your comments?: analyzing and predicting youtube comments and comment ratings
Proceedings of the 19th international conference on World wide web
Rethinking grammatical error annotation and evaluation with the Amazon Mechanical Turk
IUNLPBEA '10 Proceedings of the NAACL HLT 2010 Fifth Workshop on Innovative Use of NLP for Building Educational Applications
Smokey: automatic recognition of hostile messages
AAAI'97/IAAI'97 Proceedings of the fourteenth national conference on artificial intelligence and ninth conference on Innovative applications of artificial intelligence
Tokenizing micro-blogging messages using a text classification approach
AND '10 Proceedings of the fourth workshop on Analytics for noisy unstructured text data
A Smart Filtering System for Newly Coined Profanities by Using Approximate String Alignment
CIT '10 Proceedings of the 2010 10th IEEE International Conference on Computer and Information Technology
Normative influences on thoughtful online participation
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Language use as a reflection of socialization in online communities
LSM '11 Proceedings of the Workshop on Languages in Social Media
System for screening objectionable images
Computer Communications
Automatic identification of personal insults on social news sites
Journal of the American Society for Information Science and Technology
Turkopticon: interrupting worker invisibility in amazon mechanical turk
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Proceedings of the 17th ACM conference on Computer supported cooperative work & social computing
Hi-index | 0.01 |
As user-generated Web content increases, the amount of inappropriate and/or objectionable content also grows. Several scholarly communities are addressing how to detect and manage such content: research in computer vision focuses on detection of inappropriate images, natural language processing technology has advanced to recognize insults. However, profanity detection systems remain flawed. Current list-based profanity detection systems have two limitations. First, they are easy to circumvent and easily become stale - that is, they cannot adapt to misspellings, abbreviations, and the fast pace of profane slang evolution. Secondly, they offer a one-size fits all solution; they typically do not accommodate domain, community and context specific needs. However, social settings have their own normative behaviors - what is deemed acceptable in one community may not be in another. In this paper, through analysis of comments from a social news site, we provide evidence that current systems are performing poorly and evaluate the cases on which they fail. We then address community differences regarding creation/tolerance of profanity and suggest a shift to more contextually nuanced profanity detection systems.