Detecting hate speech on the world wide web

Authors:
William Warner;Julia Hirschberg
Affiliations:
Columbia University, New York, NY;Columbia University, New York, NY
Venue:
LSM '12 Proceedings of the Second Workshop on Language in Social Media
Year:
2012

Citing 9
Cited 1

Making large-scale support vector machine learning practical

Advances in kernel methods
Unsupervised word sense disambiguation rivaling supervised methods

ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Decision lists for lexical ambiguity resolution: application to accent restoration in Spanish and French

ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
Thumbs up?: sentiment classification using machine learning techniques

EMNLP '02 Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10
Identifying sources of opinions with conditional random fields and extraction patterns

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Opinion Mining and Sentiment Analysis

Foundations and Trends in Information Retrieval
Expanding domain sentiment lexicon through double propagation

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Why label when you can search?: alternatives to active learning for applying human resources to build classification models under extreme class imbalance

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Offensive language detection using multi-level classification

AI'10 Proceedings of the 23rd Canadian conference on Advances in Artificial Intelligence

Content matters: a study of hate groups detection based on social networks analysis and web mining

Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present an approach to detecting hate speech in online text, where hate speech is defined as abusive speech targeting specific group characteristics, such as ethnic origin, religion, gender, or sexual orientation. While hate speech against any group may exhibit some common characteristics, we have observed that hatred against each different group is typically characterized by the use of a small set of high frequency stereotypical words; however, such words may be used in either a positive or a negative sense, making our task similar to that of words sense disambiguation. In this paper we describe our definition of hate speech, the collection and annotation of our hate speech corpus, and a mechanism for detecting some commonly used methods of evading common "dirty word" filters. We describe pilot classification experiments in which we classify anti-semitic speech reaching an accuracy 94%, precision of 68% and recall at 60%, for an F1 measure of. 6375.