In this poster we present an overview of the techniques we used to develop and evaluate a text categorisation system that automatically classifies racist texts. Detecting racism is difficult because, unlike in some other text classification tasks, the presence of indicator words alone is not enough to identify a racist text. We use Support Vector Machines (SVMs) to categorise web pages as racist or non-racist. What counts as a term can be interpreted in different ways, and in this poster we examine three representations of a web page within an SVM: bag-of-words, bigrams and part-of-speech tags.
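To make the three interpretations of a "term" concrete, the following is a minimal sketch of how one page's text might be turned into bag-of-words, bigram and part-of-speech counts. It is an illustration under stated assumptions, not the system described in the poster: the function names, the sample sentence and the tiny `TOY_POS_LEXICON` are hypothetical, and the lexicon lookup merely stands in for a real part-of-speech tagger. SVM training itself is omitted; the sketch stops at the count vectors that a classifier would take as input.

```python
from collections import Counter
import re

# Hypothetical toy lexicon standing in for a real part-of-speech tagger.
# A real system would tag words in context; this lookup is for illustration only.
TOY_POS_LEXICON = {
    "the": "DET", "a": "DET",
    "page": "NOUN", "text": "NOUN", "words": "NOUN",
    "contains": "VERB", "is": "VERB",
    "short": "ADJ",
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z']+", text.lower())


def bag_of_words(tokens):
    """Representation 1: each distinct word is a term."""
    return Counter(tokens)


def bigrams(tokens):
    """Representation 2: each pair of adjacent words is a term."""
    return Counter(zip(tokens, tokens[1:]))


def pos_tags(tokens, lexicon=TOY_POS_LEXICON):
    """Representation 3: each part-of-speech tag is a term.

    Words missing from the toy lexicon are mapped to "UNK".
    """
    return Counter(lexicon.get(token, "UNK") for token in tokens)


def to_vector(counts, vocabulary):
    """Project term counts onto a fixed vocabulary, giving a feature vector."""
    return [counts.get(term, 0) for term in vocabulary]


tokens = tokenize("The page contains the short text")
bow = bag_of_words(tokens)
bi = bigrams(tokens)
pos = pos_tags(tokens)

# Fixed-length vector over an assumed vocabulary, as a classifier would expect.
bow_vector = to_vector(bow, ["the", "text", "missing"])
```

Each representation yields a different feature space for the same page: word identity, local word order, or grammatical category only. The vocabulary passed to `to_vector` would normally be collected from the training pages rather than written by hand.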