Applying interestingness measures to Ansar forum texts

Authors:
D. B. Skillicorn
Affiliations:
Queen's University, Canada
Venue:
ACM SIGKDD Workshop on Intelligence and Security Informatics
Year:
2010

Citing 1
Cited 2

Automatically classifying documents by ideological and organizational affiliation

ISI'09 Proceedings of the 2009 IEEE international conference on Intelligence and security informatics

Where do I start?: algorithmic strategies to guide intelligence analysts

Proceedings of the ACM SIGKDD Workshop on Intelligence and Security Informatics
Lessons from a Jihadi Corpus

ASONAM '12 Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Documents from the Ansar aljihad forum are ranked using a number of word-usage models. Analysis of overall content shows that postings fall strongly into two categories. A model describing Salafist-jihadi content generates a very clear single-factor ranking of postings. This ranking could be interpreted as selecting the most radical postings, and so could direct analyst attention to the most significant documents. A model for deception creates a multifactor ranking that produces a similar ordering, with low-deception postings identified with highly Salafist-jihadi ones. This suggests either that such postings are extremely sincere, or that personal pronoun use and intricate structuring are also markers of Salafist-jihadi language. Although the overall approach is relatively straightforward, the choice of parameters to maximize the usefulness of the results is intricate.