The rapid proliferation of internet communication has given rise to numerous security issues. The anonymity of online media such as email, web sites, and forums makes them an attractive channel for criminal activity. Increased globalization and the borderless nature of the internet have amplified these concerns by adding a multilingual dimension. The world's social and political climate has drawn a great deal of attention to Arabic. In this study we apply authorship identification techniques to Arabic web forum messages. Our approach uses lexical, syntactic, structural, and content-specific writing style features. We address some of the problematic characteristics of Arabic en route to developing an Arabic language model that provides a respectable level of classification accuracy for authorship discrimination. We also run experiments to evaluate the effectiveness of different feature types and classification techniques on our dataset.
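To make the four feature categories concrete, the following is a minimal sketch of how a single forum message might be turned into a feature dictionary. It is not the feature set used in the study: the function name `extract_style_features`, the word lists, the punctuation set, and the specific measures are all assumptions chosen for illustration.

```python
import re
from collections import Counter

# Hypothetical word lists, chosen only for illustration; the abstract
# does not specify which words or keywords the study actually uses.
FUNCTION_WORDS = ["في", "من", "على", "إلى", "عن"]
CONTENT_KEYWORDS = ["أخبار", "سياسة"]
# Latin and Arabic punctuation marks (also an assumption).
PUNCTUATION = ".,;:!?،؛؟"


def extract_style_features(message: str) -> dict:
    """Return a flat dictionary of simple writing-style features."""
    # \w matches Unicode letters and digits, so Arabic words are captured.
    words = re.findall(r"\w+", message)
    counts = Counter(words)
    n_words = len(words)
    # Paragraphs are taken to be blocks separated by blank lines.
    paragraphs = [p for p in re.split(r"\n\s*\n", message) if p.strip()]

    features = {
        # Lexical: character- and word-level statistics.
        "lex_char_count": len(message),
        "lex_word_count": n_words,
        "lex_avg_word_len": sum(len(w) for w in words) / n_words if n_words else 0.0,
        "lex_vocab_richness": len(counts) / n_words if n_words else 0.0,
        # Structural: layout of the message.
        "struct_line_count": len(message.splitlines()),
        "struct_paragraph_count": len(paragraphs),
    }
    # Syntactic: punctuation and function-word usage.
    for mark in PUNCTUATION:
        features[f"syn_punct_{mark}"] = message.count(mark)
    for word in FUNCTION_WORDS:
        features[f"syn_func_{word}"] = counts[word]
    # Content-specific: topic keyword usage.
    for word in CONTENT_KEYWORDS:
        features[f"content_{word}"] = counts[word]
    return features


if __name__ == "__main__":
    sample = "من البيت إلى المدرسة، من هنا.\n\nسطر ثان؟"
    for name, value in extract_style_features(sample).items():
        print(name, value)
```

Feature dictionaries of this kind, one per message, could then be converted to numeric vectors and passed to a classifier of the reader's choice in order to compare feature types and classification techniques.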