Communications of the ACM
Extracting significant time varying features from text
Proceedings of the eighth international conference on Information and knowledge management
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Explicitly representing expected cost: an alternative to ROC representation
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Robust Classification for Imprecise Environments
Machine Learning
Evaluating cost-sensitive Unsolicited Bulk Email categorization
Proceedings of the 2002 ACM symposium on Applied computing
Topic Detection and Tracking: Event-Based Information Organization
Topic Detection and Tracking: Event-Based Information Organization
The Case against Accuracy Estimation for Comparing Induction Algorithms
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Handbook of data mining and knowledge discovery
Communications of the ACM - Program compaction
A comparison of event models for Naive Bayes anti-spam e-mail filtering
EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Improved robustness of signature-based near-replica detection via lexicon randomization
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
SF-HME system: a hierarchical mixtures-of-experts classification system for spam filtering
Proceedings of the 2006 ACM symposium on Applied computing
The challenges of service-side personalized spam filtering: scalability and beyond
InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
Workload models of spam and legitimate e-mails
Performance Evaluation
Online supervised spam filter evaluation
ACM Transactions on Information Systems (TOIS)
An incremental cluster-based approach to spam filtering
Expert Systems with Applications: An International Journal
Adaptive e-mail intention finding mechanism based on e-mail words social networks
Proceedings of the 2007 workshop on Large scale attack defense
Lexicon randomization for near-duplicate detection with I-Match
The Journal of Supercomputing
Trusting spam reporters: A reporter-based reputation system for email filtering
ACM Transactions on Information Systems (TOIS)
Email Spam Filtering: A Systematic Review
Foundations and Trends in Information Retrieval
Journal of Computer Security
An Operable Email Based Intelligent Personal Assistant
World Wide Web
Review: A review of machine learning approaches to Spam filtering
Expert Systems with Applications: An International Journal
Journal of Computer Security - Best papers of the Sec Track at the 2006 ACM Symposium
Spam Filtering: the Influence of the Temporal Distribution of Training Data
Proceedings of the 2006 conference on STAIRS 2006: Proceedings of the Third Starting AI Researchers' Symposium
A survey of learning-based techniques of email spam filtering
Artificial Intelligence Review
Learning, detecting, understanding, and predicting concept changes
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Combining SVM classifiers for email anti-spam filtering
IWANN'07 Proceedings of the 9th international work conference on Artificial neural networks
Spam filtering and email-mediated applications
WImBI'06 Proceedings of the 1st WICI international conference on Web intelligence meets brain informatics
EuroGP'08 Proceedings of the 11th European conference on Genetic programming
A survey and experimental evaluation of image spam filtering techniques
Pattern Recognition Letters
Spam detection using web page content: a new battleground
Proceedings of the 8th Annual Collaboration, Electronic messaging, Anti-Abuse and Spam Conference
PCA document reconstruction for email classification
Computational Statistics & Data Analysis
A neural model in anti-spam systems
ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part II
On the utility of incremental feature selection for the classification of textual data streams
PCI'05 Proceedings of the 10th Panhellenic conference on Advances in Informatics
On effective e-mail classification via neural networks
DEXA'05 Proceedings of the 16th international conference on Database and Expert Systems Applications
Spam e-mail classification based on the IFWB algorithm
ACIIDS'13 Proceedings of the 5th Asian conference on Intelligent Information and Database Systems - Volume Part I
Hi-index | 0.00 |
Spam, also known as Unsolicited Commercial Email (UCE), is the bane of email communication. Many data mining researchers have addressed the problem of detecting spam, generally by treating it as a static text classification problem. True in vivo spam filtering has characteristics that make it a rich and challenging domain for data mining. Indeed, real-world datasets with these characteristics are typically difficult to acquire and to share. This paper demonstrates some of these characteristics and argues that researchers should pursue in vivo spam filtering as an accessible domain for investigating them.