Semi Supervised Image Spam Hunter: A Regularized Discriminant EM Approach

Authors:
Yan Gao;Ming Yang;Alok Choudhary
Affiliations:
Dept. of EECS, Northwestern University, Evanston, USA;NEC Laboratories America, Cupertino, USA;Dept. of EECS, Northwestern University, Evanston, USA
Venue:
ADMA '09 Proceedings of the 5th International Conference on Advanced Data Mining and Applications
Year:
2009

Citing 16
Cited 2

A Computational Approach to Edge Detection

IEEE Transactions on Pattern Analysis and Machine Intelligence
Support-Vector Networks

Machine Learning
Image Indexing Using Color Correlograms

CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Adversarial classification

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Face Recognition Using Laplacianfaces

IEEE Transactions on Pattern Analysis and Machine Intelligence
Awarded Best Paper! - Scalable Centralized Bayesian Spam Mitigation with Bogofilter

LISA '04 Proceedings of the 18th USENIX conference on System administration
Leveraging Social Networks to Fight Spam

Computer
Probabilistic Boosting-Tree: Learning Discriminative Models for Classification, Recognition, and Clustering

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Physics-motivated features for distinguishing photographic images and computer graphics

Proceedings of the 13th annual ACM international conference on Multimedia
Fast statistical spam filter by approximate classifications

SIGMETRICS '06/Performance '06 Proceedings of the joint international conference on Measurement and modeling of computer systems
Spam Filtering Based On The Analysis Of Text Information Embedded Into Images

The Journal of Machine Learning Research
Image Spam Filtering Using Visual Information

ICIAP '07 Proceedings of the 14th International Conference on Image Analysis and Processing
Detecting image spam using visual features and near duplicate detection

Proceedings of the 17th international conference on World Wide Web
Evaluation of spam detection and prevention frameworks for email and image spam: a state of art

Proceedings of the 10th International Conference on Information Integration and Web-based Applications & Services
Ubiquitously supervised subspace learning

IEEE Transactions on Image Processing
Support vector machines for spam categorization

IEEE Transactions on Neural Networks

A comprehensive approach to image spam detection: from server to client solution

IEEE Transactions on Information Forensics and Security
A survey of image spamming and filtering techniques

Artificial Intelligence Review

Quantified Score

Hi-index	0.00

Visualization

Abstract

Image spam is a new trend in the family of email spams. The new image spams employ a variety of image processing technologies to create random noises. In this paper, we propose a semi-supervised approach, regularized discriminant EM algorithm (RDEM), to detect image spam emails, which leverages small amount of labeled data and large amount of unlabeled data for identifying spams and training a classification model simultaneously. Compared with fully supervised learning algorithms, the semi-supervised learning algorithm is more suitedin adversary classification problems, because the spammers are actively protecting their work by constantly making changes to circumvent the spam detection. It makes the cost too high for fully supervised learning to frequently collect sufficient labeled data for training. Experimental results demonstrate that our approach achieves 91.66% high detection rate with less than 2.96% false positive rate, meanwhile it significantly reduces the labeling cost.