Using Discriminant Eigenfeatures for Image Retrieval
IEEE Transactions on Pattern Analysis and Machine Intelligence
Internet: saving private e-mail
IEEE Spectrum
CNSR '04 Proceedings of the Second Annual Conference on Communication Networks and Services Research
Discriminative Common Vectors for Face Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence
An Empirical Performance Comparison of Machine Learning Methods for Spam E-Mail Categorization
HIS '04 Proceedings of the Fourth International Conference on Hybrid Intelligent Systems
On the utility of incremental feature selection for the classification of textual data streams
PCI'05 Proceedings of the 10th Panhellenic conference on Advances in Informatics
Support vector machines for spam categorization
IEEE Transactions on Neural Networks
Subspace based feature selection for pattern recognition
Information Sciences: an International Journal
A novel probabilistic feature selection method for text classification
Knowledge-Based Systems
The impact of preprocessing on text classification
Information Processing and Management: an International Journal
Hi-index | 0.00 |
Electronic mail is an important communication method for most computer users. Spam e-mails however consume bandwidth resource, fill-up server storage and are also a waste of time to tackle.The general way to label an e-mail as spam or non-spam is to set up a finite set of discriminative features and use a classifier for the detection. In most cases, the selection of such features is empirically verified. In this paper, two different methods are proposed to select the most discriminative features among a set of reasonably arbitrary features for spam e-mail detection. The selection methods are developed using the Common Vector Approach (CVA) which is actually a subspace-based pattern classifier.Experimental results indicate that the proposed feature selection methods give considerable reduction on the number of features without affecting recognition rates.