Using evolutionary learning classifiers to do MobileSpam (SMS) filtering

Authors:
Muhammad Bilal Junaid;Muddassar Farooq
Affiliations:
National University of Computer and Emerging Sciences (NUCES), Islamabad, Pakistan;National University of Computer and Emerging Sciences (NUCES), Islamabad, Pakistan
Venue:
Proceedings of the 13th annual conference on Genetic and evolutionary computation
Year:
2011

Citing 9
Cited 2

Machine learning in automated text categorization

ACM Computing Surveys (CSUR)
Get Real! XCS with Continuous-Valued Inputs

Learning Classifier Systems, From Foundations to Applications
Accuracy-based learning classifier systems: models, analysis and applications to classification tasks

Evolutionary Computation
Feature engineering for mobile (SMS) spam filtering

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Classifier fitness based on accuracy

Evolutionary Computation
SMS: The Short Message Service

Computer
KEEL: a software tool to assess evolutionary algorithms for data mining problems

Soft Computing - A Fusion of Foundations, Methodologies and Applications - Special Issue on Evolutionary and Metaheuristics based Data Mining (EMBDM); Guest Editors: José A. Gámez, María J. del Jesús, José M. Puerta
Improved use of continuous attributes in C4.5

Journal of Artificial Intelligence Research
Induction of fuzzy-rule-based classifiers with evolutionary boosting algorithms

IEEE Transactions on Fuzzy Systems

Review: SMS spam filtering: Methods and data

Expert Systems with Applications: An International Journal
Simple SMS spam filtering on independent mobile phone

Security and Communication Networks

Quantified Score

Hi-index	0.00

Visualization

Abstract

In recent years, we have witnessed the dramatic increase in the volume of mobile SMS (Short Messaging Service) spam. The reason is that operators - owing to fierce market competition - have introduced packages that allow their customers to send unlimited SMS in less than $1 a month. It not only degrades the service of cellular operators but also compromises security and privacy of users. In this paper, we analyze SMS spam to identify novel features that distinguishes it from benign SMS (ham). The novelty of our approach is that we intercept the SMS at the access layer of a mobile phone - in hexadecimal format - and extract two features: (1) octet bigrams, and (2) frequency distribution of octets. Later, we provide these features to a number of evolutionary and non-evolutionary classifiers to identify the best classifier for our mobile spam filtering system. We evaluate the detection rate and false alarm rate of our system - using different classifiers - on a real world dataset. The results of our experiments show that sUpervised Classifier System (UCS), by operating on the the above-mentioned features'set, achieves more than 89% detection rate and 0% false alarm rate.