Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
Adaptive information filtering using evolutionary computation
Information Sciences: an International Journal - Special issue on frontiers in evolutionary algorithms
Techniques of Cluster Algorithms in Data Mining
Data Mining and Knowledge Discovery
On Clustering Validation Techniques
Journal of Intelligent Information Systems
Genetic Algorithms Used to Solve Scheduling Problems
Cybernetics and Systems Analysis
Communications of the ACM - Program compaction
Fighting the spam wars: A remailer approach with restrictive aliasing
ACM Transactions on Internet Technology (TOIT)
Effective Summarization Method of Text Documents
WI '05 Proceedings of the 2005 IEEE/WIC/ACM International Conference on Web Intelligence
Spam Detection Using Text Clustering
CW '05 Proceedings of the 2005 International Conference on Cyberworlds
A Unified View on Clustering Binary Data
Machine Learning
An incremental cluster-based approach to spam filtering
Expert Systems with Applications: An International Journal
Spam Detection Using Dynamic Weighted Voting Based on Clustering
IITA '08 Proceedings of the 2008 Second International Symposium on Intelligent Information Technology Application - Volume 02
Symbiotic Data Mining for Personalized Spam Filtering
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
A survey of evolutionary algorithms for clustering
IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Spam Detection Using Feature Selection and Parameters Optimization
CISIS '10 Proceedings of the 2010 International Conference on Complex, Intelligent and Software Intensive Systems
Revealing social networks of spammers through spectral clustering
ICC'09 Proceedings of the 2009 IEEE international conference on Communications
Text Classification Based on Ant Colony Optimization
ICIC '10 Proceedings of the 2010 Third International Conference on Information and Computing - Volume 03
A review: accuracy optimization in clustering ensembles using genetic algorithms
Artificial Intelligence Review
A term weighting approach for text categorization
AIRS'05 Proceedings of the Second Asia conference on Asia Information Retrieval Technology
A supervised clustering and classification algorithm for mining data with mixed variables
IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
Spam e-mail classification based on the IFWB algorithm
ACIIDS'13 Proceedings of the 5th Asian conference on Intelligent Information and Database Systems - Volume Part I
Hi-index | 0.00 |
A new method for clustering of spam messages collected in bases of antispam system is offered. The genetic algorithm is developed for solving clustering problems. The objective function is a maximization of similarity between messages in clusters, which is defined by k-nearest neighbor algorithm. Application of genetic algorithm for solving constrained problems faces the problem of constant support of chromosomes which reduces convergence process. Therefore, for acceleration of convergence of genetic algorithm, a penalty function that prevents occurrence of infeasible chromosomes at ranging of values of function of fitness is used. After classification, knowledge extraction is applied in order to get information about classes. Multidocument summarization method is used to get the information portrait of each cluster of spam messages. Classifying and parametrizing spam templates, it will be also possible to define the thematic dependence from geographical dependence (e.g., what subjects prevail in spam messages sent from certain countries). Thus, the offered system will be capable to reveal purposeful information attacks if those occur. Analyzing origins of the spam messages from collection, it is possible to define and solve the organized social networks of spammers.