Spam Detection Using Dynamic Weighted Voting Based on Clustering

  • Authors:
  • Mehrnoush Famil Saeedian;Hamid Beigy

  • Affiliations:
  • -;-

  • Venue:
  • IITA '08 Proceedings of the 2008 Second International Symposium on Intelligent Information Technology Application - Volume 02
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In the last decade spam detection has been addressed as a text classification or categorization problem. In this paper we propose a new dynamic weighted voting method based on the combination of clustering and weighted voting, and apply it to the task of spam filtering. In order to classify a new sample, it first compares with all cluster centroids and its similarity to each cluster is identified; Classifiers in the vicinity of the input sample obtain greater weight for the final decision of the ensemble. The evaluation shows that the algorithm outperforms pure SVM.