Grindstone4Spam: An optimization toolkit for boosting e-mail classification

  • Authors:
  • José R. MéNdez;M. Reboiro-Jato;Fernando DíAz;Eduardo DíAz;Florentino Fdez-Riverola

  • Affiliations:
  • Escuela Superior de Ingeniería Informática, University of Vigo, Ourense, Spain;Escuela Superior de Ingeniería Informática, University of Vigo, Ourense, Spain;Escuela Universitaria de Informática, University of Valladolid, Segovia, Spain;Ultreia Comunicaciones S.L., Vigo, Spain;Escuela Superior de Ingeniería Informática, University of Vigo, Ourense, Spain

  • Venue:
  • Journal of Systems and Software
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Resulting from the huge expansion of Internet usage, the problem of unsolicited commercial e-mail (UCE) has grown astronomically. Although a good number of successful content-based anti-spam filters are available, their current utilization in real scenarios is still a long way off. In this context, the SpamAssassin filter offers a rule-based framework that can be easily used as a powerful integration and deployment tool for the fast development of new anti-spam strategies. This paper presents Grindstone4Spam, a publicly available optimization toolkit for boosting SpamAssassin performance. Its applicability has been verified by comparing its results with those obtained by the default SpamAssassin software as well as four well-known anti-spam filtering techniques such as Naive Bayes, Flexible Bayes, Adaboost and Support Vector Machines in two different case studies. The performance of the proposed alternative clearly outperforms existing approaches working in a cost-sensitive scenario.