ECUE: A Spam Filter that Uses Machine Learning to Track Concept Drift

  • Authors:
  • Sarah Jane Delany;Pádraig Cunningham;Barry Smyth

  • Affiliations:
  • Dublin Institute of Technology, Kevin St., Dublin 8, Ireland, email: sarahjane.delany@comp.dit.ie;Trinity College Dublin, Dublin 2, Ireland, email: padraig.cunningham@cs.tcd.ie;University College Dublin, Dublin 4, Ireland, email: barry.smyth@cs.ucd.ie

  • Venue:
  • Proceedings of the 2006 conference on ECAI 2006: 17th European Conference on Artificial Intelligence August 29 -- September 1, 2006, Riva del Garda, Italy
  • Year:
  • 2006

Abstract

While text classification has long been identified as a promising application area for Artificial Intelligence, few deployed applications have been described so far. In this paper we present a spam filtering system that uses example-based machine learning techniques to train a classifier from examples of spam and legitimate email. This approach has the advantage that it can be personalised to the specifics of the user's filtering preferences. The classifier can also adjust automatically over time to account for the changing nature of spam (and indeed changes in the profile of legitimate email). A significant software engineering challenge in developing this system was to ensure that it could interoperate with existing email systems, allowing easy management of the training data over time. The system has been deployed and evaluated over an extended period, and the results of this evaluation are presented here.
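
As an illustration of the general idea only, the following is a minimal sketch of an example-based filter that classifies by nearest neighbours and updates its stored examples when the user corrects a mistake. It is not the ECUE implementation: the abstract does not give the features, similarity measure, or update policy, so the whitespace tokenizer, Jaccard similarity, majority vote, fixed-capacity case base, and all names (`ExampleBasedFilter`, `feedback`, and so on) are assumptions made for this sketch.

```python
from collections import deque


def tokenize(text):
    """Lower-case the text and return its set of whitespace-separated tokens."""
    return set(text.lower().split())


def jaccard(a, b):
    """Jaccard similarity of two token sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


class ExampleBasedFilter:
    """Hypothetical k-nearest-neighbour spam filter over stored examples."""

    def __init__(self, k=3, capacity=1000):
        self.k = k
        # Assumed policy: a bounded case base that discards its oldest example
        # when full, so the stored examples follow recent mail.
        self.cases = deque(maxlen=capacity)

    def add_case(self, text, label):
        """Store one labelled example; label is 'spam' or 'ham'."""
        self.cases.append((tokenize(text), label))

    def classify(self, text):
        """Return 'spam' if a strict majority of the k nearest cases are spam."""
        tokens = tokenize(text)
        nearest = sorted(
            self.cases, key=lambda case: jaccard(tokens, case[0]), reverse=True
        )[: self.k]
        spam_votes = sum(1 for _, label in nearest if label == "spam")
        return "spam" if spam_votes * 2 > len(nearest) else "ham"

    def feedback(self, text, true_label):
        """Add the email to the case base only if it was misclassified."""
        if self.classify(text) != true_label:
            self.add_case(text, true_label)
            return True
        return False
```

In this sketch, personalisation comes from the fact that the stored examples are the user's own mail, and adaptation comes from `feedback`: each correction adds a new example, while the capacity limit retires old ones. A deployed system would need a more careful feature representation and update policy than this.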