Active learning for online spam filtering

  • Authors:
  • Wuying Liu;Ting Wang

  • Affiliations:
  • School of Computer, National University of Defense Technology, Changsha, Hunan, P.R. China;School of Computer, National University of Defense Technology, Changsha, Hunan, P.R. China

  • Venue:
  • AIRS'08 Proceedings of the 4th Asia information retrieval conference on Information retrieval technology
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Spam filtering is defined as a task trying to label emails with spam or ham in an online situation. The online feature requires the spam filter has a strong timely generalization and has a high processing speed. Machine learning can be employed to fulfill the two requirements. In this paper, we propose a SVMEL (SVM Ensemble Learning) method to combine five simple filters for higher accuracy and an active learning method to choose training emails for less training time. The experiments results show the filter applying active learning method can reduce requirements of labeled training emails and reach steady-state performance more quickly.