Study on Ensemble Classification Methods towards Spam Filtering

  • Authors:
  • Jinlong Wang;Ke Gao;Yang Jiao;Gang Li

  • Affiliations:
  • School of Computer Engineering, Qingdao Technological University, Qingdao, China 266033;School of Computer Engineering, Qingdao Technological University, Qingdao, China 266033;School of Computer Engineering, Qingdao Technological University, Qingdao, China 266033;School of Information Technology, Deakin University, Victoria, Australia 3125

  • Venue:
  • ADMA '09 Proceedings of the 5th International Conference on Advanced Data Mining and Applications
  • Year:
  • 2009

Quantified Score

Hi-index 0.01

Visualization

Abstract

Recently, many scholars make use of fusion of filters to enhance the performance of spam filtering. In the past several years, a lot of effort has been devoted to different ensemble methods to achieve better performance. In reality, how to select appropriate ensemble methods towards spam filtering is an unsolved problem. In this paper, we investigate this problem through designing a framework to compare the performances among various ensemble methods. It is helpful for researchers to fight spam email more effectively in applied systems. The experimental results indicate that online based methods perform well on accuracy, while the off-line batch methods are evidently influenced by the size of data set. When a large data set is involved, the performance of off-line batch methods is not at par with online methods, and in the framework of online methods, the performance of parallel ensemble is better when using complex filters only.