Spam Filtering using a Markov Random Field Model with Variable Weighting Schemas

  • Authors:
  • Shalendra Chhabra;William S. Yerazunis;Christian Siefkes

  • Affiliations:
  • UC Riverside, CA;MERL, Cambridge, Massachusetts;GKVI/FU Berlin, Germany

  • Venue:
  • ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we present a Markov Random Field model based approach to filter spam. Our approach examines the importance of the neighborhood relationship (MRF cliques) among words in an email message for the purpose of spam classification. We propose and test several different theoretical bases for weighting schemes among corresponding neighborhood windows. Our results demonstrate that unexpected side effects depending on the neighborhood window size may have larger accuracy impact than the neighborhood relationship effects of the Markov Random Field.