Incremental E-Mail Classification and Rule Suggestion Using Simple Term Statistics

  • Authors:
  • Alfred Krzywicki;Wayne Wobcke

  • Affiliations:
  • School of Computer Science and Engineering, University of New South Wales, Sydney, Australia NSW 2052;School of Computer Science and Engineering, University of New South Wales, Sydney, Australia NSW 2052

  • Venue:
  • AI '09 Proceedings of the 22nd Australasian Joint Conference on Advances in Artificial Intelligence
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we present and use a method for e-mail categorization based on simple term statistics updated incrementally. We apply simple term statistics to two different tasks. The first task is to predict folders for classification of e-mails when large numbers of messages are required to remain unclassified. The second task is to support users who define rule bases for the same classification task, by suggesting suitable keywords for constructing Ripple Down Rule bases in this scenario. For both tasks, the results are compared with a number of standard machine learning algorithms. The comparison shows that the simple term statistics method achieves a higher level of accuracy than other machine learning methods when taking computation time into account.