Comparing manual text patterns and machine learning for classification of e-mails for automatic answering by a government agency

  • Authors:
  • Hercules Dalianis;Jonas Sjöbergh;Eriks Sneiders

  • Affiliations:
  • Department of Computer and Systems Sciences, DSV, Stockholm University, Kista, Sweden;KTH CSC, Stockholm, Sweden;Department of Computer and Systems Sciences, DSV, Stockholm University, Kista, Sweden

  • Venue:
  • CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part II
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

E-mails to government institutions as well as to large companies may contain a large proportion of queries that can be answered in a uniform way. We analysed and manually annotated 4,404 e-mails from citizens to the Swedish Social Insurance Agency, and compared two methods for detecting answerable e-mails: manually-created text patterns (rule-based) and machine learning-based methods. We found that the text pattern-based method gave much higher precision at 89 percent than the machine learning-based method that gave only 63 percent precision. The recall was slightly higher (66 percent) for the machine learning-based methods than for the text patterns (47 percent). We also found that 23 percent of the total e-mail flow was processed by the automatic e-mail answering system.