An evaluation of community detection algorithms on large-scale email traffic

  • Authors:
  • Farnaz Moradi;Tomas Olovsson;Philippas Tsigas

  • Affiliations:
  • Computer Science and Engineering, Chalmers University of Technology, Göteborg, Sweden;Computer Science and Engineering, Chalmers University of Technology, Göteborg, Sweden;Computer Science and Engineering, Chalmers University of Technology, Göteborg, Sweden

  • Venue:
  • SEA'12 Proceedings of the 11th international conference on Experimental Algorithms
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Community detection algorithms are widely used to study the structural properties of real-world networks. In this paper, we experimentally evaluate the qualitative performance of several community detection algorithms using large-scale email networks. The email networks were generated from real email traffic and contain both legitimate email (ham) and unsolicited email (spam). We compare the quality of the algorithms with respect to a number of structural quality functions and a logical quality measure which assesses the ability of the algorithms to separate ham and spam emails by clustering them into distinct communities. Our study reveals that the algorithms that perform well with respect to structural quality, don't achieve high logical quality. We also show that the algorithms with similar structural quality also have similar logical quality regardless of their approach to clustering. Finally, we reveal that the algorithm that performs link community detection is more suitable for clustering email networks than the node-based approaches, and it creates more distinct communities of ham and spam edges.