Social network analysis for email classification

  • Authors: K. Yelupula; Srini Ramaswamy

  • Affiliations: Little Rock, AR; Little Rock, AR

  • Venue: Proceedings of the 46th Annual Southeast Regional Conference on XX
  • Year: 2008

Abstract

The availability of large email corpora in organizations, such as the Enron dataset used here, motivates this work. We attempt to predict the organizational structure of Enron by applying data mining algorithms and methodologies to this email dataset. The primary approach is the analysis of email flows within the organization. Our results show that significant information about an organization's structure can be obtained even if the body (content) of the emails is ignored. Simple email flow analysis and associated statistics extract enough relevant data about the email social network to give an overall picture of the organizational structure. The longer-term objective is to show that readily available information can be used to determine relevant metrics by which one can reconstruct and verify the approximate social hierarchies within an organization or company.
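
As an illustration of what header-only email flow analysis can look like, the following is a minimal sketch and not the authors' implementation. The abstract does not say which statistics were used, so the per-person counts (messages sent, messages received, distinct contacts) and the ranking rule are assumptions chosen for illustration, as are the function names `flow_statistics` and `rank_by_contacts` and the sample addresses.

```python
from collections import Counter, defaultdict


def flow_statistics(messages):
    """Compute simple per-person flow statistics.

    `messages` is an iterable of (sender, recipients) pairs taken from
    email headers only; message bodies are never read.
    """
    sent = Counter()               # sender -> number of (sender, recipient) links
    received = Counter()           # recipient -> number of messages received
    contacts = defaultdict(set)    # person -> distinct people exchanged with

    for sender, recipients in messages:
        for recipient in recipients:
            if recipient == sender:
                continue  # ignore self-addressed copies
            sent[sender] += 1
            received[recipient] += 1
            contacts[sender].add(recipient)
            contacts[recipient].add(sender)

    people = set(sent) | set(received)
    return {
        p: {
            "sent": sent[p],
            "received": received[p],
            "contacts": len(contacts[p]),
        }
        for p in people
    }


def rank_by_contacts(stats):
    """Order people by distinct contacts, then messages received, then name.

    This ordering is an assumed proxy for position in the hierarchy,
    not a metric taken from the paper.
    """
    return sorted(
        stats,
        key=lambda p: (-stats[p]["contacts"], -stats[p]["received"], p),
    )


if __name__ == "__main__":
    # Hypothetical header data: (sender, [recipients])
    sample = [
        ("alice", ["bob", "carol"]),
        ("bob", ["alice"]),
        ("carol", ["alice"]),
        ("dave", ["alice"]),
    ]
    stats = flow_statistics(sample)
    for person in rank_by_contacts(stats):
        print(person, stats[person])
```

In this toy input, the person who exchanges mail with the most distinct colleagues ranks first. On a real corpus, such counts would only be a starting point for the kind of structural inference the abstract describes.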