Towards the Orwellian nightmare: separation of business and personal emails

  • Authors:
  • Sanaz Jabbari;Ben Allison;David Guthrie;Louise Guthrie

  • Affiliations:
  • University of Sheffield, Sheffield;University of Sheffield, Sheffield;University of Sheffield, Sheffield;University of Sheffield, Sheffield

  • Venue:
  • COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes the largest scale annotation project involving the Enron email corpus to date. Over 12,500 emails were classified, by humans, into the categories "Business" and "Personal", and then sub-categorised by type within these categories. The paper quantifies how well humans perform on this task (evaluated by inter-annotator agreement). It presents the problems experienced with the separation of these language types. As a final section, the paper presents preliminary results using a machine to perform this classification task.