Taster's choice: a comparative analysis of spam feeds

  • Authors:
  • Andreas Pitsillidis;Chris Kanich;Geoffrey M. Voelker;Kirill Levchenko;Stefan Savage

  • Affiliations:
  • University of California, San Diego, San Diego, California, USA;University of Illinois, Chicago, Chicago, Illinois, USA;University of California, San Diego, San Diego, California, USA;University of California, San Diego, San Diego, California, USA;University of California, San Diego, San Diego, California, USA

  • Venue:
  • Proceedings of the 2012 ACM conference on Internet measurement conference
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

E-mail spam has been the focus of a wide variety of measurement studies, at least in part due to the plethora of spam data sources available to the research community. However, there has been little attention paid to the suitability of such data sources for the kinds of analyses they are used for. In spite of the broad range of data available, most studies use a single "spam feed" and there has been little examination of how such feeds may differ in content. In this paper we provide this characterization by comparing the contents of ten distinct contemporaneous feeds of spam-advertised domain names. We document significant variations based on how such feeds are collected and show how these variations can produce differences in findings as a result.