Metric convergence in social network sampling

  • Authors:
  • Christian Doerr;Norbert Blenn

  • Affiliations:
  • Delft University of Technology, Delft, Netherlands;Delft University of Technology, Dleft, Netherlands

  • Venue:
  • Proceedings of the 5th ACM workshop on HotPlanet
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

While enabling new research questions and methodologies, the massive size of social media platforms also poses a significant issue for the analysis of these networks. In order to deal with this data volume, researchers typically turn to samples of these graph structures to conduct their analysis. This however raises the question about the representativeness of such limited crawls, and the amount of data necessary to come to stable predictions about the underlying systems. This paper analyzes the convergence of six commonly used topological metrics as a function of the crawling method and sample size used. We find that graph crawling methods drastically over- and underestimate network metrics, and that a non-trivial amount of data is needed to arrive at a stable estimate of the underlying network.