Random walk-based graphical sampling in unbalanced heterogeneous bipartite social graphs

  • Authors:
  • Yusheng Xie;Zhengzhang Chen;Ankit Agrawal;Alok Choudhary;Lu Liu

  • Affiliations:
  • Northwestern University, Evanston, Illinois, USA;Northwestern University, Evanston, Illinois, USA;Northwestern University, Evanston, Illinois, USA;Northwestern University, Evanston, Illinois, USA;Northwestern University, Evanston, Illinois, USA

  • Venue:
  • Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

We investigate sampling techniques in unbalanced heterogeneous bipartite graphs (UHBGs), which have wide applications in real world web-scale social networks. We propose random walked-based link sampling and stratified sampling for UHBGs and show that they have advantages over generic random walk samplers. In addition, each sampler's node degree distribution parameter estimator statistic is analytically derived to be used as a quality indicator. In the experiments, we apply the two sampling techniques, with a baseline node sampling method, to both synthetic and real Facebook data. The experimental results show that random walk-based stratified sampler has significant advantage over node sampler and link sampler on UHBGs.