Estimating PageRank on graph streams

  • Authors:
  • Atish Das Sarma;Sreenivas Gollapudi;Rina Panigrahy

  • Affiliations:
  • Georgia Institute of Technology, Mountain View, CA;Microsoft Research, Mountain View, CA;Microsoft Research

  • Venue:
  • Journal of the ACM (JACM)
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

This article focuses on computations on large graphs (e.g., the web-graph) where the edges of the graph are presented as a stream. The objective in the streaming model is to use small amount of memory (preferably sub-linear in the number of nodes n) and a smaller number of passes. In the streaming model, we show how to perform several graph computations including estimating the probability distribution after a random walk of length l, the mixing time M, and other related quantities such as the conductance of the graph. By applying our algorithm for computing probability distribution on the web-graph, we can estimate the PageRank p of any node up to an additive error of &sqrt;ε p+ε in Õ(&sqrt;M/α) passes and Õ(min(nα+1/ε&sqrt;M/α+(1/ε)Mα, α n&sqrt;Mα + (1/ε)&sqrt;M/α)) space, for any α ∈ (0,1]. Specifically, for ε = M/n, α = M−1/2, we can compute the approximate PageRank values in Õ(nM−1/4) space and Õ(M3/4) passes. In comparison, a standard implementation of the PageRank algorithm will take O(n) space and O(M) passes. We also give an approach to approximate the PageRank values in just Õ(1) passes although this requires Õ(nM) space.