Parallel PageRank Computation on a Gigabit PC Cluster

  • Authors:
  • Bundit Manaskasemsak; Arnon Rungsawang

  • Venue:
  • AINA '04 Proceedings of the 18th International Conference on Advanced Information Networking and Applications - Volume 2
  • Year:
  • 2004

Abstract

Efficiently computing PageRank scores for a large web graph is one of the hot issues in the Web-IR community. Recent research proposes to accelerate the computation in both algorithmic and architectural ways. Here we focus on a parallel PageRank computational architecture on a cluster of Opteron PCs networked via Gigabit Ethernet. We propose both an efficient parallel algorithm for the standard PageRank computation and a simple pairwise communication model needed to synchronize local PageRank scores between processors. Our experimental results, conducted on a large web graph of over 1.5 billion links synthesized from a real set of crawled web pages in the TH domain, are quite promising. The current implementation takes less than 15 seconds per iteration.
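
The abstract does not spell out the algorithm, so the following is only a minimal single-process sketch of the general idea of partitioned PageRank with a pairwise exchange of partial scores. The function name `partitioned_pagerank`, the contiguous block partitioning, the damping factor of 0.85, and the uniform handling of dangling nodes are all illustrative assumptions, not details taken from the paper; the "workers" are simulated in a loop rather than run on separate machines.

```python
def partitioned_pagerank(out_links, num_workers=2, damping=0.85, iterations=50):
    """Simulate partitioned PageRank in one process (illustrative sketch only).

    out_links maps each node id (0..n-1) to the list of nodes it links to.
    """
    n = len(out_links)
    # Assumption: nodes are assigned to workers in contiguous blocks.
    block = (n + num_workers - 1) // num_workers

    def owner(v):
        return v // block

    rank = [1.0 / n] * n
    for _ in range(iterations):
        # outbox[w][p] holds partial scores that worker w computed
        # for destination nodes owned by worker p.
        outbox = [[{} for _ in range(num_workers)] for _ in range(num_workers)]
        dangling = [0.0] * num_workers
        for u, dests in out_links.items():
            w = owner(u)
            if not dests:
                # Nodes without out-links: their score is spread uniformly later.
                dangling[w] += rank[u]
                continue
            share = rank[u] / len(dests)
            for v in dests:
                box = outbox[w][owner(v)]
                box[v] = box.get(v, 0.0) + share

        # Simulated pairwise exchange: each worker p receives the box
        # addressed to it from every worker w and accumulates the scores.
        incoming = [0.0] * n
        for w in range(num_workers):
            for p in range(num_workers):
                for v, score in outbox[w][p].items():
                    incoming[v] += score

        dangling_share = sum(dangling) / n
        rank = [(1.0 - damping) / n + damping * (incoming[v] + dangling_share)
                for v in range(n)]
    return rank


if __name__ == "__main__":
    # Tiny hypothetical graph: node 2 is linked to by nodes 0, 1, and 3.
    graph = {0: [1, 2], 1: [2], 2: [0], 3: [2]}
    for node, score in enumerate(partitioned_pagerank(graph)):
        print(node, round(score, 4))
```

In a real cluster setting each worker would hold only its own block of the graph and the exchange step would go over the network, for example via message passing; batching partial scores per destination worker, as the `outbox` structure does here, is one common way to keep the number of messages per iteration small.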