A mixed MPI-Thread approach for parallel page ranking computation

  • Authors:
  • Bundit Manaskasemsak;Putchong Uthayopas;Arnon Rungsawang

  • Affiliations:
  • Massive Information & Knowledge Engineering, Department of Computer Engineering, Faculty of Engineering, Kasetsart University, Bangkok, Thailand;Thai National Grid Center, Software Industry Promotion Agency, Ministry of Information and Communication Technology, Thailand;Massive Information & Knowledge Engineering, Department of Computer Engineering, Faculty of Engineering, Kasetsart University, Bangkok, Thailand

  • Venue:
  • ODBASE'06/OTM'06 Proceedings of the 2006 Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, DOA, GADA, and ODBASE - Volume Part II
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

The continuing growth of the Internet challenges search engine providers to deliver up-to-date and relevant search results A critical component is the availability of a rapid, scalable technique for PageRank computation of a large web graph In this paper, we propose an efficient parallelized version of the PageRank algorithm based on a mixed MPI and multi-threading model The parallel adaptive PageRank algorithm is implemented and tested on two clusters of SMP hosts In the algorithm, communications between processes on different hosts are managed by a message passing (MPI) model, while those between threads are handled via a inter-thread mechanism We construct a synthesized web graph of approximately 62.6 million nodes and 1.37 billion hyperlinks to test the algorithm on two SMP clusters Preliminary results show that significant speedups are possible; however, large inter-node synchronization operations and issues of shared memory access inhibit efficient CPU utilization We believe that the proposed approach shows promise for large-scale PageRank applications and improvements in the algorithm can achieve more efficient CPU utilization.