Distributed Pagerank for P2P Systems

Authors:
Karthikeyan Sankaralingam;Simha Sethumadhavan;James C. Browne
Affiliations:
-;-;-
Venue:
HPDC '03 Proceedings of the 12th IEEE International Symposium on High Performance Distributed Computing
Year:
2003

Citing 0
Cited 17

Distributed Pagerank: A Distributed Reputation Model for Open Peer-to-Peer Networks

SAINT-W '04 Proceedings of the 2004 Symposium on Applications and the Internet-Workshops (SAINT 2004 Workshops)
General parallel computations on desktop grid and P2P systems

LCR '04 Proceedings of the 7th workshop on Workshop on languages, compilers, and run-time support for scalable systems
Propagation Models for Trust and Distrust in Social Networks

Information Systems Frontiers
Survey of research towards robust peer-to-peer networks: search methods

Computer Networks: The International Journal of Computer and Telecommunications Networking
Hybrid global-local indexing for effcient peer-to-peer information retrieval

NSDI'04 Proceedings of the 1st conference on Symposium on Networked Systems Design and Implementation - Volume 1
pFusion: A P2P Architecture for Internet-Scale Content-Based Search and Retrieval

IEEE Transactions on Parallel and Distributed Systems
Bubblestorm: resilient, probabilistic, and exhaustive peer-to-peer search

Proceedings of the 2007 conference on Applications, technologies, architectures, and protocols for computer communications
A survey on distributed approaches to graph based reputation measures

Proceedings of the 2nd international conference on Performance evaluation methodologies and tools
Usage-based ranking of distributed XML data

Proceedings of the 2008 ACM symposium on Applied computing
ProMail: using progressive email social network for spam detection

PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
Review: A survey on content-centric technologies for the current Internet: CDN and P2P solutions

Computer Communications
A link-based ranking model for services

ODBASE'06/OTM'06 Proceedings of the 2006 Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, DOA, GADA, and ODBASE - Volume Part I
A mixed MPI-Thread approach for parallel page ranking computation

ODBASE'06/OTM'06 Proceedings of the 2006 Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, DOA, GADA, and ODBASE - Volume Part II
Efficient parallel computation of pagerank

ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Parallel PageRank computation using GPUs

Proceedings of the Third Symposium on Information and Communication Technology
From credit and risk to trust: towards a credit flow based trust model for social networks

Proceedings of the 17th ACM international conference on Supporting group work
Asynchronous distributed power iteration with gossip-based normalization

Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper de.nes and describes a fully distributed implementation of Google's highly effective Pagerank algorithm, for "peer to peer" (P2P) systems. The implementation is based on chaotic (asynchronous) iterative solution of linear systems. The P2P implementation also enables incremental computation of pageranks as new documents are entered into or deleted from the network. Incremental update enables continuously accurate pageranks whereas the currently centralized web crawl and computation over Internet documents requires several days. This suggests possible applicability of the distributed algorithm to pagerank computations as a replacement for the centralized web crawler based implementation for Internet documents. A complete solution of the distributed pagerank computation for an inplace network converges rapidly (1% accuracy in 10 iterations) for large systems although the time for an iteration may be long. The incremental computation resulting from addition of a single document converges extremely rapidly, typically requiring update path lengths of under 15 nodes even for large networks and very accurate solutions.This implementation of Pagerank provides a uniform ranking scheme for documents in P2P systems, and its integration with P2P keyword search provides one solution to the network traf.c problems engendered by return of document hits. In basic P2P keyword search, all the document hits must be returned to the querying node causing large network traffic. An incremental keyword search algorithm for P2P keyword search where document hits are sorted by pagerank, and incrementally returned to the querying node is proposed and evaluated. Integration of this algorithm into P2P keyword search can produce dramatic benefit both in terms of effectiveness for users and decrease in network traffic. The incremental search algorithm provided approximately a ten-fold reduction in network traffic for two-wordand three-word queries.