Efficiently Handling Dynamics in Distributed Link Based Authority Analysis

  • Authors:
  • Josiane Xavier Parreira;Sebastian Michel;Gerhard Weikum

  • Affiliations:
  • Max-Planck Institute for Informatics, Saarbrücken, Germany;Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland;Max-Planck Institute for Informatics, Saarbrücken, Germany

  • Venue:
  • WISE '08 Proceedings of the 9th international conference on Web Information Systems Engineering
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Link based authority analysis is an important tool for ranking resources in social networks and other graphs. Previous work have presented $\mathrm{J^{X}_P}$, a decentralized algorithm for computing PageRank scores. The algorithm is designed to work in distributed systems, such as peer-to-peer (P2P) networks. However, the dynamics of the P2P networks, one if its main characteristics, is currently not handled by the algorithm. This paper shows how to adapt $\mathrm{J^{X}_P}$ to work under network churn. First, we present a distributed algorithm that estimates the number of distinct documents in the network, which is needed in the local computation of the PageRank scores. We then present a method that enables each peer to detect other peers leave and to update its view of the network. We show that the number of stored items in the network can be efficiently estimated, with little overhead on the network traffic. Second, we present an extension of the original $\mathrm{J^{X}_P}$ algorithms that can cope with network and content dynamics. We show by a comprehensive performance analysis the practical usability of our approach. The proposed estimators together with the changes in the core $\mathrm{J^{X}_P}$ components allow for a fast and authority score computation even under heavy churn. We believe that this is the last missing step toward the application of distributed PageRank measures in real-life large-scale applications.