A repair mechanism for fault-tolerance for tree-structured peer-to-peer systems

  • Authors:
  • Eddy Caron;Frédéric Desprez;Charles Fourdrignier;Franck Petit;Cédric Tedeschi

  • Affiliations:
  • LIP Laboratory, UMR CNRS-ENS Lyon-UCB Lyon-INRIA 5668, France;LIP Laboratory, UMR CNRS-ENS Lyon-UCB Lyon-INRIA 5668, France;LaRIA Laboratory, University of Picardie, Amiens, France;LaRIA Laboratory, University of Picardie, Amiens, France;LIP Laboratory, UMR CNRS-ENS Lyon-UCB Lyon-INRIA 5668, France

  • Venue:
  • HiPC'06 Proceedings of the 13th international conference on High Performance Computing
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Facing the limits of traditional tools of resource management within computational grids (related to scale, dynamicity, etc. of the platforms newly considered), new approaches, based on peer-to-peer technologies are emerging. The resource discovery and in particular the service discovery is concerned by this evolution. Among the solutions, a promising one is the indexing of resources using trie structures and more particularly prefix trees. The major advantages of trie-structured approaches is the capability to support search queries on ranges of values with a latency growing logarithmically in the number of nodes in the trie. Those techniques are easy to extend to multicriteria searches. One drawback of using tries is its inherent poor robustness in a dynamic environment, where nodes join and leave the network, leading to the split of the tree into a forest, which results in the impossibility to route requests. Within most recent approaches, the fault-tolerance is a prevention mechanism, often replication-based. The replication can be costly in term of resources required. In this paper, we propose a fault-tolerance protocol that reconnects subtrees a posteriori, after crashes, to have again a connected graph and then reorder the nodes to rebuild a consistent tree.