Concurrent rollback for crash recovery in extended hypercube networks

  • Authors:
  • Tong-Ying Juang;C. P. Chiu;Kun-Ming Yu

  • Affiliations:
  • -;-;-

  • Venue:
  • PAS '95 Proceedings of the First Aizu International Symposium on Parallel Algorithms/Architecture Synthesis
  • Year:
  • 1995

Quantified Score

Hi-index 0.00

Visualization

Abstract

Recovering from processor failures is an important problem in the design and development of reliable systems. We present a concurrent rollback algorithm in extended hypercube networks to recover from crash failures which involves small message and time complexities. The network of an extended hypercube is a hierarchical, low diameter, recursive structure. By appending only O(1) additional information to each message, we use less than O(Nlog N) message exchanges and O(log/sup 2/ N) time elapsed for recovery work where N is the number of processors of the extended hypercube network. The algorithms can be used to recover from the failure of an arbitrary number of processors.