On the Probability of Facing Fault Patterns: A Performance and Comparison Measure of Network Fault-Tolerance

Authors:
Farshad Safaei;Ahmad Khonsari;Reza Moraveji
Affiliations:
Dept. of ECE, Shahid Beheshti Univ., Tehran, Iran;Dept. of ECE, Univ. of Tehran, Tehran, Iran and IPM School of Computer Science, Tehran, Iran;Dept. of ECE, Shahid Beheshti Univ., Tehran, Iran and IPM School of Computer Science, Tehran, Iran
Venue:
ICCS '08 Proceedings of the 8th international conference on Computational Science, Part I
Year:
2008

Citing 10
Cited 1

Dynamically Configurable Message Flow Control for Fault-Tolerant Routing

IEEE Transactions on Parallel and Distributed Systems
Software-Based Rerouting for Fault-Tolerant Pipelined Communication

IEEE Transactions on Parallel and Distributed Systems
A Fault-Tolerant Routing Scheme for Meshes with Nonconvex Faults

IEEE Transactions on Parallel and Distributed Systems
Interconnection Networks: An Engineering Approach

Interconnection Networks: An Engineering Approach
Performance Evaluation and Design Trade-Offs for Network-on-Chip Interconnect Architectures

IEEE Transactions on Computers
A Routing Methodology for Achieving Fault Tolerance in Direct Networks

IEEE Transactions on Computers
Characterization of spatial fault patterns in interconnection networks

Parallel Computing
A new performance measure for characterizing fault rings in interconnection networks

Information Sciences: an International Journal
Topological Structure and Analysis of Interconnection Networks

Topological Structure and Analysis of Interconnection Networks
A fault-tolerant permutation routing algorithm in mobile ad-hoc networks

ICN'05 Proceedings of the 4th international conference on Networking - Volume Part II

Analytical modeling and comparison of fault-tolerant message flow control mechanisms in torus-connected networks

The Journal of Supercomputing

Quantified Score

Hi-index	0.00

Visualization

Abstract

An important issue in the design and deployment of interconnection networks is the issue of network fault-tolerance for various types of failures. In designing parallel processing using torus as the underlying interconnection topology as well as in designing real applications on such processors, the estimates of the network reliability and fault-tolerance are important in choosing the routing algorithms and predicting their performance in the presence of faulty nodes. Under node-failure model, the faulty nodes may coalesce into fault patterns, which classified into two major categories, i.e., convex (|-shaped, $\Box$-shaped) and concave (L-shaped, T-shaped, +-shaped, H-shaped, U-shaped) regions. In this correspondence, we propose the first solution for computing the probability of message facing the fault patterns in tori both for convex and concave regions that is verified using simulation experiments. Our approach works for any number of faults as long as the network remains connected. We use these models to measure the network faulttolerance that can be achieved by adaptive routings, and to assess the impact of various fault patterns on the performance of such networks.