Experimental Evaluation of Behavior-Based Failure-Detection Schemes in Real-Time Communication Networks

  • Authors:
  • Seungjae Han;Kang G. Shin

  • Affiliations:
  • The Univ. of Michigan, Ann Arbor;-

  • Venue:
  • IEEE Transactions on Parallel and Distributed Systems
  • Year:
  • 1999

Quantified Score

Hi-index 0.00

Visualization

Abstract

Effective detection of failures is essential for reliable communication services. Traditionally, non-real-time computer networks have relied on behavior-based techniques for detecting communication failures. That is, each node uses heartbeats to detect the failure of its neighbors and the end-to-end transport protocol (e.g., TCP) achieves reliable communication by acknowledgment/retransmission. Recently, there has been a growing demand for reliable 驴real-time驴 communication, but little research has been done on the failure detection problem. In this paper, we present two behavior-based failure-detection schemes驴neighbor detection and end-to-end detection驴for reliable real-time communication services and experimentally evaluate their effectiveness. Specifically, we measure and analyze the coverage and latency of these detection schemes through fault-injection experiments. The experimental results have shown that nearly all failures can be detected very quickly by the neighbor detection scheme, while the end-to-end detection scheme uncovers the remaining failures with larger detection latencies.