Design of the notification system for failure detectors

  • Authors:
  • Naohiro Hayashibara;Makoto Takizawa

  • Affiliations:
  • Faculty of Computer Science and Engineering, Department of Computer Science, Kyoto Sangyo University, Japan.;Faculty of Science and Technology, Department of Computers and Information Science, Seikei University, Japan

  • Venue:
  • International Journal of High Performance Computing and Networking
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

It is widely recognised that distributed systems would greatly benefit from the availability of a generic failure detection service. In this paper, we highlighted the issue on the construction of the monitoring network of failure detectors. We proposed an algorithm to construct and manage the monitoring network that each failure detector is monitored by some failure detectors. Notification of failures is propagated along the network. Especially it can involve various types of failure detectors from simple timeout-based failure detectors to accrual failure detectors, and help to spread information on suspected processes/nodes. In addition, we have made a simulation of the proposed algorithm for constructing the monitoring network. It shows that the algorithm is scalable for increasing the number of failure detectors.