Impossibility of distributed consensus with one faulty process
Journal of the ACM (JACM)
Unreliable failure detectors for reliable distributed systems
Journal of the ACM (JACM)
An improved algorithm for decentralized extrema-finding in circular configurations of processes
Communications of the ACM
Fault-Tolerant Ethernet for IP-Based Process Control: A Demonstration
DSN '00 Proceedings of the 2000 International Conference on Dependable Systems and Networks (formerly FTCS-30 and DCCA-8)
Fault avoidance and recovery for distributed multimedia multicast
IW-MMDBMS '96 Proceedings of the 1996 International Workshop on Multi-Media Database Management Systems (IW-MMDBMS '96)
Elections in a Distributed Computing System
IEEE Transactions on Computers
Hi-index | 0.00 |
In this paper, we present a new fault-tolerant Ethernet scheme called SAFE (Scalable Autonomous Fault-tolerant Ethernet). SAFE scheme is based on software approach which takes place in layer 2 and layer 3 of the OSIRM. The goal of SAFE is to provide scalability, autonomous fault detection and recovery. SAFE divides a network into several subnets and limits the number of nodes in a subnet. Network can be extended by adding additional subnets. All nodes in a subnet automatically detect faults and perform fail-over by sending and receiving Ethernet based heartbeat each other. For inter-subnet fault recovery, SAFE manages master nodes in each subnet. Master nodes communicate each other using IP packets to exchange the subnet status. We also propose a master election algorithm to recover faults of master node automatically. Proposed SAFE performs efficiently for large scale network and provides fast and autonomous fault recovery.