ER-TCP: an efficient TCP fault-tolerance scheme for cluster computing
The Journal of Supercomputing
Quality of Service guarantees and fault-tolerant TCP services in mobile wireless optical networks
International Journal of Ad Hoc and Ubiquitous Computing
CoRAL: A transparent fault-tolerant web service
Journal of Systems and Software
Practical and low-overhead masking of failures of TCP-based servers
ACM Transactions on Computer Systems (TOCS)
Context-aware fault tolerance in migratory services
Proceedings of the 5th Annual International Conference on Mobile and Ubiquitous Systems: Computing, Networking, and Services
ER-TCP: an efficient fault-tolerance scheme for TCP connections
ISPA'05 Proceedings of the Third international conference on Parallel and Distributed Processing and Applications
Hi-index | 0.00 |
With the Internet increasingly being used as access medium for a variety of critical services, there is a growing need to provide fault tolerant services over internetworks, in a completely client-transparent fashion. We present HYDRANET-FT, an infrastructure to dynamically replicate services across an internetwork and have the replicas provide a single fault tolerant service access point to clients. HYDRANET-FT uses the TCP communication protocol with a few modifications on the server side to allow one-to-many message delivery from a client to service replicas and many-to-one message delivery from the replicas to the client. A communication channel between the replicas provides atomicity and message ordering. A low-latency failure estimator is used to detect failures of servers in the system and initiate fail-over mechanisms. An implementation and measurements on a local testbed show that the overhead of our scheme is reasonably small.