We present a virtual machine-based peer-to-peer platform that allows many Internet services to be backed up cooperatively. The platform aims to provide a highly available, service-independent solution that is cost-effective for smaller, independent service providers. Virtual machines make it possible to encapsulate the complete service state and to share physical hosts. A multicast protocol guarantees that service state is reliably replicated on multiple physical hosts, so that recent state can be recovered after a failure. We implemented a prototype and evaluated it experimentally to show that our design adapts to dynamic host changes and to measure the runtime and failure-recovery performance achievable with Xen and SBUML virtual machines.