Xen and the art of virtualization
SOSP '03 Proceedings of the nineteenth ACM symposium on Operating systems principles
Using time travel to diagnose computer problems
Proceedings of the 11th workshop on ACM SIGOPS European workshop
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Vigilant: out-of-band detection of failures in virtual machines
ACM SIGOPS Operating Systems Review
Virtual machine time travel using continuous data protection and checkpointing
ACM SIGOPS Operating Systems Review
Forensics examination of volatile system data using virtual introspection
ACM SIGOPS Operating Systems Review
A Case for High Availability in a Virtualized Environment (HAVEN)
ARES '08 Proceedings of the 2008 Third International Conference on Availability, Reliability and Security
Remus: high availability via asynchronous virtual machine replication
NSDI'08 Proceedings of the 5th USENIX Symposium on Networked Systems Design and Implementation
Enhancing the Disaster Recovery Plan Through Virtualization
Journal of Information Technology Research
Hi-index | 0.00 |
Traditional high-availability and disaster recovery solutions require proprietary hardware, complex configurations, applicationspecific logic, highly skilled personnel, and a rigorous and lengthy testing process. The resulting high costs have limited their adoption to environments with the most critical applications. However, high availability and disaster recovery are becoming increasingly important in many environments that cannot bear the complexity and the expense involved. In this paper, we show that virtualization can be used to develop solutions that meet this market demand. We describe the recently released Virtual Availability Manager (VAM) product offering, which provides simplified availability solutions using Xent-based virtualization, and which is available as part of the IBM Systems Director product. We present key design principles of VAM, explain its architecture and current capabilities, and describe the way it is being extended to enable recovery in case of disaster.