Optimizing the Performance of Virtual Machine Synchronization for Fault Tolerance

Authors:
Jun Zhu;Zhefu Jiang;Zhen Xiao;Xiaoming Li
Affiliations:
Peking University, Beijing;Peking University, Beijing;Peking University, Beijing;State Key Laboratory of Software Development Environment, MOST, China
Venue:
IEEE Transactions on Computers
Year:
2011

Citing 0
Cited 3

Safe side effects commitment for OS-level virtualization

Proceedings of the 8th ACM international conference on Autonomic computing
A medical image file accessing system with virtualization fault tolerance on cloud

GPC'12 Proceedings of the 7th international conference on Advances in Grid and Pervasive Computing
Cost-Benefit Analysis of Virtualizing Batch Systems: Performance-Energy-Dependability Trade-Offs

UCC '13 Proceedings of the 2013 IEEE/ACM 6th International Conference on Utility and Cloud Computing

Quantified Score

Hi-index	14.98

Visualization

Abstract

Hypervisor-based fault tolerance (HBFT), which synchronizes the state between the primary VM and the backup VM at a high frequency of tens to hundreds of milliseconds, is an emerging approach to sustaining mission-critical applications. Based on virtualization technology, HBFT provides an economic and transparent fault tolerant solution. However, the advantages currently come at the cost of substantial performance overhead during failure-free, especially for memory intensive applications. This paper presents an in-depth examination of HBFT and options to improve its performance. Based on the behavior of memory accesses among checkpointing epochs, we introduce two optimizations, read-fault reduction and write-fault prediction, for the memory tracking mechanism. These two optimizations improve the performance by 31 percent and 21 percent, respectively, for some applications. Then, we present software superpage which efficiently maps large memory regions between virtual machines (VM). Our optimization improves the performance of HBFT by a factor of 1.4 to 2.2 and achieves about 60 percent of that of the native VM.