Transparent Fault Tolerance of Device Drivers for Virtual Machines

  • Authors:
  • Heeseung Jo;Hwanju Kim;Jae-Wan Jang;Joonwon Lee;Seungryoul Maeng

  • Affiliations:
  • Korea Advanced Institute of Science and Technology, Daejeon;Korea Advanced Institute of Science and Technology, Daejeon;Korea Advanced Institute of Science and Technology, Daejeon;SungKyunKwan University, Suwon;Korea Advanced Institute of Science and Technology, Daejeon

  • Venue:
  • IEEE Transactions on Computers
  • Year:
  • 2010

Quantified Score

Hi-index 14.98

Visualization

Abstract

In a consolidated server system using virtualization, physical device accesses from guest virtual machines (VMs) need to be coordinated. In this environment, a separate driver VM is usually assigned to this task to enhance reliability and to reuse existing device drivers. This driver VM needs to be highly reliable, since it handles all the I/O requests. This paper describes a mechanism to detect and recover the driver VM from faults to enhance the reliability of the whole system. The proposed mechanism is transparent in that guest VMs cannot recognize the fault and the driver VM can recover and continue its I/O operations. Our mechanism provides a progress monitoring-based fault detection that is isolated from fault contamination with low monitoring overhead. When a fault occurs, the system recovers by switching the faulted driver VM to another one. The recovery is performed without service disconnection or data loss and with negligible delay by fully exploiting the I/O structure of the virtualized system.