Maintaining Network QoS Across NIC Device Driver Failures Using Virtualization

  • Authors:
  • Michael Le, Andrew Gallagher, Yuval Tamir, Yoshio Turner

  • Venue:
  • NCA '09 Proceedings of the 2009 Eighth IEEE International Symposium on Network Computing and Applications
  • Year:
  • 2009

Abstract

Device driver failures have been shown to be a major cause of system failures. Network services stress NIC device drivers, increasing the probability of NIC driver bugs being manifested as server failures. System virtualization is increasingly used for server consolidation and management. The isolated driver domain (IDD) architecture used by several virtual machine monitors, such as Xen, forms a natural foundation for making systems resilient to NIC driver failures. In order to realize this potential, recovery must be fast enough to maintain QoS for network services across NIC driver failures. We show that the standard Xen configuration, enhanced with simple detection and recovery mechanisms, cannot provide such QoS. However, with NIC drivers isolated in two virtual machines, in a primary/warm-spare configuration, the system can recover from an overwhelming majority of NIC driver failures in under 10 ms.