RemusDB: transparent high availability for database systems

  • Authors:
  • Umar Farooq Minhas;Shriram Rajagopalan;Brendan Cully;Ashraf Aboulnaga;Kenneth Salem;Andrew Warfield

  • Affiliations:
  • University of Waterloo, Waterloo, Canada;University of British Columbia, Vancouver, Canada;University of British Columbia, Vancouver, Canada;University of Waterloo, Waterloo, Canada;University of Waterloo, Waterloo, Canada;University of British Columbia, Vancouver, Canada

  • Venue:
  • The VLDB Journal — The International Journal on Very Large Data Bases
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we present a technique for building a high-availability (HA) database management system (DBMS). The proposed technique can be applied to any DBMS with little or no customization, and with reasonable performance overhead. Our approach is based on Remus, a commodity HA solution implemented in the virtualization layer, that uses asynchronous virtual machine state replication to provide transparent HA and failover capabilities. We show that while Remus and similar systems can protect a DBMS, database workloads incur a performance overhead of up to 32 % as compared to an unprotected DBMS. We identify the sources of this overhead and develop optimizations that mitigate the problems. We present an experimental evaluation using two popular database systems and industry standard benchmarks showing that for certain workloads, our optimized approach provides fast failover ( $$\le $$ 3 s of downtime) with low performance overhead when compared to an unprotected DBMS. Our approach provides a practical means for existing, deployed database systems to be made more reliable with a minimum of risk, cost, and effort. Furthermore, this paper invites new discussion about whether the complexity of HA is best implemented within the DBMS, or as a service by the infrastructure below it.