RemusDB: transparent high availability for database systems

Authors:
Umar Farooq Minhas;Shriram Rajagopalan;Brendan Cully;Ashraf Aboulnaga;Kenneth Salem;Andrew Warfield
Affiliations:
University of Waterloo, Waterloo, Canada;University of British Columbia, Vancouver, Canada;University of British Columbia, Vancouver, Canada;University of Waterloo, Waterloo, Canada;University of Waterloo, Waterloo, Canada;University of British Columbia, Vancouver, Canada
Venue:
The VLDB Journal — The International Journal on Very Large Data Bases
Year:
2013

Citing 21
Cited 5

Optimistic recovery in distributed systems

ACM Transactions on Computer Systems (TOCS)
ARIES: a transaction recovery method supporting fine-granularity locking and partial rollbacks using write-ahead logging

ACM Transactions on Database Systems (TODS)
Evaluation of remote backup algorithms for transaction processing systems

SIGMOD '92 Proceedings of the 1992 ACM SIGMOD international conference on Management of data
Hypervisor-based fault tolerance

SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
The dangers of replication and a solution

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
A Majority consensus approach to concurrency control for multiple copy databases

ACM Transactions on Database Systems (TODS)
Transaction Processing: Concepts and Techniques

Transaction Processing: Concepts and Techniques
Don't Be Lazy, Be Consistent: Postgres-R, A New Way to Implement Database Replication

VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Weighted voting for replicated data

SOSP '79 Proceedings of the seventh ACM symposium on Operating systems principles
A "flight data recorder" for enabling full-system multiprocessor deterministic replay

Proceedings of the 30th annual international symposium on Computer architecture
Xen and the art of virtualization

SOSP '03 Proceedings of the nineteenth ACM symposium on Operating systems principles
ReVirt: enabling intrusion analysis through virtual-machine logging and replay

OSDI '02 Proceedings of the 5th symposium on Operating systems design and implementationCopyright restrictions prevent ACM from being able to make the PDFs for this conference available for downloading
TPCC-UVa: an open-source TPC-C implementation for global performance measurement of computer systems

ACM SIGMOD Record
Live migration of virtual machines

NSDI'05 Proceedings of the 2nd conference on Symposium on Networked Systems Design & Implementation - Volume 2
Execution replay of multiprocessor virtual machines

Proceedings of the fourth ACM SIGPLAN/SIGOPS international conference on Virtual execution environments
Remus: high availability via asynchronous virtual machine replication

NSDI'08 Proceedings of the 5th USENIX Symposium on Networked Systems Design and Implementation
ODR: output-deterministic replay for multicore debugging

Proceedings of the ACM SIGOPS 22nd symposium on Operating systems principles
Automatic virtual machine configuration for database workloads

ACM Transactions on Database Systems (TODS)
Respec: efficient online multiprocessor replayvia speculation and external determinism

Proceedings of the fifteenth edition of ASPLOS on Architectural support for programming languages and operating systems
SecondSite: disaster tolerance as a service

VEE '12 Proceedings of the 8th ACM SIGPLAN/SIGOPS conference on Virtual Execution Environments
RemusDB: transparent high availability for database systems

The VLDB Journal — The International Journal on Very Large Data Bases

PipeCloud: using causality to overcome speed-of-light delays in cloud-based disaster recovery

Proceedings of the 2nd ACM Symposium on Cloud Computing
SecondSite: disaster tolerance as a service

VEE '12 Proceedings of the 8th ACM SIGPLAN/SIGOPS conference on Virtual Execution Environments
RemusDB: transparent high availability for database systems

The VLDB Journal — The International Journal on Very Large Data Bases
Yank: enabling green data centers to pull the plug

nsdi'13 Proceedings of the 10th USENIX conference on Networked Systems Design and Implementation
Escape capsule: explicit state is robust and scalable

HotOS'13 Proceedings of the 14th USENIX conference on Hot Topics in Operating Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we present a technique for building a high-availability (HA) database management system (DBMS). The proposed technique can be applied to any DBMS with little or no customization, and with reasonable performance overhead. Our approach is based on Remus, a commodity HA solution implemented in the virtualization layer, that uses asynchronous virtual machine state replication to provide transparent HA and failover capabilities. We show that while Remus and similar systems can protect a DBMS, database workloads incur a performance overhead of up to 32 % as compared to an unprotected DBMS. We identify the sources of this overhead and develop optimizations that mitigate the problems. We present an experimental evaluation using two popular database systems and industry standard benchmarks showing that for certain workloads, our optimized approach provides fast failover ( $$\le $$ 3 s of downtime) with low performance overhead when compared to an unprotected DBMS. Our approach provides a practical means for existing, deployed database systems to be made more reliable with a minimum of risk, cost, and effort. Furthermore, this paper invites new discussion about whether the complexity of HA is best implemented within the DBMS, or as a service by the infrastructure below it.