Robust snapshot replication

Authors:
Uwe Röhm;Michael J. Cahill;Alan Fekete;Hyungsoo Jung;Seung Woo Baek;Mathew Rodley
Affiliations:
The University of Sydney, NSW Australia;The University of Sydney, NSW Australia;The University of Sydney, NSW Australia;The University of Sydney, NSW Australia;The University of Sydney, NSW Australia;The University of Sydney, NSW Australia
Venue:
ADC '13 Proceedings of the Twenty-Fourth Australasian Database Conference - Volume 137
Year:
2013

Citing 18
Cited 0

Serializability theory for replicated databases

Journal of Computer and System Sciences
Concurrency control and recovery in database systems

Concurrency control and recovery in database systems
A critique of ANSI SQL isolation levels

SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
The dangers of replication and a solution

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Data Replication Strategies for Fault Tolerance and Availability on Commodity Clusters

DSN '00 Proceedings of the 2000 International Conference on Dependable Systems and Networks (formerly FTCS-30 and DCCA-8)
Ganymed: scalable replication for transactional web applications

Proceedings of the 5th ACM/IFIP/USENIX international conference on Middleware
Postgres-R(SI): Combining Replica Control with Concurrency Control Based on Snapshot Isolation

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Middleware based data replication providing snapshot isolation

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Fine-grained replication and scheduling with freshness and correctness guarantees

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Database Replication Using Generalized Snapshot Isolation

SRDS '05 Proceedings of the 24th IEEE Symposium on Reliable Distributed Systems
Preventive Replication in a Database Cluster

Distributed and Parallel Databases
MIDDLE-R: Consistent database replication at the middleware level

ACM Transactions on Computer Systems (TOCS)
Transaction Time Support Inside a Database Engine

ICDE '06 Proceedings of the 22nd International Conference on Data Engineering
Lazy database replication with snapshot isolation

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
Online recovery in cluster databases

EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
Middleware-based database replication: the gaps between theory and practice

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
DBFarm: a scalable cluster for multiple databases

Proceedings of the ACM/IFIP/USENIX 2006 International Conference on Middleware
Pangea: an eager database replication middleware guaranteeing snapshot isolation without modification of database servers

Proceedings of the VLDB Endowment

Quantified Score

Hi-index	0.00

Visualization

Abstract

An important technique to ensure the scalability and availability of clustered computer systems is data replication. This paper describes a new approach to data replication management called Robust Snapshot Replication. It combines an update anywhere approach (so updates can be evaluated on any replica, spreading their load) with lazy update propagation and snapshot isolation concurrency control. The innovation is how we employ snapshot isolation in the replicas to provide consistency, fail safety, and also to achieve high scalability for both readers and up-daters, by a system design without middleware or group communication infrastructure. We implemented our approach using the PostgreSQL database system and conducted an extensive experimental evaluation with a small database cluster of 8 nodes. Our results demonstrate the scalability of our algorithm and its performance benefits as compared to a standard consistent replication system based on synchronous propagation. We also evaluated the costs for adding a new cluster node and the robustness of our approach against node failures. It shows that our approach is at a sweet-spot between scalability, consistency and availability: it offers an almost perfect speed-up and load-balancing for our 8 node cluster, while allowing dynamic extension of a cluster with new nodes, and being robust against any number of replica node failures or a master failure.