On the efficiency of durable state machine replication

  • Authors:
  • Alysson Bessani;Marcel Santos;João Felix;Nuno Neves;Miguel Correia

  • Affiliations:
  • FCUL, LaSIGE, University of Lisbon, Portugal;FCUL, LaSIGE, University of Lisbon, Portugal;FCUL, LaSIGE, University of Lisbon, Portugal;FCUL, LaSIGE, University of Lisbon, Portugal;INESC-ID, IST, University of Lisbon, Portugal

  • Venue:
  • USENIX ATC'13 Proceedings of the 2013 USENIX conference on Annual Technical Conference
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

State Machine Replication (SMR) is a fundamental technique for ensuring the dependability of critical services in modern internet-scale infrastructures. SMR alone does not protect from full crashes, and thus in practice it is employed together with secondary storage to ensure the durability of the data managed by these services. In this work we show that the classical durability enforcing mechanisms - logging, checkpointing, state transfer - can have a high impact on the performance of SMR-based services even if SSDs are used instead of disks. To alleviate this impact, we propose three techniques that can be used in a transparent manner, i.e., without modifying the SMR programming model or requiring extra resources: parallel logging, sequential checkpointing, and collaborative state transfer. We show the benefits of these techniques experimentally by implementing them in an open-source replication library, and evaluating them in the context of a consistent key-value store and a coordination service.