Towards transparent hardening of distributed systems

  • Authors:
  • Diogo Behrens;Christof Fetzer;Flavio P. Junqueira;Marco Serafini

  • Affiliations:
  • TU Dresden, Dresden, Germany;TU Dresden, Dresden, Germany;Microsoft Research, Cambridge, UK;Qatar Computing Research Institute, Doha, Qatar

  • Venue:
  • Proceedings of the 9th Workshop on Hot Topics in Dependable Systems
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

In distributed systems, errors such as data corruption or arbitrary changes to the flow of programs might cause processes to propagate incorrect state across the system. To prevent error propagation in such systems, an efficient and effective technique is to harden processes against Arbitrary State Corruption (ASC) faults through local detection, without replication. For distributed systems designed from scratch, dealing with state corruption can be made fully transparent, but requires that developers follow a few concrete design patterns. In this paper, we discuss the problem of hardening existing code bases of distributed systems transparently. Existing systems have not been designed with ASC hardening in mind, so they do not necessarily follow required design patterns. For such systems, we focus here on both performance and number of changes to the existing code base. Using memcached as an example, we identify and discuss three areas of improvement: reducing the memory overhead, improving access to state variables, and supporting multi-threading. Our initial evaluation of memcached shows that our ASC-hardened version obtains a throughput that is roughly 76% of the throughput of stock memcached with 128-byte and 1k-byte messages.