SD codes: erasure codes designed for how storage systems really fail

  • Authors:
  • James S. Plank;Mario Blaum;James L. Hafner

  • Affiliations:
  • EECS Department, University of Tennessee;IBM Research Division, Almaden Research Center;IBM Research Division, Almaden Research Center

  • Venue:
  • FAST'13 Proceedings of the 11th USENIX conference on File and Storage Technologies
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Traditionally, when storage systems employ erasure codes, they are designed to tolerate the failures of entire disks. However, the most common types of failures are latent sector failures, which only affect individual disk sectors, and block failures which arise through wear on SSD's. This paper introduces SD codes, which are designed to tolerate combinations of disk and sector failures. As such, they consume far less storage resources than traditional erasure codes. We specify the codes with enough detail for the storage practitioner to employ them, discuss their practical properties, and detail an open-source implementation.