Effect of codeword placement on the reliability of erasure coded data storage systems

  • Authors:
  • Vinodh Venkatesan;Ilias Iliadis

  • Affiliations:
  • IBM Research --- Zurich, Rüschlikon, Switzerland;IBM Research --- Zurich, Rüschlikon, Switzerland

  • Venue:
  • QEST'13 Proceedings of the 10th international conference on Quantitative Evaluation of Systems
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Modern data storage systems employ advanced erasure codes to protect data from storage node failures because of their ability to provide high data reliability at high storage efficiency. In contrast to previous studies, we consider the practical case where the length of codewords in an erasure coded system is much smaller than the number of storage nodes in the system. In this case, there exists a large number of possible ways in which different codewords can be stored across the nodes of the system. In this paper, it is shown that a declustered placement of codewords can significantly improve system reliability compared to other placement schemes. A detailed reliability analysis is presented that accounts for the rebuild times involved, the amounts of partially rebuilt data when additional nodes fail during rebuild, and an intelligent rebuild process that attempts to rebuild the most critical codewords first.