Modeling the Fault Tolerance Consequences of Deduplication

  • Authors:
  • Eric W. D. Rozier;William H. Sanders;Pin Zhou;Nagapramod Mandagere;Sandeep M. Uttamchandani;Mark L. Yakushev

  • Affiliations:
  • -;-;-;-;-;-

  • Venue:
  • SRDS '11 Proceedings of the 2011 IEEE 30th International Symposium on Reliable Distributed Systems
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Modern storage systems are employing data deduplication with increasing frequency. Often the storage systems on which these techniques are deployed contain important data, and utilize fault-tolerant hardware and software to improve the reliability of the system and reduce data loss. We suggest that data deduplication introduces inter-file relationships that may have a negative impact on the fault tolerance of such systems by creating dependencies that can increase the severity of data loss events. We present a framework composed of data analysis methods and a model of data deduplication that is useful in studying the reliability impact of data deduplication. The framework is useful for determining a deduplication strategy that is estimated to satisfy a set of reliability constraints supplied by a user.