Availability of multi-object operations

  • Authors:
  • Haifeng Yu;Phillip B. Gibbons;Suman Nath

  • Affiliations:
  • Intel Research Pittsburgh / CMU;Intel Research Pittsburgh;Microsoft Research

  • Venue:
  • NSDI'06 Proceedings of the 3rd conference on Networked Systems Design & Implementation - Volume 3
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Highly-available distributed storage systems are commonly designed to optimize the availability of individual data objects, despite the fact that user level tasks typically request multiple objects. In this paper, we show that the assignment of object replicas (or fragments, in the case of erasure coding) to machines plays a dramatic role in the availability of such multi-object operations, without affecting the availability of individual objects. For example, for the TPC-H benchmark under real-world failures, we observe differences of up to four nines between popular assignments used in existing systems. Experiments using our wide-area storage system prototype, MOAT, on the PlanetLab, as well as extensive simulations, show which assignments lead to the highest availability for a given setting.