LH*RS---a highly-available scalable distributed data structure

  • Authors:
  • Witold Litwin; Rim Moussa; Thomas Schwarz

  • Affiliations:
  • Université Paris Dauphine, Paris, France; Université Paris Dauphine, Paris, France; Santa Clara University, Santa Clara, CA

  • Venue:
  • ACM Transactions on Database Systems (TODS)
  • Year:
  • 2005

Abstract

LH*RS is a high-availability scalable distributed data structure (SDDS). An LH*RS file is hash partitioned over the distributed RAM of a multicomputer, for example, a network of PCs, and tolerates the unavailability of any k ≥ 1 of its server nodes. The value of k transparently grows with the file to offset the reliability decline. Only the number of storage nodes potentially limits the file growth. The high-availability management uses a novel parity calculus that we have developed, based on Reed-Solomon erasure-correcting coding. The resulting parity storage overhead is about the lowest possible. The parity encoding and decoding are faster than for any other candidate coding we are aware of. We present our scheme and its performance analysis, including experiments with a prototype implementation on Wintel PCs. The capabilities of LH*RS offer new perspectives to data-intensive applications, including the emerging ones of grids and of P2P computing.
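
To make the idea of parity-based k-availability concrete, here is a minimal sketch of an erasure code in the Reed-Solomon family for k = 2, using the P+Q construction familiar from RAID-6. It is not the parity calculus of the paper: the abstract specifies neither the Galois field nor the parity matrix, so the field GF(2^8), the polynomial 0x11d, and the names `encode` and `recover_two` are all assumptions chosen for illustration. Each record stands for the content of one data bucket in a group, and the two parity records would live on separate parity servers.

```python
# Illustrative only: P+Q erasure code over GF(2^8), NOT the LH*RS parity calculus.
# Assumed field: GF(2^8) with primitive polynomial x^8+x^4+x^3+x^2+1 (0x11d).
EXP = [0] * 510
LOG = [0] * 256
x = 1
for i in range(255):
    EXP[i] = x
    LOG[x] = i
    x <<= 1
    if x & 0x100:
        x ^= 0x11D
for i in range(255, 510):
    EXP[i] = EXP[i - 255]  # duplicate so LOG sums need no modulo


def gf_mul(a, b):
    """Multiply two field elements via log/antilog tables."""
    if a == 0 or b == 0:
        return 0
    return EXP[LOG[a] + LOG[b]]


def gf_div(a, b):
    """Divide a by a nonzero field element b."""
    if b == 0:
        raise ZeroDivisionError("division by zero in GF(2^8)")
    if a == 0:
        return 0
    return EXP[(LOG[a] - LOG[b]) % 255]


def encode(group):
    """Return parity records (p, q) for equal-length data records.

    p is the plain XOR of the records; q weights record i by 2^i in the field.
    """
    n = len(group[0])
    p, q = bytearray(n), bytearray(n)
    for i, rec in enumerate(group):
        for j, byte in enumerate(rec):
            p[j] ^= byte
            q[j] ^= gf_mul(EXP[i], byte)
    return bytes(p), bytes(q)


def recover_two(group, a, b, p, q):
    """Rebuild the records at positions a and b (given as None in group)."""
    n = len(p)
    ps, qs = bytearray(p), bytearray(q)
    for i, rec in enumerate(group):
        if rec is None:
            continue
        for j, byte in enumerate(rec):
            ps[j] ^= byte                 # leaves d_a + d_b
            qs[j] ^= gf_mul(EXP[i], byte)  # leaves 2^a*d_a + 2^b*d_b
    denom = EXP[a] ^ EXP[b]  # nonzero because a != b
    da = bytes(gf_div(qs[j] ^ gf_mul(EXP[b], ps[j]), denom) for j in range(n))
    db = bytes(ps[j] ^ da[j] for j in range(n))
    return da, db


group = [b"alpha", b"bravo", b"delta", b"gamma"]
p, q = encode(group)
damaged = [group[0], None, group[2], None]  # two data buckets unavailable
restored = recover_two(damaged, 1, 3, p, q)
```

With four data records and two parity records, any two lost data records can be rebuilt, at a storage overhead of 2/4. Raising k means adding further parity records with independent coefficient rows, which is the kind of growth the abstract describes.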