BlobSeer: how to enable efficient versioning for large object storage under heavy access concurrency

  • Authors:
  • Bogdan Nicolae;Gabriel Antoniu;Luc Bougé

  • Affiliations:
  • University of Rennes 1, IRISA, Rennes, France;INRIA, Centre Rennes - Bretagne Atlantique, IRISA, Rennes, France;ENS Cachan/Brittany, IRISA, France

  • Venue:
  • Proceedings of the 2009 EDBT/ICDT Workshops
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

To accommodate the needs of large-scale distributed P2P systems, scalable data management strategies are required, allowing applications to efficiently cope with continuously growing, highly distributed data. This paper addresses the problem of efficiently storing and accessing very large binary data objects (blobs). It proposes an efficient versioning scheme allowing a large number of clients to concurrently read, write and append data to huge blobs that are fragmented and distributed at a very large scale. Scalability under heavy concurrency is achieved thanks to an original metadata scheme, based on a distributed segment tree built on top of a Distributed Hash Table (DHT). Our approach has been implemented and experimented within our BlobSeer prototype on the Grid'5000 testbed, using up to 175 nodes.