Hierarchical Codes: How to Make Erasure Codes Attractive for Peer-to-Peer Storage Systems

Authors:
Alessandro Duminuco;Ernst Biersack
Affiliations:
-;-
Venue:
P2P '08 Proceedings of the 2008 Eighth International Conference on Peer-to-Peer Computing
Year:
2008

Citing 0
Cited 10

Self-organized Data Redundancy Management for Peer-to-Peer Storage Systems

IWSOS '09 Proceedings of the 4th IFIP TC 6 International Workshop on Self-Organizing Systems
Optimizing peer-to-peer backup using lifetime estimations

Proceedings of the 2009 EDBT/ICDT Workshops
A quantitative analysis of redundancy schemes for peer-to- peer storage systems

SSS'10 Proceedings of the 12th international conference on Stabilization, safety, and security of distributed systems
Towards the design of optimal data redundancy schemes for heterogeneous cloud storage infrastructures

Computer Networks: The International Journal of Computer and Telecommunications Networking
Integrated tools for the simulation analysis of peer-to-peer backup systems

Proceedings of the 5th International ICST Conference on Simulation Tools and Techniques
Redundantly grouped cross-object coding for repairable storage

Proceedings of the Asia-Pacific Workshop on Systems
Redundantly grouped cross-object coding for repairable storage

APSys'12 Proceedings of the Third ACM SIGOPS Asia-Pacific conference on Systems
An overview of codes tailor-made for better repairability in networked distributed storage systems

ACM SIGACT News
In-network redundancy generation for opportunistic speedup of data backup

Future Generation Computer Systems
The state of peer-to-peer network simulators

ACM Computing Surveys (CSUR)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Redundancy is the basic technique to provide reliability in storage systems consisting of multiple components. A redundancy scheme defines how the redundant data are produced and maintained. The simplest redundancy scheme is replication, which however suffers from storage inefficiency. Another approach is erasure coding, which provides the same level of reliability as replication using a significantly smaller amount of storage. When redundant data are lost, they need to be replaced. While replacing replicated data consists in a simple copy, it becomes a complex operation with erasure codes: new data are produced performing a coding over some other available data. The amount of data to be read and coded is d times larger than the amount of data produced. This implies that coding has a larger computational and I/O cost, which, for distributed storage systems, translates into increased network traffic. Participants of Peer-to-Peer systems have ample storage and CPU power, but their network bandwidth may be limited. For these reasons existing coding techniques are not suitable for P2P storage. This work explores the design space between replication and the existing erasure codes. We propose and evaluate a new class of erasure codes, called Hierarchical Codes, which aims at finding a flexible trade-off that allows the reduction of the network traffic due to maintenance without losing the benefits given by traditional codes.