Reducing Storage Overhead with Small Write Bottleneck Avoiding in Cloud RAID System

Authors:
Jianzong Wang;Weijiao Gong;Peter Varman;Changsheng Xie
Affiliations:
-;-;-;-
Venue:
GRID '12 Proceedings of the 2012 ACM/IEEE 13th International Conference on Grid Computing
Year:
2012

Citing 15
Cited 0

Redundant disk arrays: reliable, parallel secondary storage

Redundant disk arrays: reliable, parallel secondary storage
RAID-x: A New Distributed Disk Array for I/O-Centric Cluster Computing

HPDC '00 Proceedings of the 9th IEEE International Symposium on High Performance Distributed Computing
The Google file system

SOSP '03 Proceedings of the nineteenth ACM symposium on Operating systems principles
Total recall: system support for automated availability management

NSDI'04 Proceedings of the 1st conference on Symposium on Networked Systems Design and Implementation - Volume 1
Ceph: a scalable, high-performance distributed file system

OSDI '06 Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementation - Volume 7
Measurement and analysis of large-scale network file system workloads

ATC'08 USENIX 2008 Annual Technical Conference on Annual Technical Conference
A performance evaluation and examination of open-source erasure coding libraries for storage

FAST '09 Proccedings of the 7th conference on File and storage technologies
Data warehousing and analytics infrastructure at facebook

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Middleware enabled data sharing on cloud storage services

Proceedings of the 5th International Workshop on Middleware for Service Oriented Computing
Row-diagonal parity for double disk failure correction

FAST'04 Proceedings of the 3rd USENIX conference on File and storage technologies
A file is not a file: understanding the I/O behavior of Apple desktop applications

SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
Windows Azure Storage: a highly available cloud storage service with strong consistency

SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
Characteristics of backup workloads in production systems

FAST'12 Proceedings of the 10th USENIX conference on File and Storage Technologies
NCCloud: applying network coding for the storage repair in a cloud-of-clouds

FAST'12 Proceedings of the 10th USENIX conference on File and Storage Technologies
A Quantitative Evaluation Model for Choosing Efficient Redundancy Strategies over Clouds

NAS '12 Proceedings of the 2012 IEEE Seventh International Conference on Networking, Architecture, and Storage

Quantified Score

Hi-index	0.00

Visualization

Abstract

Cloud storage systems commonly use replication of stored data sets to ensure high reliability and availability. However, the high storage overhead of replication becomes increasingly unacceptable with the explosive growth of data stored in cloud. Some cloud storage systems have attempted to replace replication with erasure coding to reduce storage overhead, that is just the thinking behind Cloud RAID. A well-designed Cloud RAID mechanism should achieve the right tradeoffs between storage efficiency, performance, and reliability. As there exists no widely-accepted methods for Cloud RAID, we present a workloads-based Cloud RAID schema篓CSelective Cloud RAID (SCR for short). SCR treats primary storage and backup storage with different RAIDmethods, the former at the level of directories, and the latter at the level of individual files. SCR has three distinct advantages over previous attempts at Cloud RAID: (1) it can significantly reduce the storage overhead compared with threeway replication, (2) it can avoid most cases of the "small write bottleneck" and simplify system maintenance, (3) its implementation is modular, therefore, it is easy to configure different erasure codes for different workloads. Additionally, we have implemented a SCR prototype with RDP code, which shows significant benefits over Blaum-Roth codes in degraded read performance. To verify the effectiveness of SCR, we perform theoretical analysis and elaborate benchmark tests to evaluate the performance of SCR prototype.