Redundant disk arrays: reliable, parallel secondary storage
Redundant disk arrays: reliable, parallel secondary storage
RAID-x: A New Distributed Disk Array for I/O-Centric Cluster Computing
HPDC '00 Proceedings of the 9th IEEE International Symposium on High Performance Distributed Computing
SOSP '03 Proceedings of the nineteenth ACM symposium on Operating systems principles
Total recall: system support for automated availability management
NSDI'04 Proceedings of the 1st conference on Symposium on Networked Systems Design and Implementation - Volume 1
Ceph: a scalable, high-performance distributed file system
OSDI '06 Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementation - Volume 7
Measurement and analysis of large-scale network file system workloads
ATC'08 USENIX 2008 Annual Technical Conference on Annual Technical Conference
A performance evaluation and examination of open-source erasure coding libraries for storage
FAST '09 Proccedings of the 7th conference on File and storage technologies
Data warehousing and analytics infrastructure at facebook
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Middleware enabled data sharing on cloud storage services
Proceedings of the 5th International Workshop on Middleware for Service Oriented Computing
Row-diagonal parity for double disk failure correction
FAST'04 Proceedings of the 3rd USENIX conference on File and storage technologies
A file is not a file: understanding the I/O behavior of Apple desktop applications
SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
Windows Azure Storage: a highly available cloud storage service with strong consistency
SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
Characteristics of backup workloads in production systems
FAST'12 Proceedings of the 10th USENIX conference on File and Storage Technologies
NCCloud: applying network coding for the storage repair in a cloud-of-clouds
FAST'12 Proceedings of the 10th USENIX conference on File and Storage Technologies
A Quantitative Evaluation Model for Choosing Efficient Redundancy Strategies over Clouds
NAS '12 Proceedings of the 2012 IEEE Seventh International Conference on Networking, Architecture, and Storage
Hi-index | 0.00 |
Cloud storage systems commonly use replication of stored data sets to ensure high reliability and availability. However, the high storage overhead of replication becomes increasingly unacceptable with the explosive growth of data stored in cloud. Some cloud storage systems have attempted to replace replication with erasure coding to reduce storage overhead, that is just the thinking behind Cloud RAID. A well-designed Cloud RAID mechanism should achieve the right tradeoffs between storage efficiency, performance, and reliability. As there exists no widely-accepted methods for Cloud RAID, we present a workloads-based Cloud RAID schema篓CSelective Cloud RAID (SCR for short). SCR treats primary storage and backup storage with different RAIDmethods, the former at the level of directories, and the latter at the level of individual files. SCR has three distinct advantages over previous attempts at Cloud RAID: (1) it can significantly reduce the storage overhead compared with threeway replication, (2) it can avoid most cases of the "small write bottleneck" and simplify system maintenance, (3) its implementation is modular, therefore, it is easy to configure different erasure codes for different workloads. Additionally, we have implemented a SCR prototype with RDP code, which shows significant benefits over Blaum-Roth codes in degraded read performance. To verify the effectiveness of SCR, we perform theoretical analysis and elaborate benchmark tests to evaluate the performance of SCR prototype.