DiskReduce: RAID for data-intensive scalable computing

Authors:
Bin Fan;Wittawat Tantisiriroj;Lin Xiao;Garth Gibson
Affiliations:
Carnegie Mellon University;Carnegie Mellon University;Carnegie Mellon University;Carnegie Mellon University
Venue:
Proceedings of the 4th Annual Workshop on Petascale Data Storage
Year:
2009

Abstract

Data-intensive file systems, developed for Internet services and popular in cloud computing, provide high reliability and availability by replicating data, typically keeping three copies of everything. In contrast, high-performance computing, which operates at comparable scale, and smaller-scale enterprise storage systems achieve similar tolerance of multiple failures with lower-overhead erasure-coded, or RAID, organizations. DiskReduce is a modification of the Hadoop Distributed File System (HDFS) that asynchronously compresses initially triplicated data down to RAID-class redundancy overheads. In addition to increasing a cluster's storage capacity as seen by its users by up to a factor of three, DiskReduce can delay encoding long enough to deliver the performance benefits of multiple data copies.
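
The "up to a factor of three" figure follows from simple overhead arithmetic, which the minimal sketch below works through. It assumes a hypothetical double-parity layout (N data blocks plus 2 parity blocks per group), chosen because it tolerates two failures, as triplication does. The abstract specifies neither the group sizes nor the code, and the function names are illustrative; none of them come from HDFS or DiskReduce.

```python
from fractions import Fraction

# Raw blocks stored per block of user data under 3-way replication.
TRIPLICATION = Fraction(3, 1)


def raw_blocks_per_data_block(data_blocks: int, parity_blocks: int) -> Fraction:
    """Raw storage consumed per user data block in a (data + parity) group."""
    if data_blocks <= 0 or parity_blocks < 0:
        raise ValueError("need at least one data block and no negative parity")
    return Fraction(data_blocks + parity_blocks, data_blocks)


def capacity_gain(data_blocks: int, parity_blocks: int) -> Fraction:
    """Usable-capacity multiplier relative to triplication (hypothetical helper)."""
    return TRIPLICATION / raw_blocks_per_data_block(data_blocks, parity_blocks)


if __name__ == "__main__":
    # Group sizes are assumptions for illustration only.
    for n in (4, 8, 16, 64):
        gain = capacity_gain(n, 2)
        print(f"{n} data + 2 parity: {float(gain):.2f}x usable capacity")
```

Under these assumptions the gain is 3N / (N + 2): it equals 2 for groups of four data blocks and approaches, but never reaches, 3 as the group grows, which is consistent with the abstract's "up to" wording.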