Higher reliability redundant disk arrays: Organization, operation, and coding

Authors:
Alexander Thomasian;Mario Blaum
Affiliations:
Thomasian and Associates, Pleasantville, NY;Universidad Complutense de Madrid (UCM), Madrid, Spain
Venue:
ACM Transactions on Storage (TOS)
Year:
2009

Citing 141
Cited 10

A fast file system for UNIX

ACM Transactions on Computer Systems (TOCS)
Quantitative system performance: computer system analysis using queueing network models

Quantitative system performance: computer system analysis using queueing network models
Disk cache—miss ratio analysis and design considerations

ACM Transactions on Computer Systems (TOCS)
Approximate Analysis of Fork/Join Synchronization in Parallel Queues

IEEE Transactions on Computers
A case for redundant arrays of inexpensive disks (RAID)

SIGMOD '88 Proceedings of the 1988 ACM SIGMOD international conference on Management of data
The placement optimization program: a practical solution to the disk file assignment problem

SIGMETRICS '89 Proceedings of the 1989 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Analysis of file I/O traces in commercial computing environments

SIGMETRICS '92/PERFORMANCE '92 Proceedings of the 1992 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Redundant disk arrays: reliable, parallel secondary storage

Redundant disk arrays: reliable, parallel secondary storage
Distributed sparing in disk arrays

COMPCON '92 Proceedings of the thirty-seventh international conference on COMPCON
The design and implementation of a log-structured file system

ACM Transactions on Computer Systems (TOCS)
The Performance of Parity Placements in Disk Arrays

IEEE Transactions on Computers
Designing disk arrays for high data reliability

Journal of Parallel and Distributed Computing - Special issue on parallel I/O systems
Floating parity and data disk arrays

Journal of Parallel and Distributed Computing - Special issue on parallel I/O systems
Reliability analysis of redundant arrays of inexpensive disks

Journal of Parallel and Distributed Computing - Special issue on parallel I/O systems
The architecture of a fault-tolerant cached RAID controller

ISCA '93 Proceedings of the 20th annual international symposium on computer architecture
An introduction to disk drive modeling

Computer
Background data movement in a log-structured disk subsystem

IBM Journal of Research and Development
RAID: high-performance, reliable secondary storage

ACM Computing Surveys (CSUR)
Sparing for redundant disk arrays

Distributed and Parallel Databases
Scheduling algorithms for modern disk drives

SIGMETRICS '94 Proceedings of the 1994 ACM SIGMETRICS conference on Measurement and modeling of computer systems
Performance of RAID5 disk arrays with read and write caching

Distributed and Parallel Databases - Special issue on disk arrays
Architectures and algorithms for on-line failure recovery in redundant disk arrays

Distributed and Parallel Databases - Special issue on disk arrays
Parity logging disk arrays

ACM Transactions on Computer Systems (TOCS)
Crosshatch disk array for improved reliability and performance

ISCA '94 Proceedings of the 21st annual international symposium on Computer architecture
On-line data reconstruction in redundant disk arrays

On-line data reconstruction in redundant disk arrays
EVENODD: An Efficient Scheme for Tolerating Double Disk Failures in RAID Architectures

IEEE Transactions on Computers - Special issue on fault-tolerant computing
Reliability and performance of disk arrays

Reliability and performance of disk arrays
The HP AutoRAID hierarchical storage system

ACM Transactions on Computer Systems (TOCS) - Special issue on operating system principles
Performance and reliability analysis of computer systems: an example-based approach using the SHARPE software package

Performance and reliability analysis of computer systems: an example-based approach using the SHARPE software package
Hot mirroring: a method of hiding parity update penalty and degradation during rebuilds for RAID5

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
A Performance Evaluation of RAID Architectures

IEEE Transactions on Computers
Self-similarity through high-variability: statistical analysis of Ethernet LAN traffic at the source level

IEEE/ACM Transactions on Networking (TON)
On Variable Scope of Parity Protection in Disk Arrays

IEEE Transactions on Computers
Tolerating multiple failures in RAID architectures with optimal storage and uniform declustering

Proceedings of the 24th annual international symposium on Computer architecture
A tutorial on Reed-Solomon coding for fault-tolerance in RAID-like systems

Software—Practice & Experience
Destage Algorithms for Disk Arrays with Nonvolatile Caches

IEEE Transactions on Computers
Self-similarity in file systems

SIGMETRICS '98/PERFORMANCE '98 Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
An analytic behavior model for disk drives with readahead caches and request reordering

SIGMETRICS '98/PERFORMANCE '98 Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Declustered disk array architectures with optimal and near-optimal parallelism

Proceedings of the 25th annual international symposium on Computer architecture
Using System-Level Models to Evaluate I/O Subsystem Designs

IEEE Transactions on Computers
Modeling and optimizing I/O throughput of multiple disks on a bus

SIGMETRICS '99 Proceedings of the 1999 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Best-fit bin-packing with random order

Proceedings of the seventh annual ACM-SIAM symposium on Discrete algorithms
Modeling and performance of MEMS-based storage devices

Proceedings of the 2000 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
MEMS-based integrated-circuit mass-storage systems

Communications of the ACM
The fractal structure of data reference: applications to the memory hierarchy

The fractal structure of data reference: applications to the memory hierarchy
Algorithm 235: Random permutation

Communications of the ACM
Reliability and performance of hierarchical RAID with multiple controllers

Proceedings of the twentieth annual ACM symposium on Principles of distributed computing
Probability and statistics with reliability, queuing and computer science applications

Probability and statistics with reliability, queuing and computer science applications
Minerva: An automated resource provisioning tool for large-scale storage systems

ACM Transactions on Computer Systems (TOCS)
Cello: A Disk Scheduling Framework for Next Generation Operating Systems*

Real-Time Systems - Flexible Scheduling on Real-Time Systems
Analysis of methods for scheduling low priority disk drive tasks

SIGMETRICS '02 Proceedings of the 2002 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Some new disk scheduling policies and their performance

SIGMETRICS '02 Proceedings of the 2002 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Computer Performance Modeling Handbook

Computer Performance Modeling Handbook
Database Management Systems

Database Management Systems
Advances in Disk Technology: Performance Issues

Computer
Uniform Parity Group Distribution in Disk Arrays with Multiple Failures

IEEE Transactions on Computers
Analytic Modeling of Clustered RAID with Mapping Based on Nearly Random Permutation

IEEE Transactions on Computers
Efficient Placement of Parity and Data to Tolerate Two Disk Failures in Disk Array Systems

IEEE Transactions on Parallel and Distributed Systems
RAID5 Performance with Distributed Sparing

IEEE Transactions on Parallel and Distributed Systems
Aqueduct: Online Data Migration with Performance Guarantees

FAST '02 Proceedings of the Conference on File and Storage Technologies
Track-Aligned Extents: Matching Access Patterns to Disk Drive Characteristics

FAST '02 Proceedings of the Conference on File and Storage Technologies
Freeblock Scheduling Outside of Disk Firmware

FAST '02 Proceedings of the Conference on File and Storage Technologies
Performance Analysis of RAID5 Disk Arrays with a Vacationing Server Model for Rebuild Mode Operation

Proceedings of the Tenth International Conference on Data Engineering
Performance Analysis of Disk Arrays under Failure

VLDB '90 Proceedings of the 16th International Conference on Very Large Data Bases
Parity Striping of Disk Arrays: Low-Cost Reliable Storage with Acceptable Throughput

VLDB '90 Proceedings of the 16th International Conference on Very Large Data Bases
On the foundations of artificial workload design

SIGMETRICS '84 Proceedings of the 1984 ACM SIGMETRICS conference on Measurement and modeling of computer systems
The RAID Configuration Tool

HIPC '96 Proceedings of the Third International Conference on High-Performance Computing (HiPC '96)
Simulation study of cached RAID5 designs

HPCA '95 Proceedings of the 1st IEEE Symposium on High-Performance Computer Architecture
Permutation Development Data Layout (PDDL)

HPCA '99 Proceedings of the 5th International Symposium on High Performance Computer Architecture
A performance comparison of RAID-5 and log-structured arrays

HPDC '95 Proceedings of the 4th IEEE International Symposium on High Performance Distributed Computing
A New Approach in the Modeling and Generation of Synthetic Disk Workload

MASCOTS '00 Proceedings of the 8th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems
Rebuild options in RAID5 disk arrays

SPDP '95 Proceedings of the 7th IEEE Symposium on Parallel and Distributeed Processing
Rules of Thumb in Data Engineering

ICDE '00 Proceedings of the 16th International Conference on Data Engineering
A Modular, Analytical Throughput Model for Modern Disk Arrays

MASCOTS '01 Proceedings of the Ninth International Symposium in Modeling, Analysis and Simulation of Computer and Telecommunication Systems
Issues and Challenges in the Performance Analysis of Real Disk Arrays

IEEE Transactions on Parallel and Distributed Systems
Adaptive cross-parity (AXP) code for a high-density magnetic tape subsystem

IBM Journal of Research and Development - High-density magnetic recording
The performance impact of I/O optimizations and disk improvements

IBM Journal of Research and Development
Characteristics of I/O traffic in personal computer and server workloads

IBM Systems Journal
Clustered Scheduling Algorithms for Mixed-Media Disk Workloads in a Multimedia Server

Cluster Computing
A Performance Evaluation Tool for RAID Disk Arrays

QEST '04 Proceedings of the The Quantitative Evaluation of Systems, First International Conference
Disk Scrubbing in Large Archival Storage Systems

MASCOTS '04 Proceedings of the The IEEE Computer Society's 12th Annual International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunications Systems
Note: Correction to the 1997 tutorial on Reed–Solomon coding

Software—Practice & Experience - Research Articles
Reconstruct versus read-modify writes in RAID

Information Processing Letters
New Efficient MDS Array Codes for RAID Part I: Reed-Solomon-Like Codes for Tolerating Three Disk Failures

IEEE Transactions on Computers
Storage Bricks Have Arrived

FAST '02 Proceedings of the 1st USENIX Conference on File and Storage Technologies
Awarded Best Paper! - Using MEMS-Based Storage in Disk Arrays

FAST '03 Proceedings of the 2nd USENIX Conference on File and Storage Technologies
Façade: Virtual Storage Devices with Performance Guarantees

FAST '03 Proceedings of the 2nd USENIX Conference on File and Storage Technologies
More Than an Interface---SCSI vs. ATA

FAST '03 Proceedings of the 2nd USENIX Conference on File and Storage Technologies
Comment on "Issues and Challenges in the Performance Analysis of Real Disk Arrays'

IEEE Transactions on Parallel and Distributed Systems
Awarded Best Paper! -- Row-Diagonal Parity for Double Disk Failure Correction

FAST '04 Proceedings of the 3rd USENIX Conference on File and Storage Technologies
Disk Infant Mortality in Large Storage Systems

MASCOTS '05 Proceedings of the 13th IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems
Performance Evaluation of a Heterogeneous Disk Array Architecture

MASCOTS '05 Proceedings of the 13th IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems
New Efficient MDS Array Codes for RAID Part II: Rabin-Like Codes for Tolerating Multiple (greater than or equal to 4) Disk Failures

IEEE Transactions on Computers
Quickly finding near-optimal storage designs

ACM Transactions on Computer Systems (TOCS)
The automatic improvement of locality in storage systems

ACM Transactions on Computer Systems (TOCS)
Zygaria: Storage Performance as a Managed Resource

RTAS '06 Proceedings of the 12th IEEE Real-Time and Embedded Technology and Applications Symposium
Comment on "RAID5 Performance with Distributed Sparing'

IEEE Transactions on Parallel and Distributed Systems
HoVer Erasure Codes For Disk Arrays

DSN '06 Proceedings of the International Conference on Dependable Systems and Networks
Reliability for Networked Storage Nodes

DSN '06 Proceedings of the International Conference on Dependable Systems and Networks
Multi-level RAID for very large disk arrays

ACM SIGMETRICS Performance Evaluation Review - Design, implementation, and performance of storage systems
A page fault equation for modeling the effect of memory size

Performance Evaluation
Reliability of modular mesh-connected intelligent storage brick systems

IBM Journal of Research and Development
Optimizing Cauchy Reed-Solomon Codes for Fault-Tolerant Network Storage Applications

NCA '06 Proceedings of the Fifth IEEE International Symposium on Network Computing and Applications
Clustered RAID Arrays and Their Access Costs

The Computer Journal
Mirrored Disk Organization Reliability Analysis

IEEE Transactions on Computers
Computer Architecture, Fourth Edition: A Quantitative Approach

Computer Architecture, Fourth Edition: A Quantitative Approach
Simulation with Arena with CD (McGraw-Hill Series in Industrial Engineering and Management)

Simulation with Arena with CD (McGraw-Hill Series in Industrial Engineering and Management)
Analysis of Rebuild Processing in RAID5 Disk Arrays

The Computer Journal
Matrix methods for lost data reconstruction in erasure codes

FAST'05 Proceedings of the 4th conference on USENIX Conference on File and Storage Technologies - Volume 4
WEAVER codes: highly fault tolerant erasure codes for storage systems

FAST'05 Proceedings of the 4th conference on USENIX Conference on File and Storage Technologies - Volume 4
On multidimensional data and modern disks

FAST'05 Proceedings of the 4th conference on USENIX Conference on File and Storage Technologies - Volume 4
Enhanced Reliability Modeling of RAID Storage Systems

DSN '07 Proceedings of the 37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks
An analysis of latent sector errors in disk drives

Proceedings of the 2007 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Performance of Two-Disk Failure-Tolerant Disk Arrays

IEEE Transactions on Computers
File system design for an NFS file server appliance

WTEC'94 Proceedings of the USENIX Winter 1994 Technical Conference on USENIX Winter 1994 Technical Conference
An implementation of a log-structured file system for UNIX

USENIX'93 Proceedings of the USENIX Winter 1993 Conference Proceedings on USENIX Winter 1993 Conference Proceedings
Failure trends in a large disk drive population

FAST '07 Proceedings of the 5th USENIX conference on File and Storage Technologies
PRO: a popularity-based multi-threaded reconstruction optimization for RAID-structured storage systems

FAST '07 Proceedings of the 5th USENIX conference on File and Storage Technologies
Understanding disk failure rates: What does an MTTF of 1,000,000 hours mean to you?

ACM Transactions on Storage (TOS)
A Double Track Error-Correction Code for Magnetic Tape

IEEE Transactions on Computers
A Unified Reliability Model for Fault-Tolerant Computers

IEEE Transactions on Computers
A new intra-disk redundancy scheme for high-reliability RAID storage systems in the presence of unrecoverable errors

ACM Transactions on Storage (TOS)
The RAID-6 liberation codes

FAST'08 Proceedings of the 6th USENIX Conference on File and Storage Technologies
An analysis of data corruption in the storage stack

FAST'08 Proceedings of the 6th USENIX Conference on File and Storage Technologies
Intel® Turbo Memory: Nonvolatile disk caches in the storage hierarchy of mainstream computer systems

ACM Transactions on Storage (TOS)
A nine year study of file system and storage benchmarking

ACM Transactions on Storage (TOS)
Disk scrubbing versus intra-disk redundancy for high-reliability raid storage systems

SIGMETRICS '08 Proceedings of the 2008 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Ironmodel: robust performance models in the wild
STAR: An Efficient Coding Scheme for Correcting Triple Storage Node Failures

IEEE Transactions on Computers
A New Minimum Density RAID-6 Code with a Word Size of Eight

NCA '08 Proceedings of the 2008 Seventh IEEE International Symposium on Network Computing and Applications
Reliability and Performance of Mirrored Disk Organizations

The Computer Journal
Undetected disk errors in RAID arrays

IBM Journal of Research and Development
Storage-class memory: the next storage system technology

IBM Journal of Research and Development
Effects of scheduling on file memory operations

AFIPS '67 (Spring) Proceedings of the April 18-20, 1967, spring joint computer conference
WorkOut: I/O workload outsourcing for boosting RAID reconstruction performance

FAST '09 Proccedings of the 7th conference on File and storage technologies
A performance evaluation and examination of open-source erasure coding libraries for storage

FAST '09 Proccedings of the 7th conference on File and storage technologies
Memory Systems: Cache, DRAM, Disk

Memory Systems: Cache, DRAM, Disk
MDS array codes with independent parity symbols

IEEE Transactions on Information Theory
On lowest density MDS codes

IEEE Transactions on Information Theory
X-code: MDS array codes with optimal encoding

IEEE Transactions on Information Theory
Low-density MDS codes and factors of complete graphs

IEEE Transactions on Information Theory

Disk Scrubbing Versus Intradisk Redundancy for RAID Storage Systems

ACM Transactions on Storage (TOS)
Why specialized disks for composite operations may be unnecessary

ACM SIGARCH Computer Architecture News
RAID level selection for heterogeneous disk arrays

Cluster Computing
Survey and analysis of disk scheduling methods

ACM SIGARCH Computer Architecture News
WeLe-RAID: a SSD-based RAID for system endurance and performance

NPC'11 Proceedings of the 8th IFIP international conference on Network and parallel computing
Rebuild processing in RAID5 with emphasis on the supplementary parity augmentation method[37]

ACM SIGARCH Computer Architecture News
Performance, reliability, and performability of a hybrid RAID array and a comparison with traditional RAID1 arrays

Cluster Computing
Hierarchical RAID: Design, performance, reliability, and recovery

Journal of Parallel and Distributed Computing
Effect of codeword placement on the reliability of erasure coded data storage systems

QEST'13 Proceedings of the 10th international conference on Quantitative Evaluation of Systems
Beyond MTTDL: A Closed-Form RAID 6 Reliability Equation

ACM Transactions on Storage (TOS)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Parity is a popular form of data protection in redundant arrays of inexpensive/independent disks (RAID). RAID5 dedicates one out of N disks to parity to mask single disk failures, that is, the contents of a block on a failed disk can be reconstructed by exclusive-ORing the corresponding blocks on surviving disks. RAID5 can mask a single disk failure, and it is vulnerable to data loss if a second disk failure occurs. The RAID5 rebuild process systematically reconstructs the contents of a failed disk on a spare disk, returning the system to its original state, but the rebuild process may be unsuccessful due to unreadable sectors. This has led to two disk failure tolerant arrays (2DFTs), such as RAID6 based on Reed-Solomon (RS) codes. EVENODD, RDP (Row-Diagonal-Parity), the X-code, and RM2 (Row-Matrix) are 2DFTs with parity coding. RM2 incurs a higher level of redundancy than two disks, while the X-code is limited to a prime number of disks. RDP is optimal with respect to the number of XOR operations at the encoding, but not for short write operations. For small symbol sizes EVENODD and RDP have the same disk access pattern as RAID6, while RM2 and the X-code incur a high recovery cost with two failed disks. We describe variations to RAID5 and RAID6 organizations, including clustered RAID, different methods to update parities, rebuild processing, disk scrubbing to eliminate sector errors, and the intra-disk redundancy (IDR) method to deal with sector errors. We summarize the results of recent studies of failures in hard disk drives. We describe Markov chain reliability models to estimate RAID mean time to data loss (MTTDL) taking into account sector errors and the effect of disk scrubbing. Numerical results show that RAID5 plus IDR attains the same MTTDL level as RAID6, while incurring a lower performance penalty. We conclude with a survey of analytic and simulation studies of RAID performance and tools and benchmarks for RAID performance evaluation.