The Zebra striped network file system

Authors:
John H. Hartman;John K. Ousterhout
Affiliations:
Department of Computer Science, Gould-Simpson Building, The University of Arizona, Tucson, AZ;Sun Microsystems Laboratories, Inc., 2550 Garcia Avenue, MS UMTV29-232, Mountain View, CA
Venue:
ACM Transactions on Computer Systems (TOCS)
Year:
1995

Citing 21
Cited 57

A fast file system for UNIX

ACM Transactions on Computer Systems (TOCS)
Scale and performance in a distributed file system

ACM Transactions on Computer Systems (TOCS)
Reimplementing the Cedar file system using logging and group commit

SOSP '87 Proceedings of the eleventh ACM Symposium on Operating systems principles
Caching in the Sprite network file system

ACM Transactions on Computer Systems (TOCS)
The Sprite Network Operating System

Computer
A case for redundant arrays of inexpensive disks (RAID)

SIGMOD '88 Proceedings of the 1988 ACM SIGMOD international conference on Management of data
Coda: A Highly Available File System for a Distributed Workstation Environment

IEEE Transactions on Computers
The design and implementation of a log-structured file system

SOSP '91 Proceedings of the thirteenth ACM symposium on Operating systems principles
Measurements of a distributed file system

SOSP '91 Proceedings of the thirteenth ACM symposium on Operating systems principles
Replication in the harp file system

SOSP '91 Proceedings of the thirteenth ACM symposium on Operating systems principles
Non-volatile memory for fast, reliable file systems

ASPLOS V Proceedings of the fifth international conference on Architectural support for programming languages and operating systems
The TickerTAIP parallel RAID architecture

ISCA '93 Proceedings of the 20th annual international symposium on computer architecture
RAID-II: a high-bandwidth network file server

ISCA '94 Proceedings of the 21st annual international symposium on Computer architecture
Swift/RAID: a distributed RAID system

Computing Systems
Maximizing performance in a striped disk array

ISCA '90 Proceedings of the 17th annual international symposium on Computer Architecture
Concurrency Control in Distributed Database Systems

ACM Computing Surveys (CSUR)
Grapevine: an exercise in distributed computing

Communications of the ACM
A Case for NOW (Networks of Workstations)

IEEE Micro
Directory Reference Patterns in Hierarchical File Systems

IEEE Transactions on Knowledge and Data Engineering
The LOCUS distributed operating system

SOSP '83 Proceedings of the ninth ACM symposium on Operating systems principles
File system logging versus clustering: a performance comparison

TCON'95 Proceedings of the USENIX 1995 Technical Conference Proceedings

A brief survey of current work on network attached peripherals (extended abstract)

ACM SIGOPS Operating Systems Review
Serverless network file systems

SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
Serverless network file systems

ACM Transactions on Computer Systems (TOCS) - Special issue on operating system principles
Recovery in the Calypso file system

ACM Transactions on Computer Systems (TOCS)
Distributed file systems and distributed memory

ACM Computing Surveys (CSUR)
Petal: distributed virtual disks

Proceedings of the seventh international conference on Architectural support for programming languages and operating systems
Frangipani: a scalable distributed file system

Proceedings of the sixteenth ACM symposium on Operating systems principles
Improving the performance of log-structured file systems with adaptive methods

Proceedings of the sixteenth ACM symposium on Operating systems principles
Computing in the RAIN: A Reliable Array of Independent Nodes

IEEE Transactions on Parallel and Distributed Systems
Interposed request routing for scalable network storage

ACM Transactions on Computer Systems (TOCS)
A Demand Adaptive and Locality Aware (DALA) streaming media server cluster architecture

NOSSDAV '02 Proceedings of the 12th international workshop on Network and operating systems support for digital audio and video
Studies on striping and buffer caching issues for the software RAID file system

Journal of Systems Architecture: the EUROMICRO Journal
LH*G: A High-Availability Scalable Distributed Data Structure By Record Grouping

IEEE Transactions on Knowledge and Data Engineering
Extending Heterogeneity to RAID Level 5

Proceedings of the General Track: 2002 USENIX Annual Technical Conference
File declustering for efficient parallel I/O on networks of workstations

Cluster computing
An Efficient Caching Scheme for Software RAID File System in Workstation Clusters

HPC-ASIA '97 Proceedings of the High-Performance Computing on the Information Superhighway, HPC-Asia '97
A Scalable Architecture for Clustered Network Attached Storage

MSS '03 Proceedings of the 20 th IEEE/11 th NASA Goddard Conference on Mass Storage Systems and Technologies (MSS'03)
HAMFS File System

SRDS '99 Proceedings of the 18th IEEE Symposium on Reliable Distributed Systems
Ivy: a read/write peer-to-peer file system

ACM SIGOPS Operating Systems Review - OSDI '02: Proceedings of the 5th symposium on Operating systems design and implementation
Taking advantage of heterogeneity in disk arrays

Journal of Parallel and Distributed Computing
Using PI/OT to Support Complex Parallel I/O

IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
A Self-Organizing Storage Cluster for Parallel Data-Intensive Applications

Proceedings of the 2004 ACM/IEEE conference on Supercomputing
An Efficient Data Location Protocol for Self.organizing Storage Clusters

Proceedings of the 2003 ACM/IEEE conference on Supercomputing
Ivy: a read/write peer-to-peer file system

OSDI '02 Proceedings of the 5th symposium on Operating systems design and implementationCopyright restrictions prevent ACM from being able to make the PDFs for this conference available for downloading
Myriad: Cost-effective Disaster Tolerance

FAST '02 Proceedings of the 1st USENIX Conference on File and Storage Technologies
Metadata Efficiency in Versioning File Systems

FAST '03 Proceedings of the 2nd USENIX Conference on File and Storage Technologies
Hydra: a platform for survivable and secure data storage systems

Proceedings of the 2005 ACM workshop on Storage security and survivability
CEFT: A cost-effective, fault-tolerant parallel virtual file system

Journal of Parallel and Distributed Computing
Design, implementation and performance evaluation of a cost-effective, fault-tolerant parallel virtual file system

SNAPI '03 Proceedings of the international workshop on Storage network architecture and parallel I/Os
Constructing collaborative desktop storage caches for large scientific datasets

ACM Transactions on Storage (TOS)
Large files, small writes, and pNFS

Proceedings of the 20th annual international conference on Supercomputing
STAR: an efficient coding scheme for correcting triple storage node failures

FAST'05 Proceedings of the 4th conference on USENIX Conference on File and Storage Technologies - Volume 4
Interposed request routing for scalable network storage

OSDI'00 Proceedings of the 4th conference on Symposium on Operating System Design & Implementation - Volume 4
Swarm: a log-structured storage system for Linux

ATEC '00 Proceedings of the annual conference on USENIX Annual Technical Conference
AFRAID: a frequently redundant array of independent disks

ATEC '96 Proceedings of the 1996 annual conference on USENIX Annual Technical Conference
Low-overhead byzantine fault-tolerant storage

Proceedings of twenty-first ACM SIGOPS symposium on Operating systems principles
A high performance redundancy scheme for cluster file systems

International Journal of High Performance Computing and Networking
Smoke and mirrors: reflecting files at a geographically remote location without loss of performance

FAST '09 Proccedings of the 7th conference on File and storage technologies
Online reorganization of databases

ACM Computing Surveys (CSUR)
R-ADMAD: high reliability provision for large-scale de-duplication archival storage systems

Proceedings of the 23rd international conference on Supercomputing
A survey of confidential data storage and deletion methods

ACM Computing Surveys (CSUR)
Paxos replicated state machines as the basis of a high-performance data store

Proceedings of the 8th USENIX conference on Networked systems design and implementation
Myriad: cost-effective disaster tolerance

FAST'02 Proceedings of the 1st USENIX conference on File and storage technologies
Metadata efficiency in versioning file systems

FAST'03 Proceedings of the 2nd USENIX conference on File and storage technologies
Minimizing metadata access latency in wide area networked file systems

HiPC'06 Proceedings of the 13th international conference on High Performance Computing
A cluster LVM for SAN environments

ISPA'05 Proceedings of the 2005 international conference on Parallel and Distributed Processing and Applications
Wukong: A cloud-oriented file service for mobile Internet devices

Journal of Parallel and Distributed Computing
From paxos to CORFU: a flash-speed shared log

ACM SIGOPS Operating Systems Review
Building access oblivious storage cloud for enterprise

Hot-ICE'12 Proceedings of the 2nd USENIX conference on Hot Topics in Management of Internet, Cloud, and Enterprise Networks and Services
CORFU: a shared log design for flash clusters

NSDI'12 Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation
Beyond block I/O: implementing a distributed shared log in hardware

Proceedings of the 6th International Systems and Storage Conference
Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles

ACM SIGOPS 24th Symposium on Operating Systems Principles
Tango: distributed data structures over a shared log

Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles
CORFU: A distributed shared log

ACM Transactions on Computer Systems (TOCS)
Gecko: contention-oblivious disk arrays for cloud storage

FAST'13 Proceedings of the 11th USENIX conference on File and Storage Technologies
Parity logging with reserved space: towards efficient updates and recovery in erasure-coded clustered storage

FAST'14 Proceedings of the 12th USENIX conference on File and Storage Technologies
Blizzard: fast, cloud-scale block storage for cloud-oblivious applications

NSDI'14 Proceedings of the 11th USENIX Conference on Networked Systems Design and Implementation

Quantified Score

Hi-index	0.00

Visualization

Abstract

Zebra is a network file system that increases throughput by striping the file data across multiple servers. Rather than striping each file separately, Zebra forms all the new data from each client into a single stream, which it then stripes using an approach similar to a log-structured file system. This provides high performance for writes of small files as well as for reads and writes of large files. Zebra also writes parity information in each stripe in the style of RAID disk arrays; this increases storage costs slightly, but allows the system to continue operation while a single storage server is unavailable. A prototype implementation of Zebra, built in the Sprite operating system, provides 4–5 times the throughput of the standard Sprite file system or NFS for large files and a 15–300% improvement for writing small files.