Making the most out of direct-access network attached storage

Authors:
Kostas Magoutis;Salimah Addetia;Alexandra Fedorova;Margo I. Seltzer
Affiliations:
Division of Engineering and Applied Sciences, Harvard University;Division of Engineering and Applied Sciences, Harvard University;Division of Engineering and Applied Sciences, Harvard University;Division of Engineering and Applied Sciences, Harvard University
Venue:
FAST'03 Proceedings of the 2nd USENIX conference on File and storage technologies
Year:
2003

Citing 23
Cited 1

Limits to low-latency communication on high-speed networks

ACM Transactions on Computer Systems (TOCS)
LogP: towards a realistic model of parallel computation

PPOPP '93 Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming
Separating data and control transfer in distributed operating systems

ASPLOS VI Proceedings of the sixth international conference on Architectural support for programming languages and operating systems
U-Net: a user-level network interface for parallel and distributed computing

SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
An implementation of the Hamlyn sender-managed interface architecture

OSDI '96 Proceedings of the second USENIX symposium on Operating systems design and implementation
Effects of communication latency, overhead, and bandwidth in a cluster architecture

Proceedings of the 24th annual international symposium on Computer architecture
A cost-effective, high-bandwidth storage architecture

Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
IO-lite: a unified I/O buffering and caching system

OSDI '99 Proceedings of the third symposium on Operating systems design and implementation
NFS sensitivity to high performance networks

SIGMETRICS '99 Proceedings of the 1999 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Implementing remote procedure calls

ACM Transactions on Computer Systems (TOCS)
Queue pair IP: a hybrid architecture for system area networks

ISCA '02 Proceedings of the 29th annual international symposium on Computer architecture
SCTP: New Transport Protocol for TCP/IP

IEEE Internet Computing
Fibre Channel: A Connection to the Future

Computer
GPFS: A Shared-Disk File System for Large Computing Clusters

FAST '02 Proceedings of the Conference on File and Storage Technologies
Structure and Performance of the Direct Access File System

ATEC '02 Proceedings of the General Track of the annual conference on USENIX Annual Technical Conference
The Multi-Queue Replacement Algorithm for Second Level Buffer Caches

Proceedings of the General Track: 2002 USENIX Annual Technical Conference
An Efficient Zero-Copy I/O Framework for UNIX

An Efficient Zero-Copy I/O Framework for UNIX
The Direct Access File System

FAST '03 Proceedings of the 2nd USENIX Conference on File and Storage Technologies
Design and implementation of a direct access file system (DAFS) kernel server for FreeBSD

BSDC'02 Proceedings of the BSD Conference 2002 on BSD Conference
Interposed request routing for scalable network storage

OSDI'00 Proceedings of the 4th conference on Symposium on Operating System Design & Implementation - Volume 4
Cheating the I/O bottleneck: network storage with Trapeze/Myrinet

ATEC '98 Proceedings of the annual conference on USENIX Annual Technical Conference
Berkeley DB

ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference
End system optimizations for high-speed TCP

IEEE Communications Magazine

RDMA in the SiCortex cluster systems

PVM/MPI'07 Proceedings of the 14th European conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface

Quantified Score

Hi-index	0.00

Visualization

Abstract

The performance of high-speed network-attached storage applications is often limited by end-system overhead, caused primarily by memory copying and network protocol processing. In this paper, we examine alternative strategies for reducing overhead in such systems. We consider optimizations to remote procedure call (RPC)-based data transfer using either remote direct memory access (RDMA) or network interface support for pre-posting of application receive buffers. We demonstrate that both mechanisms enable file access throughput that saturates a 2Gb/s network link when performing large I/Os on relatively slow, commodity PCs. However, for multi-client workloads dominated by small I/Os, throughput is limited by the per-I/O overhead of processing RPCs in the server. For such workloads, we propose the use of a new network I/O mechanism, Optimistic RDMA (ORDMA). ORDMA is an alternative to RPC that aims to improve server throughput and response time for small I/Os. We measured performance improvements of up to 32% in server throughput and 36% in response time with use of ORDMA in our prototype.