ACM Transactions on Computer Systems (TOCS)
A case for redundant arrays of inexpensive disks (RAID)
SIGMOD '88 Proceedings of the 1988 ACM SIGMOD international conference on Management of data
Performance of a disk array protype
SIGMETRICS '91 Proceedings of the 1991 ACM SIGMETRICS conference on Measurement and modeling of computer systems
A status report on research in transparent informed prefetching
ACM SIGOPS Operating Systems Review
Floating parity and data disk arrays
Journal of Parallel and Distributed Computing - Special issue on parallel I/O systems
Parity logging overcoming the small write problem in redundant disk arrays
ISCA '93 Proceedings of the 20th annual international symposium on computer architecture
The logical disk: a new approach to improving file systems
SOSP '93 Proceedings of the fourteenth ACM symposium on Operating systems principles
The Zebra striped network file system
SOSP '93 Proceedings of the fourteenth ACM symposium on Operating systems principles
RAID-II: a high-bandwidth network file server
ISCA '94 Proceedings of the 21st annual international symposium on Computer architecture
Hi-index | 0.00 |
This paper describes the implementation of Sawmill, a network file system using the RAID-II storage system. Sawmill takes advantage of the direct data path in RAID-II between the disks and the network, which bypasses the file server CPU. The key ideas in the implementation of Sawmill are combining logging (LFS) with RAID to obtain fast small writes, using new log layout techniques to improve bandwidth, and pipelining through the controller memory to reduce latency. The file system can currently read data at 21 MB/s and write data at 15 MB/s, close to the raw disk array bandwidth, while running on a relatively slow Sun-4. Performance measurements show that LFS improved performance of a stream of small writes by over a order of magnitude compared to writing directly to the RAID, and this improvement would be even larger with a faster CPU. Sawmill demonstrates that by using a storage system with a direct data path, a file system can provide data at bandwidths much higher than the file server itself could handle. However, processor speed is still an important factor, especially when handling many small requests in parallel.