Recent distributed systems are connected by high-performance networks, which make possible a new level of the memory hierarchy: network memory. Network memory offers both high bandwidth and low latency, which makes it attractive for uses such as paging and file caching. In this paper we explore the issues involved in building a reliable network memory system that is resilient to single workstation failures. We propose novel parity-based policies and evaluate their performance using trace-driven simulation of realistic applications. The performance results suggest that the proposed policies provide reliability at a surprisingly small run-time overhead.
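To make the idea of parity-based reliability concrete, the following is a minimal sketch of generic XOR parity over pages held on remote workstations. It is not the specific policy proposed in the paper, whose details the abstract does not give. The names `PAGE_SIZE`, `xor_pages`, `build_parity`, and `recover` are hypothetical, and the sketch assumes one equal-sized page per workstation and at most one failure.

```python
from functools import reduce

PAGE_SIZE = 8  # bytes; deliberately tiny, an assumption for illustration only


def xor_pages(pages):
    """Return the bytewise XOR of a list of equal-length pages."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*pages))


def build_parity(data_pages):
    """Compute the parity page that would be stored on a separate workstation."""
    return xor_pages(data_pages)


def recover(surviving_pages, parity):
    """Rebuild the single lost page from the surviving pages and the parity page."""
    return xor_pages(surviving_pages + [parity])


# Three data pages, each imagined to live in a different workstation's memory.
pages = [bytes([value] * PAGE_SIZE) for value in (1, 2, 3)]
parity = build_parity(pages)

# Simulate the failure of the workstation holding page 1.
lost = 1
survivors = [page for index, page in enumerate(pages) if index != lost]
rebuilt = recover(survivors, parity)
```

Because XOR is its own inverse, XORing the parity page with every surviving page yields the missing page. This tolerates the loss of any one workstation while storing a single extra page per group, rather than a full copy of each page.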