Memory coherence in shared virtual memory systems
ACM Transactions on Computer Systems (TOCS)
Serverless network file systems
SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
The galley parallel file system
ICS '96 Proceedings of the 10th international conference on Supercomputing
A Survey of Recoverable Distributed Shared Virtual Memory Systems
IEEE Transactions on Parallel and Distributed Systems
BFXM: a parallel file system model based on the mechanism of distributed shared memory
ACM SIGOPS Operating Systems Review
Design, implementation and evaluation of ICARE: an efficient recoverable DSM
Software—Practice & Experience - Special issue on multiprocessor operating systems
An Efficient and Scalable Approach for Implementing Fault-Tolerant DSM Architectures
IEEE Transactions on Computers
High Availability of the Memory Hierarchy in a Cluster
SRDS '00 Proceedings of the 19th IEEE Symposium on Reliable Distributed Systems
Hi-index | 0.00 |
A parallel single level store (psls) system integrates a shared virtual memory and a parallel file system representing an attractive support for long running parallel applications in a cluster. In this paper we present the smooth integration of a backward error recovery high-availability support into a psls system. Our highly-available psls system relies on a high degree of integration and re-usability between high-availability and standard supports. We focus on the parallel file system management at checkpointing and recovery time. A prototype has been implemented and we show some performance results.