The Sprite Network Operating System
Computer
Memory coherence in shared virtual memory systems
ACM Transactions on Computer Systems (TOCS)
Transparent process migration: design alternatives and the sprite implementation
Software—Practice & Experience
Implementing global memory management in a workstation cluster
SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
MILLIPEDE: easy parallel programming in available distributed environments
Software—Practice & Experience
A Survey of Recoverable Distributed Shared Virtual Memory Systems
IEEE Transactions on Parallel and Distributed Systems
Design, implementation and evaluation of ICARE: an efficient recoverable DSM
Software—Practice & Experience - Special issue on multiprocessor operating systems
Preemptable remote execution facilities for the V-system
Proceedings of the tenth ACM symposium on Operating systems principles
Adaptive and reliable paging to remote main memory
Journal of Parallel and Distributed Computing
The MOSIX Distributed Operating System: Load Balancing for UNIX
The MOSIX Distributed Operating System: Load Balancing for UNIX
GENESIS: an efficient, transparent and easy to use cluster operating system
Parallel Computing
Containers: A Sound Basis For a True Single System Image
CCGRID '01 Proceedings of the 1st International Symposium on Cluster Computing and the Grid
Checkpointing an Recovery of Share Memory Parallel Applications in a Cluster
CCGRID '03 Proceedings of the 3st International Symposium on Cluster Computing and the Grid
SOSP '83 Proceedings of the ninth ACM symposium on Operating systems principles
Nomad: A Scalable Operating System for Clusters of Uni and Multiprocessors
IWCC '99 Proceedings of the 1st IEEE Computer Society International Workshop on Cluster Computing
Distributed operating system for resource discovery and allocation in federated clusters
Proceedings of the twentieth ACM symposium on Operating systems principles
The parXXL environment: scalable fine grained development for large coarse grained platforms
PARA'06 Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing
EuroPar'10 Proceedings of the 16th international Euro-Par conference on Parallel processing: Part I
Architecture for the next generation system management tools
Future Generation Computer Systems
Towards multi-level adaptation for distributed operating systems and applications
ICA3PP'12 Proceedings of the 12th international conference on Algorithms and Architectures for Parallel Processing - Volume Part II
Hi-index | 0.00 |
The lack of a single system image Operating System (OS) for clusters restricts their use for parallel processing. We propose an approach for building an efficient single system image cluster operating system. The proposed system implements distributed services performing global and dynamic resource management to offer high performance, high availability and ease of use and programming. The programming API of the OS running on each cluster node is kept unmodified but the high level OS services can take benefit of all cluster resources. Our approach has been validated by a prototype based on Linux. Our prototype comprises of a limited kernel patch and a set of modules extending the kernel to implement the cluster distributed services. Existing applications running on symmetric multiprocessors (SMP) on top of Linux can be executed on top of our cluster OS without modification.