In multicluster systems, and more generally in grids, jobs may require co-allocation, i.e., the simultaneous allocation of resources such as processors in multiple clusters, to improve their performance. In previous work, we studied processor co-allocation through simulations. Here, we extend that work with the design and implementation of a dynamic processor co-allocation service for multicluster systems. Although an implementation of basic co-allocation mechanisms has existed for some years in the form of the DUROC component of the Globus Toolkit, DUROC provides neither resource-brokering functionality nor fault tolerance in the face of job submission or completion failures. Our design adds these two elements as a software layer on top of DUROC. Our experiments show that the co-allocation service works reliably.