A cooperative approach to support software deployment using the software dock
Proceedings of the 21st international conference on Software engineering
Hierarchical resource management in the polder metacomputing initiative
Parallel Computing - Special issue on applications
Performance of Hierarchical Processor Scheduling in Shared-Memory Multiprocessor Systems
IEEE Transactions on Computers
Design and implementations of Ninf: towards a global computing infrastructure
Future Generation Computer Systems - Special issue on metacomputing
Future Generation Computer Systems - Special issue on metacomputing
Constrained Component Deployment in Wide-Area Networks Using AI Planning Techniques
IPDPS '03 Proceedings of the 17th International Symposium on Parallel and Distributed Processing
Distributed Ant: A System to Support Application Deployment in the Grid
GRID '04 Proceedings of the 5th IEEE/ACM International Workshop on Grid Computing
Optimal Resource-Aware Deployment Planning for Component-Based Distributed Applications
HPDC '04 Proceedings of the 13th IEEE International Symposium on High Performance Distributed Computing
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
The Pegasus portal: web based grid computing
Proceedings of the 2005 ACM symposium on Applied computing
Worldwide computing: Adaptive middleware and programming technology for dynamic Grid environments
Scientific Programming - Dynamic Grids and Worldwide Computing
Grid'5000: A Large Scale and Highly Reconfigurable Grid Experimental Testbed
GRID '05 Proceedings of the 6th IEEE/ACM International Workshop on Grid Computing
Diet: new developments and recent results
Euro-Par'06 Proceedings of the CoreGRID 2006, UNICORE Summit 2006, Petascale Computational Biology and Bioinformatics conference on Parallel processing
Deployment of a hierarchical middleware
EuroPar'10 Proceedings of the 16th international Euro-Par conference on Parallel processing: Part I
A first step towards automatically building network representations
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
Hi-index | 0.00 |
The use of remotely distributed computing resources as a single system offers great potential for compute-intensive applications. Increasingly, users have access to hundreds or thousands of machines at once and wish to utilize those resources concurrently. To provide a reasonable user experience, such systems must provide an effective, scalable scheduling system. Unfortunately, the great majority of job schedulers are centralized and many do not scale well to thousands or even hundreds of nodes. In this paper we study how distributed scheduling systems can be designed most effectively; we focus on the problem of selecting an optimal arrangement of schedulers, or a deployment, for hierarchically organized systems. We show that the optimal deployment is a complete spanning d-ary tree; this result conforms with results from the scheduling literature. More importantly, we present an approach for determining the optimal degree d for the tree. To test our approach, we use DIET, a middleware system that uses hierarchical schedulers. We develop detailed performance models for DIET and validate these models in a real-world environment. Finally, we demonstrate that our approach selects deployments that are near-optimal in practice.