Extensible Resource Management For Cluster Computing

Authors:
Nayeem Islam;Mark S. Squillante;Liana L. Fong;Ajei S. Gopal;Andreas L. Prodromidis
Affiliations:
-;-;-;-;-
Venue:
ICDCS '97 Proceedings of the 17th International Conference on Distributed Computing Systems (ICDCS '97)
Year:
1997

Citing 0
Cited 10

The impact of I/O on program behavior and parallel scheduling

SIGMETRICS '98/PERFORMANCE '98 Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
An evaluation of parallel job scheduling for ASCI Blue-Pacific

SC '99 Proceedings of the 1999 ACM/IEEE conference on Supercomputing
Impact of Workload and System Parameters on Next Generation Cluster Scheduling Mechanisms

IEEE Transactions on Parallel and Distributed Systems
An infrastructure for efficient parallel job execution in Terascale computing environments

SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Internet based audiovisual communication tools used in a joint engineering design class at the University of Alabama in Huntsville and the Ecole Superieure et de Construction Automobile (ESTACA) in Paris, France

SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Process Tracking for Parallel Job Control

IPPS/SPDP '99/JSSPP '99 Proceedings of the Job Scheduling Strategies for Parallel Processing
The Effect of Correlating Quantum Allocation and Job Size for Gang Scheduling

IPPS/SPDP '99/JSSPP '99 Proceedings of the Job Scheduling Strategies for Parallel Processing
An Integrated Approach to Parallel Scheduling Using Gang-Scheduling, Backfilling, and Migration

JSSPP '01 Revised Papers from the 7th International Workshop on Job Scheduling Strategies for Parallel Processing
An Integrated Approach to Parallel Scheduling Using Gang-Scheduling, Backfilling, and Migration

IEEE Transactions on Parallel and Distributed Systems
Resource Calculations with Constraints, and Placement of Tenants and Instances for Multi-tenant SaaS Applications

ICSOC '08 Proceedings of the 6th International Conference on Service-Oriented Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Advanced general-purpose parallel systems should be able to support diverse applications with different resource requirements without compromising effectiveness and efficiency. We present a new resource management model for cluster computing that allows multiple scheduling policies to co-exist dynamically. In particular, we have built ``Octopus'', an extensible and distributed hierarchical scheduler that implements new space-sharing, gang-scheduling and load-sharing strategies. A series of experiments performed on an IBM SP2 suggest that ``Octopus'' can effectively match application requirements to available resources, and improve the performance of a variety of parallel applications within a cluster.