A dynamic processor allocation policy for multiprogrammed shared-memory multiprocessors
ACM Transactions on Computer Systems (TOCS)
CHARM++: a portable concurrent object oriented system based on C++
OOPSLA '93 Proceedings of the eighth annual conference on Object-oriented programming systems, languages, and applications
Dynamic scheduling on parallel machines
Theoretical Computer Science - Special issue on dynamic and on-line algorithms
Dynamic resource management on distributed systems using reconfigurable applications
IBM Journal of Research and Development - Special issue: performance analysis and its impact on design
Using moldability to improve the performance of supercomputer jobs
Journal of Parallel and Distributed Computing
A Performance Oriented Migration Framework For The Grid
CCGRID '03 Proceedings of the 3st International Symposium on Cluster Computing and the Grid
A Malleable-Job System for Timeshared Parallel Machines
CCGRID '02 Proceedings of the 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid
Integrated scheduling: the best of both worlds
Journal of Parallel and Distributed Computing
A framework for adaptive execution in grids
Software—Practice & Experience
Performance-Driven Processor Allocation
IEEE Transactions on Parallel and Distributed Systems
ICPP '07 Proceedings of the 2007 International Conference on Parallel Processing
An architecture for reconfigurable iterative MPI applications in dynamic environments
PPAM'05 Proceedings of the 6th international conference on Parallel Processing and Applied Mathematics
Efficient multidimensional data redistribution for resizable parallel computations
ISPA'07 Proceedings of the 5th international conference on Parallel and Distributed Processing and Applications
Evaluation of reallocation heuristics for moldable tasks in computational grids
AusPDC '11 Proceedings of the Ninth Australasian Symposium on Parallel and Distributed Computing - Volume 118
Competitive online adaptive scheduling for sets of parallel jobs with fairness and efficiency
Journal of Parallel and Distributed Computing
Hi-index | 0.00 |
This paper describes the design and initial implementation of a software framework for exploiting resizability in distributed-memory parallel applications. By ''resizable'' we mean the ability at run-time to expand or contract the number of processes participating in a parallel application. The ReSHAPE framework described here includes a cluster scheduler, a library supporting data redistribution and process remapping, and an application programming interface (API) which allows applications to interact with the scheduler and resizing library with only minor code modifications. Parallel applications executed using the ReSHAPE framework can expand to take advantage of additional free processors or contract to accommodate a high priority application without being suspended. Experimental results show that the ReSHAPE framework can significantly improve individual job turn-around time and overall system throughput, even with very simple application scheduling policies. In addition, the framework serves as a convenient platform for research into much more sophisticated cluster scheduling policies and methods.