Using moldability to improve the performance of supercomputer jobs

Authors:
Walfredo Cirne;Francine Berman
Affiliations:
Departamento de Sistemas e Computa¸ão, Universidade Federal de Campina Grande, Av. Aprígio Veloso, 882, Campina Grande, PB 58109-970 Brazil;San Diego Supercomputer Center, Computer Science and Engineering, University of California at San Diego, MC and 9500 Gilman Drive, LaJolla, California
Venue:
Journal of Parallel and Distributed Computing
Year:
2002

Citing 28
Cited 20

Speedup Versus Efficiency in Parallel Systems

IEEE Transactions on Computers
Characterizations of parallelism in applications and their use in scheduling

SIGMETRICS '89 Proceedings of the 1989 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
The Processor Working Set and its Use in Scheduling Multiprocessor Systems

IEEE Transactions on Software Engineering
Application scheduling and processor allocation in multiprogrammed parallel processing systems

Performance Evaluation - Special issue: performance modeling of parallel processing systems
Robust partitioning policies of multiprocessor systems

Performance Evaluation - Special issue: performance modeling of parallel processing systems
Use of application characteristics and limited preemption for run-to-completion parallel processor scheduling policies

SIGMETRICS '94 Proceedings of the 1994 ACM SIGMETRICS conference on Measurement and modeling of computer systems
How useful is old information (extended abstract)?

PODC '97 Proceedings of the sixteenth annual ACM symposium on Principles of distributed computing
Parallel application scheduling on networks of workstations

Journal of Parallel and Distributed Computing
Computer architecture (2nd ed.): a quantitative approach

Computer architecture (2nd ed.): a quantitative approach
The elusive goal of workload characterization

ACM SIGMETRICS Performance Evaluation Review
Impact of job mix on optimizations for space sharing schedulers

Supercomputing '96 Proceedings of the 1996 ACM/IEEE conference on Supercomputing
When the Herd Is Smart: Aggregate Behavior in the Selection of Job Request

IEEE Transactions on Parallel and Distributed Systems
Predicting Queue Times on Space-Sharing Parallel Computers

IPPS '97 Proceedings of the 11th International Symposium on Parallel Processing
A Model for Moldable Supercomputer Jobs

IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Analysis of Non-Work-Conserving Processor Partitioning Policies

IPPS '95 Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing
The ANL/IBM SP Scheduling System

IPPS '95 Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing
Job Scheduling Under the Portable Batch System

IPPS '95 Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing
A Historical Application Profiler for Use by Parallel Schedulers

IPPS '97 Proceedings of the Job Scheduling Strategies for Parallel Processing
Theory and Practice in Parallel Job Scheduling

IPPS '97 Proceedings of the Job Scheduling Strategies for Parallel Processing
Using Queue Time Predictions for Processor Allocation

IPPS '97 Proceedings of the Job Scheduling Strategies for Parallel Processing
A Comparative Study of Real Workload Traces and Synthetic Workload Models for Parallel Job Scheduling

IPPS/SPDP '98 Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing
Metrics and Benchmarking for Parallel Job Scheduling

IPPS/SPDP '98 Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing
Job Scheduling Scheme for Pure Space Sharing Among Rigid Jobs

IPPS/SPDP '98 Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing
Job-Length Estimation and Performance in Backfilling Schedulers

HPDC '99 Proceedings of the 8th IEEE International Symposium on High Performance Distributed Computing
Utilization and Predictability in Scheduling the IBM SP2 with Backfilling

IPPS '98 Proceedings of the 12th. International Parallel Processing Symposium on International Parallel Processing Symposium
A Model For Speedup of Parallel Programs

A Model For Speedup of Parallel Programs
Using moldability to improve the performance of supercomputer jobs

Using moldability to improve the performance of supercomputer jobs
A comprehensive model of the supercomputer workload

WWC '01 Proceedings of the Workload Characterization, 2001. WWC-4. 2001 IEEE International Workshop

When the Herd Is Smart: Aggregate Behavior in the Selection of Job Request

IEEE Transactions on Parallel and Distributed Systems
Adaptive Computing on the Grid Using AppLeS

IEEE Transactions on Parallel and Distributed Systems
Use of PVFS for Efficient Execution of Jobs with Pipeline-Shared I/O

GRID '04 Proceedings of the 5th IEEE/ACM International Workshop on Grid Computing
User-guided symbiotic space-sharing of real workloads

Proceedings of the 20th annual international conference on Supercomputing
Backfilling Using System-Generated Predictions Rather than User Runtime Estimates

IEEE Transactions on Parallel and Distributed Systems
Workflow task clustering for best effort systems with Pegasus

Proceedings of the 15th ACM Mardi Gras conference: From lightweight mash-ups to lambda grids: Understanding the spectrum of distributed computing requirements, applications, tools, infrastructures, interoperability, and the incremental adoption of key capabilities
Robust scheduling of moldable parallel jobs

International Journal of High Performance Computing and Networking
The cost of doing science on the cloud: the Montage example

Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Simbatch: An API for Simulating and Predicting the Performance of Parallel Resources Managed by Batch Systems

Euro-Par 2008 Workshops - Parallel Processing
Dynamic Resizing of Parallel Scientific Simulations: A Case Study Using LAMMPS

ICCS '09 Proceedings of the 9th International Conference on Computational Science: Part I
Real-time scheduling of divisible loads in cluster computing environments

Journal of Parallel and Distributed Computing
Design and performance of a scheduling framework for resizable parallel applications

Parallel Computing
Moldable parallel job scheduling using job efficiency: an iterative approach

JSSPP'06 Proceedings of the 12th international conference on Job scheduling strategies for parallel processing
Using moldability to improve scheduling performance of parallel jobs on computational grid

GPC'08 Proceedings of the 3rd international conference on Advances in grid and pervasive computing
Supporting GPU sharing in cloud environments with a transparent runtime consolidation framework

Proceedings of the 20th international symposium on High performance distributed computing
Scheduling Concurrent Applications on a Cluster of CPU-GPU Nodes

CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
A job scheduling approach for multi-core clusters based on virtual malleability

Euro-Par'12 Proceedings of the 18th international conference on Parallel Processing
Evaluation of reallocation heuristics for moldable tasks in computational grids

AusPDC '11 Proceedings of the Ninth Australasian Symposium on Parallel and Distributed Computing - Volume 118
Scheduling concurrent applications on a cluster of CPU-GPU nodes

Future Generation Computer Systems
Predicting intermediate storage performance for workflow applications

PDSW '13 Proceedings of the 8th Parallel Data Storage Workshop

Quantified Score

Hi-index	0.01

Visualization

Abstract

In most parallel supercomputers, submitting a job for execution involves specifying (i) how many processors are to be allocated to the job, and (ii) for how long these processors are to be available to the job. Since most jobs are moldable (i.e., there is a choice on how many processors the job uses), the user typically has to decide how many processors to ask for a given job, and estimate the request time accordingly. In this paper, we show that the request that submits a moldable job can be automatically selected in a way that often reduces the job's turn-around time. More precisely, we introduce and evaluate SA, an application scheduler that chooses, on behalf of the user, which request to use to submit a moldable job. The user provides SA with a set of possible requests that can be used to submit a given moldable job. SA estimates the turn-around time of each request based on the current state of the supercomputer, and then forwards to the supercomputer the request with the smallest expected turn-around time. The conditions under which SA is studied cover variations on the characteristics of the job, the state of the supercomputer, and the information available to SA. The results show that SA often improves the turn-around time of the job under a variety of conditions.