The impact of job memory requirements on gang-scheduling performance

  • Authors:
  • Sanjeev Setia;Mark S. Squillante;Vijay K. Naik

  • Affiliations:
  • Computer Science Department, George Mason University, Fairfax, VA;IBM Thomas J. Watson Research Center, P.O. Box 704, Yorktown Heights, NY;IBM Thomas J. Watson Research Center, P.O. Box 704, Yorktown Heights, NY

  • Venue:
  • ACM SIGMETRICS Performance Evaluation Review
  • Year:
  • 1999

Quantified Score

Hi-index 0.00

Visualization

Abstract

Almost all previous research on gang-scheduling has ignored the impact of real job memory requirements on the performance of the policy. This is despite the fact that on parallel supercomputers, because of the problems associated with demand paging, executing jobs are typically allocated enough memory so that their entire address space is memory-resident. In this paper, we examine the impact of job memory requirements on the performance of gang-scheduling policies. We first present an analysis of the memory-usage characteristics of jobs in the production workload on the Cray T3E at the San Diego Supercomputer Center. We also characterize the memory usage of some of the applications that form part of the workload on the LLNL ASCI supercomputer. Next, we examine the issue of long-term scheduling on MPPs, i.e., we study policies for deciding which jobs among a set of competing jobs should be allocated memory and thus should be allowed to execute on the processors of the system. Using trace-driven simulation, we evaluate the impact of using different long-term scheduling policies on the overall performance of Distributed Hierarchical Control (DHC), a gang-scheduling policy that has been studied extensively in the research literature.