Fair Share on High Performance Computing Systems: What Does Fair Really Mean?

  • Authors:
  • Stephen D. Kleban;Scott H. Clearwater

  • Affiliations:
  • -;-

  • Venue:
  • CCGRID '03 Proceedings of the 3st International Symposium on Cluster Computing and the Grid
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

We report on a performance evaluation of a Fair Sharesystem at the ASCI Blue Mountain supercomputer cluster.We study the impacts of share allocation under Fair Shareon wait times and expansion factor. We also measure theService Ratio, a typical figure of merit for Fair Sharesystems, with respect to a number of job parameters. Weconclude that Fair Share does little to alter importantperformance metrics such as expansion factor. This leadsto the question of what Fair Share means on clustermachines. The essential difference between Fair Share ona uni-processor and a cluster is that the workload on acluster is not fungible in space or time. We find that clustermachines must be highly utilized and supportcheckpointing in order for Fair Share to function moreclosely to the spirit in which it was originally developed.