Predicting job start times on clusters

  • Authors:
  • Hui Li;D. Groep;J. Templon;L. Wolters

  • Affiliations:
  • Leiden Inst. of Adv. Comput. Sci., Leiden Univ., Netherlands;Leiden Inst. of Adv. Comput. Sci., Leiden Univ., Netherlands;Dept. of Comput. Sci., Illinois Univ., Urbana, IL, USA;Dept. of Comput. Sci., Indiana Univ., Bloomington, IN, USA

  • Venue:
  • CCGRID '04 Proceedings of the 2004 IEEE International Symposium on Cluster Computing and the Grid
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

In a computational Grid which consists of many computer clusters, job start time predictions are useful to guide resource selections and balance the workload distribution. However, the basic Grid middleware available today either has no means of expressing the time that a site will take before starting a job or uses a simple linear scale. In this paper we introduce a system for predicting job start times on clusters. Our predictions are based on statistical analysis of historical job traces and simulation of site schedulers. We have deployed the system on the EDG (European Data-Grid) production cluster at NIKHEF. The experimental results show that acceptable prediction accuracy is achieved to reflect real site states and site-specific scheduling policies. We find that the average error of our job start time predictions is 18.9 percent of the average job queue wait time and this is around 20 times smaller than the average prediction error using the EDG solution.