Modeling machine availability in enterprise and wide-area distributed computing environments

  • Authors:
  • Daniel Nurmi;John Brevik;Rich Wolski

  • Affiliations:
  • Department of Computer Science, University of California, Santa Barbara, Santa Barbara, CA;Department of Computer Science, University of California, Santa Barbara, Santa Barbara, CA;Department of Computer Science, University of California, Santa Barbara, Santa Barbara, CA

  • Venue:
  • Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we consider the problem of modeling machine availability in enterprise-area and wide-area distributed computing settings. Using availability data gathered from three different environments, we detail the suitability of four potential statistical distributions for each data set: exponential, Pareto, Weibull, and hyperexponential. In each case, we use software we have developed to determine the necessary parameters automatically from each data collection. To gauge suitability, we present both graphical and statistical evaluations of the accuracy with each distribution fits each data set. For all three data sets, we find that a hyperexponential model fits slightly more accurately than a Weibull, but that both are substantially better choices than either an exponential or Pareto. These results indicate that either a hyperexponential or Weibull model effectively represents machine availability in enterprise and Internet computing environments.