Towards Non-Stationary Grid Models

  • Authors:
  • Tamás Éltető;Cécile Germain-Renaud;Pascal Bondon;Michèle Sebag

  • Affiliations:
  • University Paris-Sud, INRIA-Saclay and CNRS, Orsay, France;University Paris-Sud, INRIA-Saclay and CNRS, Orsay, France;Laboratoire des Signaux et Systèmes, Supelec, CNRS, Gif Sur Yvette Cedex, France;University Paris-Sud, INRIA-Saclay and CNRS, Orsay, France

  • Venue:
  • Journal of Grid Computing
  • Year:
  • 2011

Quantified Score

Hi-index 0.01

Visualization

Abstract

Despite intense research on Grid scheduling, differentiated quality of service remains an open question, and no consensus has emerged on the most promising strategy. The difficulties of experimentation might be one of the root causes of this stalling. An alternative to experimenting on real, large, and complex data is to look for well-founded and parsimonious representations, which may also contribute to the a-priori knowledge required for operational Autonomics. The goal of this paper is thus to explore explanatory and generative models rather than predictive ones. As a test case, we address the following issue: is it possible to exhibit and validate consistent models of the Grid workload? Most existing work on modeling the dynamics of Grid behavior describes Grids as complex systems, but assumes a steady-state system (technically stationarity) and concludes to some form of long-range dependence (slowly decaying correlation) in the associated time-series. But the physical (economic and sociologic) processes governing the Grid behavior dispel the stationarity hypothesis. This paper considers an appealing different class of models: a sequence of stationary processes separated by breakpoints. The model selection question is now defined as identifying the breakpoints and fitting the processes in each segment. Experimenting with data from the EGEE/EGI Grid, we found that a non-stationary model can consistently be identified from empirical data, and that limiting the range of models to piecewise affine (autoregressive) time series is sufficiently powerful. We propose and experiment a validation methodology that empirically addresses the current lack of theoretical results concerning the quality of the estimated model parameters. Finally, we present a bootstrapping strategy for building more robust models from the limited samples at hand.