Dual time-scale distributed capacity allocation and load redirect algorithms for cloud systems

  • Authors:
  • Danilo Ardagna;Sara Casolari;Michele Colajanni;Barbara Panicucci

  • Affiliations:
  • Politecnico di Milano, Dipartimento di Elettronica Informazione, Italy;Universití di Modena e Reggio Emilia, Dipartimento di Ingegneria dell'Informazione, Italy;Universití di Modena e Reggio Emilia, Dipartimento di Ingegneria dell'Informazione, Italy;Politecnico di Milano, Dipartimento di Elettronica Informazione, Italy and Universití di Modena e Reggio Emilia, Dipartimento di Scienze e Metodi dell'Ingegneria, Italy

  • Venue:
  • Journal of Parallel and Distributed Computing
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Resource management remains one of the main issues of cloud computing providers because system resources have to be continuously allocated to handle workload fluctuations while guaranteeing Service Level Agreements (SLA) to the end users. In this paper, we propose novel capacity allocation algorithms able to coordinate multiple distributed resource controllers operating in geographically distributed cloud sites. Capacity allocation solutions are integrated with a load redirection mechanism which, when necessary, distributes incoming requests among different sites. The overall goal is to minimize the costs of allocated resources in terms of virtual machines, while guaranteeing SLA constraints expressed as a threshold on the average response time. We propose a distributed solution which integrates workload prediction and distributed non-linear optimization techniques. Experiments show how the proposed solutions improve other heuristics proposed in literature without penalizing SLAs, and our results are close to the global optimum which can be obtained by an oracle with a perfect knowledge about the future offered load.