Towards autonomic detection of SLA violations in Cloud infrastructures

  • Authors:
  • Vincent C. Emeakaroha;Marco A. S. Netto;Rodrigo N. Calheiros;Ivona Brandic;Rajkumar Buyya;César A. F. De Rose

  • Affiliations:
  • Vienna University of Technology, Vienna, Austria;Faculty of Informatics, Catholic University of Rio Grande do Sul, Porto Alegre, Brazil;CLOUDS Laboratory, Department of Computer Science and Software Engineering, The University of Melbourne, Australia;Vienna University of Technology, Vienna, Austria;CLOUDS Laboratory, Department of Computer Science and Software Engineering, The University of Melbourne, Australia;Faculty of Informatics, Catholic University of Rio Grande do Sul, Porto Alegre, Brazil

  • Venue:
  • Future Generation Computer Systems
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Cloud computing has become a popular paradigm for implementing scalable computing infrastructures provided on-demand on a case-by-case basis. Self-manageable Cloud infrastructures are required in order to comply with users' requirements defined by Service Level Agreements (SLAs) and to minimize user interactions with the computing environment. Thus, adequate SLA monitoring strategies and timely detection of possible SLA violations represent challenging research issues. This paper presents the Detecting SLA Violation infrastructure (DeSVi) architecture, sensing SLA violations through sophisticated resource monitoring. Based on the user requests, DeSVi allocates computing resources for a requested service and arranges its deployment on a virtualized environment. Resources are monitored using a novel framework capable of mapping low-level resource metrics (e.g., host up and down time) to user-defined SLAs (e.g., service availability). The detection of possible SLA violations relies on the predefined service level objectives and utilization of knowledge databases to manage and prevent such violations. We evaluate the DeSVi architecture using two application scenarios: (i) image rendering applications based on ray-tracing, and (ii) transactional web applications based on the well-known TPC-W benchmark. These applications exhibit heterogeneous workloads for investigating optimal monitoring interval of SLA parameters. The achieved results show that our architecture is able to monitor and detect SLA violations. The architecture output also provides a guideline on the appropriate monitoring intervals for applications depending on their resource consumption behavior.