Energy-Efficient Thermal-Aware Autonomic Management of Virtualized HPC Cloud Infrastructure

  • Authors:
  • Ivan Rodero;Hariharasudhan Viswanathan;Eun Kyung Lee;Marc Gamell;Dario Pompili;Manish Parashar

  • Affiliations:
  • NSF Cloud and Autonomic Computing Center, Rutgers Discovery Informatics Institute, Rutgers University, Piscataway, USA 08854;NSF Cloud and Autonomic Computing Center, Rutgers Discovery Informatics Institute, Rutgers University, Piscataway, USA 08854;NSF Cloud and Autonomic Computing Center, Rutgers Discovery Informatics Institute, Rutgers University, Piscataway, USA 08854;NSF Cloud and Autonomic Computing Center, Rutgers Discovery Informatics Institute, Rutgers University, Piscataway, USA 08854;NSF Cloud and Autonomic Computing Center, Rutgers Discovery Informatics Institute, Rutgers University, Piscataway, USA 08854;NSF Cloud and Autonomic Computing Center, Rutgers Discovery Informatics Institute, Rutgers University, Piscataway, USA 08854

  • Venue:
  • Journal of Grid Computing
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Virtualized datacenters and clouds are being increasingly considered for traditional High-Performance Computing (HPC) workloads that have typically targeted Grids and conventional HPC platforms. However, maximizing energy efficiency and utilization of datacenter resources, and minimizing undesired thermal behavior while ensuring application performance and other Quality of Service (QoS) guarantees for HPC applications requires careful consideration of important and extremely challenging tradeoffs. Virtual Machine (VM) migration is one of the most common techniques used to alleviate thermal anomalies (i.e., hotspots) in cloud datacenter servers as it reduces load and, hence, the server utilization. In this article, the benefits of using other techniques such as voltage scaling and pinning (traditionally used for reducing energy consumption) for thermal management over VM migrations are studied in detail. As no single technique is the most efficient to meet temperature/performance optimization goals in all situations, an autonomic approach that performs energy-efficient thermal management while ensuring the QoS delivered to the users is proposed. To address the problem of VM allocation that arises during VM migrations, an innovative application-centric energy-aware strategy for Virtual Machine (VM) allocation is proposed. The proposed strategy ensures high resource utilization and energy efficiency through VM consolidation while satisfying application QoS by exploiting knowledge obtained through application profiling along multiple dimensions (CPU, memory, and network bandwidth utilization). To support our arguments, we present the results obtained from an experimental evaluation on real hardware using HPC workloads under different scenarios.