GreenHadoop: leveraging green energy in data-processing frameworks

  • Authors:
  • Íñigo Goiri;Kien Le;Thu D. Nguyen;Jordi Guitart;Jordi Torres;Ricardo Bianchini

  • Affiliations:
  • Rutgers University, Piscataway, NJ, USA;Rutgers University, Piscataway, NJ, USA;Rutgers University, Piscataway, NJ, USA;Universitat Politècnica de Catalunya/Barcelona Supercomputing Center, Barcelona, Spain;Universitat Politècnica de Catalunya/Barcelona Supercomputing Center, Barcelona, Spain;Rutgers University, Piscataway, NJ, USA

  • Venue:
  • Proceedings of the 7th ACM European Conference on Computer Systems
  • Year:
  • 2012

Abstract

Interest has been growing in powering datacenters (at least partially) with renewable or "green" sources of energy, such as solar or wind. However, it is challenging to use these sources because, unlike the "brown" (carbon-intensive) energy drawn from the electrical grid, they are not always available. This means that energy demand and supply must be matched, if we are to take full advantage of the green energy to minimize brown energy consumption. In this paper, we investigate how to manage a datacenter's computational workload to match the green energy supply. In particular, we consider data-processing frameworks, in which many background computations can be delayed by a bounded amount of time. We propose GreenHadoop, a MapReduce framework for a datacenter powered by a photovoltaic solar array and the electrical grid (as a backup). GreenHadoop predicts the amount of solar energy that will be available in the near future, and schedules the MapReduce jobs to maximize the green energy consumption within the jobs' time bounds. If brown energy must be used to avoid time bound violations, GreenHadoop selects times when brown energy is cheap, while also managing the cost of peak brown power consumption. Our experimental results demonstrate that GreenHadoop can significantly increase green energy consumption and decrease electricity cost, compared to Hadoop.
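
The scheduling idea in the abstract can be illustrated with a small sketch. This is not GreenHadoop's actual algorithm, which the abstract does not specify; it is a simplified greedy approximation under stated assumptions. The `Job` class, the `schedule` function, the fixed-length time slots, and the freedom to split a job's energy arbitrarily across slots are all hypothetical, and the sketch ignores peak brown power charges and cluster capacity limits. It serves jobs in earliest-deadline order, consumes predicted green energy first, and covers any remainder with brown energy in the cheapest slot before the deadline.

```python
from dataclasses import dataclass


@dataclass
class Job:
    name: str
    energy: float   # total kWh the job needs (hypothetical unit)
    deadline: int   # job must finish before this slot index (assumed >= 1)


def schedule(jobs, green, brown_price):
    """Greedy sketch: use predicted green energy first, then cheapest brown.

    green[t]       : predicted green energy available in slot t (kWh)
    brown_price[t] : grid price in slot t ($/kWh)
    Returns (plan, brown_cost), where plan[name][t] is kWh run in slot t.
    """
    green_left = list(green)
    plan = {}
    brown_cost = 0.0
    # Earliest deadline first, so urgent jobs get first pick of green energy.
    for job in sorted(jobs, key=lambda j: j.deadline):
        need = job.energy
        alloc = {}
        # Pass 1: consume green energy in slots before the deadline.
        for t in range(min(job.deadline, len(green_left))):
            if need <= 0:
                break
            used = min(need, green_left[t])
            if used > 0:
                green_left[t] -= used
                alloc[t] = alloc.get(t, 0.0) + used
                need -= used
        # Pass 2: cover any remainder with brown energy in the cheapest slot.
        if need > 0:
            slots = range(min(job.deadline, len(brown_price)))
            cheapest = min(slots, key=lambda t: brown_price[t])
            alloc[cheapest] = alloc.get(cheapest, 0.0) + need
            brown_cost += need * brown_price[cheapest]
        plan[job.name] = alloc
    return plan, brown_cost


# Illustrative inputs only; these numbers are made up, not from the paper.
green = [0.0, 4.0, 6.0, 1.0]
brown_price = [0.10, 0.20, 0.20, 0.05]
jobs = [Job("a", 5.0, 2), Job("b", 6.0, 4)]
plan, brown_cost = schedule(jobs, green, brown_price)
```

With these made-up inputs, job `a` has to finish within the first two slots, so it takes all 4 kWh of green energy in slot 1 and buys its last 1 kWh from the grid in slot 0, the cheaper of its two eligible slots. Job `b` has a later deadline and runs entirely on the green energy predicted for slot 2.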