Energy efficiency for MapReduce workloads: an in-depth study

  • Authors:
  • Boliang Feng;Jiaheng Lu;Yongluan Zhou;Nan Yang

  • Affiliations:
  • Renmin University of China, Beijing, PR China;Renmin University of China, Beijing, PR China;University of Southern Denmark, Campusvej, Odense M;Renmin University of China, Beijing, PR China

  • Venue:
  • ADC '12 Proceedings of the Twenty-Third Australasian Database Conference - Volume 124
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Energy efficiency has emerged as a crucial optimization goal in data centers. MapReduce has become a popular and even fashionable distributed processing model for parallel computing in data centers. Hadoop is an open-source implementation of MapReduce, which is widely used for short jobs requiring low response time. In this paper, we conduct an in-depth study of the energy efficiency for MapReduce workloads. We identify four factors that affect the energy efficiency of MapReduce. In particular, we make experiments over four typical MapReduce workloads that represent different kinds of application scenarios and measure the energy consumption with varied cluster parameters. Our key finding is that with well-tuned system parameters and adaptive resource configurations, MapReduce cluster can achieve both performance improvement and good energy saving simultaneously in some instances, which is surprisingly contrast to previous works on cluster-level energy conservation.