Boosting energy efficiency with mirrored data block replication policy and energy scheduler

Authors:
Sara Arbab Yazd;Subbarayan Venkatesan;Neeraj Mittal
Affiliations:
The University of Texas at Dallas, Richardson, TX;The University of Texas at Dallas, Richardson, TX;The University of Texas at Dallas, Richardson, TX
Venue:
ACM SIGOPS Operating Systems Review
Year:
2013

Citing 20
Cited 0

Xen and the art of virtualization

SOSP '03 Proceedings of the nineteenth ACM symposium on Operating systems principles
More Than an Interface---SCSI vs. ATA

FAST '03 Proceedings of the 2nd USENIX Conference on File and Storage Technologies
Live migration of virtual machines

NSDI'05 Proceedings of the 2nd conference on Symposium on Networked Systems Design & Implementation - Volume 2
Dryad: distributed data-parallel programs from sequential building blocks

Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
MapReduce: simplified data processing on large clusters

Communications of the ACM - 50th anniversary issue: 1958 - 2008
The Case for Energy-Proportional Computing

Computer
Reducing network energy consumption via sleeping and rate-adaptation

NSDI'08 Proceedings of the 5th USENIX Symposium on Networked Systems Design and Implementation
Energy-aware server provisioning and load dispatching for connection-intensive internet services

NSDI'08 Proceedings of the 5th USENIX Symposium on Networked Systems Design and Implementation
Minimizing data center cooling and server power costs

Proceedings of the 14th ACM/IEEE international symposium on Low power electronics and design
On the energy (in)efficiency of Hadoop clusters

ACM SIGOPS Operating Systems Review
Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling

Proceedings of the 5th European conference on Computer systems
Energy-efficient server clusters

PACS'02 Proceedings of the 2nd international conference on Power-aware computer systems
Energy aware consolidation for cloud computing

HotPower'08 Proceedings of the 2008 conference on Power aware computing and systems
Server farms with setup costs

Performance Evaluation
The Hadoop Distributed File System

MSST '10 Proceedings of the 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST)
Energy management for MapReduce clusters

Proceedings of the VLDB Endowment
Purlieus: locality-aware resource allocation for MapReduce in a cloud

Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Matchmaking: A New MapReduce Scheduling Technique

CLOUDCOM '11 Proceedings of the 2011 IEEE Third International Conference on Cloud Computing Technology and Science
T: a data-centric cooling energy costs reduction approach for big data analytics cloud

SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Migration energy-aware workload consolidation in enterprise clouds

CLOUDCOM '12 Proceedings of the 2012 IEEE 4th International Conference on Cloud Computing Technology and Science (CloudCom)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Energy efficiency is one of the major challenges in big datacenters. To facilitate processing of large data sets in a distributed fashion, the MapReduce programming model is employed in these datacenters. Hadoop is an open-source implementation of MapReduce which contains a distributed file system. Hadoop Distributed File System provides a data block replication scheme to preserve reliability and data availability. The distribution of the data block replicas over the nodes is performed randomly by meeting some constraints (e.g., preventing storage of two replicas of a data block on a single node). This study makes use of flexibility in the data block placement policy to increase energy efficiency in datacenters. Furthermore, inspired by Zaharia et al.'s delay scheduling algorithm, a scheduling algorithm is introduced, which takes into account energy efficiency in addition to fairness and data locality properties. Computer simulations of the proposed method suggest its superiority over Hadoop's standard settings.