Building energy-efficient systems is critical for big data applications. This paper investigates and compares the energy consumption and execution time of a typical Hadoop-based big data application running on a traditional Xeon-based cluster and on an Atom-based (micro-server) cluster. Our experimental results show that the micro-server platform is more energy-efficient than the Xeon-based platform. They also reveal that data compression and decompression account for a considerable share of the total execution time: 7-11% of the execution time of map tasks and 37.9-41.2% of the execution time of reduce tasks. Based on these findings, we demonstrate the need for a heterogeneous architecture for energy-efficient big data processing, one that combines the advantages of micro-server processors with hardware compression/decompression accelerators. In addition, we propose a mechanism that enables the accelerators to perform data compression/decompression more efficiently.
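The compression/decompression overhead measured above arises from Hadoop's (de)compression of intermediate map output and job output. As a minimal illustrative sketch, and assuming a standard Hadoop 2.x deployment (the paper does not specify its configuration or codec), this stage is controlled by job-level configuration properties such as:

```xml
<!-- mapred-site.xml: enable compression of intermediate map output.
     The Snappy codec here is an illustrative assumption; the paper does
     not state which codec its experiments used. -->
<property>
  <name>mapreduce.map.output.compress</name>
  <value>true</value>
</property>
<property>
  <name>mapreduce.map.output.compress.codec</name>
  <value>org.apache.hadoop.io.compress.SnappyCodec</value>
</property>
<!-- Optionally also compress final job output. -->
<property>
  <name>mapreduce.output.fileoutputformat.compress</name>
  <value>true</value>
</property>
```

In a heterogeneous design like the one proposed, the codec named here would be backed by a hardware accelerator rather than executed on the micro-server cores.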