Computing infrastructure for big data processing

Authors:
Ling Liu
Affiliations:
Distributed Data Intensive Systems Lab, School of Computer Science, Georgia Institute of Technology, Atlanta, USA 30332
Venue:
Frontiers of Computer Science: Selected Publications from Chinese Universities
Year:
2013

Citing 6
Cited 0

Thousand core chips: a technology perspective

Proceedings of the 44th annual Design Automation Conference
LUBM: A benchmark for OWL knowledge base systems

Web Semantics: Science, Services and Agents on the World Wide Web
Pregel: a system for large-scale graph processing

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Power-efficient computing for compute-intensive GPGPU applications

Proceedings of the 21st international conference on Parallel architectures and compilation techniques
GraphChi: large-scale graph computation on just a PC

OSDI'12 Proceedings of the 10th USENIX conference on Operating Systems Design and Implementation
Analyzing the impact of joint optimization of cell size, redundancy, and ECC on low-voltage SRAM array total area

IEEE Transactions on Very Large Scale Integration (VLSI) Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

With computing systems undergone a fundamental transformation from single-processor devices at the turn of the century to the ubiquitous and networked devices and the warehouse-scale computing via the cloud, the parallelism has become ubiquitous at many levels. At micro level, parallelisms are being explored from the underlying circuits, to pipelining and instruction level parallelism on multi-cores or many cores on a chip as well as in a machine. From macro level, parallelisms are being promoted from multiple machines on a rack, many racks in a data center, to the globally shared infrastructure of the Internet. With the push of big data, we are entering a new era of parallel computing driven by novel and ground breaking research innovation on elastic parallelism and scalability. In this paper, we will give an overview of computing infrastructure for big data processing, focusing on architectural, storage and networking challenges of supporting big data paper. We will briefly discuss emerging computing infrastructure and technologies that are promising for improving data parallelism, task parallelism and encouraging vertical and horizontal computation parallelism.