Automated Mapping of the MapReduce Pattern onto Parallel Computing Platforms

Authors:
Qiang Liu;Tim Todman;Wayne Luk;George A. Constantinides
Affiliations:
Department of Computing, Imperial College London, London, UK SW7 2AZ;Department of Computing, Imperial College London, London, UK SW7 2AZ;Department of Computing, Imperial College London, London, UK SW7 2AZ;Department of Electrical Engineering, Imperial College London, London, UK SW7 2AZ
Venue:
Journal of Signal Processing Systems
Year:
2012

Citing 11
Cited 0

Software pipelining

ACM Computing Surveys (CSUR)
Loop Parallelization

Loop Parallelization
Maximizing Multiprocessor Performance with the SUIF Compiler

Computer
A Technique for FPGA Synthesis Driven by Automatic Source Code Analysis and Transformations

FPL '02 Proceedings of the Reconfigurable Computing Is Going Mainstream, 12th International Conference on Field-Programmable Logic and Applications
Single-Dimension Software Pipelining for Multi-Dimensional Loops

Proceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization
Convex Optimization

Convex Optimization
A Geometric Programming Framework for Optimal Multi-Level Tiling

Proceedings of the 2004 ACM/IEEE conference on Supercomputing
MapReduce: simplified data processing on large clusters

OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Map-reduce as a Programming Model for Custom Computing Machines

FCCM '08 Proceedings of the 2008 16th International Symposium on Field-Programmable Custom Computing Machines
Outer loop pipelining for application specific datapaths in FPGAs

IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Hierarchical algorithm partitioning at system level for an improved utilization of memory structures

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

The MapReduce pattern can be found in many important applications, and can be exploited to significantly improve system parallelism. Unlike previous work, in which designers explicitly specify how to exploit the pattern, we develop a compilation approach for mapping applications with the MapReduce pattern automatically onto Field-Programmable Gate Array (FPGA) based parallel computing platforms. We formulate the problem of mapping the MapReduce pattern to hardware as a geometric programming model; this model exploits loop-level parallelism and pipelining to give an optimal implementation on given hardware resources. The approach is capable of handling single and multiple nested MapReduce patterns. Furthermore, we explore important variations of MapReduce, such as using a linear structure rather than a tree structure for merging intermediate results generated in parallel. Results for six benchmarks show that our approach can find performance-optimal designs in the design space, improving system performance by up to 170 times compared to the initial designs on the target platform.