A holistic approach for tightly coupled reconfigurable parallel processors

Authors:
Hritam Dutta;Dmitrij Kissler;Frank Hannig;Alexey Kupriyanov;Jürgen Teich;Bernard Pottier
Affiliations:
Hardware/Software Co-Design, Department of Computer Science, University of Erlangen-Nuremberg, Germany;Hardware/Software Co-Design, Department of Computer Science, University of Erlangen-Nuremberg, Germany;Hardware/Software Co-Design, Department of Computer Science, University of Erlangen-Nuremberg, Germany;Hardware/Software Co-Design, Department of Computer Science, University of Erlangen-Nuremberg, Germany;Hardware/Software Co-Design, Department of Computer Science, University of Erlangen-Nuremberg, Germany;Architectures et Systèmes, Université de Bretagne Occidentale, Brest, France
Venue:
Microprocessors & Microsystems
Year:
2009

Citing 13
Cited 3

Resource constrained scheduling of uniform algorithms

Journal of VLSI Signal Processing Systems
Compaan: deriving process networks from Matlab for embedded signal processing architectures

CODES '00 Proceedings of the eighth international workshop on Hardware/software codesign
MorphoSys: An Integrated Reconfigurable System for Data-Parallel and Computation-Intensive Applications

IEEE Transactions on Computers
Loop Parallelization in the Polytope Model

CONCUR '93 Proceedings of the 4th International Conference on Concurrency Theory
High-Level Synthesis of Nonprogrammable Hardware Accelerators

ASAP '00 Proceedings of the IEEE International Conference on Application-Specific Systems, Architectures, and Processors
PACT XPP—A Self-Reconfigurable Data Processing Architecture

The Journal of Supercomputing
SPARK: A High-Lev l Synthesis Framework For Applying Parallelizing Compiler Transformations

VLSID '03 Proceedings of the 16th International Conference on VLSI Design
Automatic compilation to a coarse-grained reconfigurable system-opn-chip

ACM Transactions on Embedded Computing Systems (TECS)
Dynamic Piecewise Linear/Regular Algorithms

PARELEC '04 Proceedings of the international conference on Parallel Computing in Electrical Engineering
Hierarchical Partitioning for Piecewise Linear Algorithms

PARELEC '06 Proceedings of the international symposium on Parallel Computing in Electrical Engineering
Efficient control generation for mapping nested loop programs onto processor arrays

Journal of Systems Architecture: the EUROMICRO Journal
Efficient event-driven simulation of parallel processor architectures

SCOPES '07 Proceedingsof the 10th international workshop on Software & compilers for embedded systems
Controller synthesis for mapping partitioned programs on array architectures

ARCS'06 Proceedings of the 19th international conference on Architecture of Computing Systems

Modern development methods and tools for embedded reconfigurable systems: A survey

Integration, the VLSI Journal
Improving performance of nested loops on reconfigurable array processors

ACM Transactions on Architecture and Code Optimization (TACO) - HIPEAC Papers
A direct method for optimal VLSI realization of deeply nested n-D loop problems

Microprocessors & Microsystems

Quantified Score

Hi-index	0.00

Visualization

Abstract

New standards in signal, multimedia, and network processing for embedded electronics are characterized by computationally intensive algorithms, high flexibility due to the swift change in specifications. In order to meet demanding challenges of increasing computational requirements and stringent constraints on area and power consumption in fields of embedded engineering, there is a gradual trend towards coarse-grained parallel embedded processors. Furthermore, such processors are enabled with dynamic reconfiguration features for supporting time- and space-multiplexed execution of the algorithms. However, the formidable problem in efficient mapping of applications (mostly loop algorithms) onto such architectures has been a hindrance in their mass acceptance. In this paper we present (a) a highly parameterizable, tightly coupled, and reconfigurable parallel processor architecture together with the corresponding power breakdown and reconfiguration time analysis of a case study application, (b) a retargetable methodology for mapping of loop algorithms, (c) a co-design framework for modeling, simulation, and programming of such architectures, and (d) loosely coupled communication with host processor.