Synthesis of Platform Architectures from OpenCL Programs

Authors:
Muhsen Owaida; Nikolaos Bellas; Konstantis Daloukas; Christos D. Antonopoulos
Venue:
FCCM '11 Proceedings of the 2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines
Year:
2011

Abstract

The automatic generation of hardware modules from a high-level representation of an application has been at the forefront of research in recent years. In this paper, we use OpenCL, an industry-supported standard for writing programs that execute on multicore platforms and accelerators such as GPUs, as that high-level representation. Our architectural synthesis tool, SOpenCL (Silicon-OpenCL), adapts OpenCL into a novel hardware design flow that efficiently maps the coarse- and fine-grained parallelism of an application onto an FPGA reconfigurable fabric. SOpenCL is based on two components: a source-to-source code transformation step that coarsens the fine-grained parallelism of OpenCL into a series of nested loops, and a template-based hardware generation back-end that configures the accelerator according to the application's functionality and its performance and area requirements. Our experiments with a variety of OpenCL and C kernel benchmarks show that area-, throughput-, and frequency-optimized hardware implementations are attainable with SOpenCL.
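
The coarsening step mentioned in the abstract can be illustrated with a minimal sketch in plain C. It is not SOpenCL's actual output: the vector-add kernel, the function names (vadd_work_item, vadd_coarsened, vadd_coarsened_self_check), and the one-dimensional index space are all assumptions chosen for illustration, and barrier handling is omitted entirely. The idea shown is only that a kernel body written for a single work-item can be wrapped in nested loops over work-groups and the work-items inside each group.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical per-work-item body. In OpenCL this would be a kernel such as
 *   __kernel void vadd(__global const float *a, __global const float *b,
 *                      __global float *c)
 *   { size_t i = get_global_id(0); c[i] = a[i] + b[i]; }
 * Plain C has no get_global_id(), so the global ID is passed explicitly. */
static void vadd_work_item(const float *a, const float *b, float *c, size_t gid)
{
    c[gid] = a[gid] + b[gid];
}

/* Coarsened form (illustrative only): the implicit parallelism over
 * work-items becomes two explicit nested loops, the outer one over
 * work-groups and the inner one over the work-items of a group. */
static void vadd_coarsened(const float *a, const float *b, float *c,
                           size_t num_groups, size_t local_size)
{
    assert(a != NULL && b != NULL && c != NULL);
    for (size_t group = 0; group < num_groups; ++group) {
        for (size_t lid = 0; lid < local_size; ++lid) {
            /* Same mapping OpenCL uses in one dimension with no offset:
             * global ID = group ID * local size + local ID. */
            size_t gid = group * local_size + lid;
            vadd_work_item(a, b, c, gid);
        }
    }
}

/* Small self-check: returns 1 if every output element equals a[i] + b[i]. */
static int vadd_coarsened_self_check(void)
{
    enum { NUM_GROUPS = 4, LOCAL_SIZE = 8, N = NUM_GROUPS * LOCAL_SIZE };
    float a[N], b[N], c[N];
    for (size_t i = 0; i < (size_t)N; ++i) {
        a[i] = (float)i;
        b[i] = 2.0f * (float)i;
        c[i] = -1.0f; /* sentinel so untouched elements are detected */
    }
    vadd_coarsened(a, b, c, NUM_GROUPS, LOCAL_SIZE);
    for (size_t i = 0; i < (size_t)N; ++i) {
        if (c[i] != 3.0f * (float)i) {
            return 0;
        }
    }
    return 1;
}
```

In a sketch like this the loop nest is what a hardware back-end would consume in place of the original work-item view. A real transformation would also need to split the inner loop at barriers and handle multi-dimensional index spaces, neither of which is shown here.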