Synthesis of pipelined memory access controllers for streamed data applications on FPGA-based computing engines

Authors:
Joonseok Park;Pedro C. Diniz
Affiliations:
University of Southern California, Marina del Rey, CA;University of Southern California, Marina del Rey, CA
Venue:
Proceedings of the 14th international symposium on Systems synthesis
Year:
2001

Citing 10
Cited 15

Dataflow-driven memory allocation for multi-dimensional signal processing systems

ICCAD '94 Proceedings of the 1994 IEEE/ACM international conference on Computer-aided design
Synthesis of application-specific memory designs

IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Exploiting off-chip memory access modes in high-level synthesis

ICCAD '97 Proceedings of the 1997 IEEE/ACM international conference on Computer-aided design
High-level address optimization and synthesis techniques for data-transfer-intensive applications

IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Minimizing the required memory bandwidth in VLSI system realizations

IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Data and memory optimization techniques for embedded systems

ACM Transactions on Design Automation of Electronic Systems (TODAES)
Memory Access Optimization and RAM Inference for Pipeline Vectorization

FPL '99 Proceedings of the 9th International Workshop on Field-Programmable Logic and Applications
Automatic Allocation of Arrays to Memories in FPGA Processors with Multiple Memory Banks

FCCM '99 Proceedings of the Seventh Annual IEEE Symposium on Field-Programmable Custom Computing Machines
Automatic Synthesis of Data Storage and Control Structures for FPGA-Based Computing Engines

FCCM '00 Proceedings of the 2000 IEEE Symposium on Field-Programmable Custom Computing Machines
Bridging the gap between compilation and synthesis in the DEFACTO system

LCPC'01 Proceedings of the 14th international conference on Languages and compilers for parallel computing

Data reorganization engines for the next generation of system-on-a-chip FPGAs

FPGA '02 Proceedings of the 2002 ACM/SIGDA tenth international symposium on Field-programmable gate arrays
Unifying memory and processor wrapper architecture in multiprocessor SoC design

Proceedings of the 15th international symposium on System Synthesis
Performance and Area Modeling of Complete FPGA Designs in the Presence of Loop Transformations

IEEE Transactions on Computers
Pipelined Memory Controllers for DSP Applications Handling Unpredictable Data Accesses

ISVLSI '05 Proceedings of the IEEE Computer Society Annual Symposium on VLSI: New Frontiers in VLSI Design
SOMA: a tool for synthesizing and optimizing memory accesses in ASICs

CODES+ISSS '05 Proceedings of the 3rd IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis
Transformation synthesis for data intensive applications to FPGAs

GLSVLSI '06 Proceedings of the 16th ACM Great Lakes symposium on VLSI
A reconfigurable HW/SW platform for computation intensive high-resolution real-time digital film applications

Proceedings of the conference on Design, automation and test in Europe: Proceedings
A high-end real-time digital film processing reconfigurable platform

EURASIP Journal on Embedded Systems
Master Interface for On-chip Hardware Accelerator Burst Communications

Journal of VLSI Signal Processing Systems
An FPGA-based computation model for blocked algorithms

AIC'06 Proceedings of the 6th WSEAS International Conference on Applied Informatics and Communications
Computation of the minimum data storage and applications in memory management for multimedia signal processing

Integrated Computer-Aided Engineering
Compiling for reconfigurable computing: A survey

ACM Computing Surveys (CSUR)
Bridging the gap between compilation and synthesis in the DEFACTO system

LCPC'01 Proceedings of the 14th international conference on Languages and compilers for parallel computing
Dynamic memory access management for high-performance DSP applications using high-level synthesis

IEEE Transactions on Very Large Scale Integration (VLSI) Systems
An efficient hardware architecture from c program with memory access to hardware

ICCSA'10 Proceedings of the 2010 international conference on Computational Science and Its Applications - Volume Part II

Quantified Score

Hi-index	0.00

Visualization

Abstract

Commercially available behavioral synthesis tools do not adequately support FPGA vendor-specific external memory interfaces making it extremely difficult to exploit pipelined memory access modes as well as application specific memory operations scheduling critical for high-performance solutions. This lack of support substantially increases the complexity and the burden on designers in the mapping of applications to FPGA-based computing engines. In this paper we address the problem of external memory interfacing and aggressive scheduling of memory operations by proposing a decoupled architecture with two components - one component captures the specific target architecture timing while the other component uses application specific memory access pattern information. Our results support the claim that it is possible to exploit application specific information and integrate that knowledge into custom schedulers that mix pipelined and non-pipelined access modes aimed at reducing the overhead associated with external memory accesses. The results also reveal that the additional design complexity of the scheduler, and its impact in the overall design is minimal.