Cathedral-III: Architecture-driven high-level synthesis for high throughput DSP applications
DAC '91 Proceedings of the 28th ACM/IEEE Design Automation Conference
Architecture and programming of two generations of video signal processors
Microprocessing and Microprogramming - Special issue: parallel programmable architectures and compilation
Low-power architectural synthesis and the impact of exploiting locality
Journal of VLSI Signal Processing Systems - Special issue on technologies for wireless computing
Synthesis of application specific programmable processors
DAC '97 Proceedings of the 34th annual Design Automation Conference
A data-driven multiprocessor architecture for high throughput digital signal processing
A data-driven multiprocessor architecture for high throughput digital signal processing
Area optimization of multi-functional processing units
ICCAD '92 Proceedings of the 1992 IEEE/ACM international conference on Computer-aided design
Local Search in Combinatorial Optimization
Local Search in Combinatorial Optimization
An Approach for Quantitative Analysis of Application-Specific Dataflow Architectures
ASAP '97 Proceedings of the IEEE International Conference on Application-Specific Systems, Architectures and Processors
Recursive Bipartitioning of Signal Flow Graphs for Programmable Video Signal Processors
EDTC '96 Proceedings of the 1996 European conference on Design and Test
PROPHID: a data-driven multi-processor architecture for high-performance DSP
EDTC '97 Proceedings of the 1997 European conference on Design and Test
PHIDEO: a silicon compiler for high speed algorithms
EURO-DAC '91 Proceedings of the conference on European design automation
Performance optimization using template mapping for datapath-intensive high-level synthesis
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Cheops: a reconfigurable data-flow system for video processing
IEEE Transactions on Circuits and Systems for Video Technology
Efficient Techniques for Clustering and Scheduling onto Embedded Multiprocessors
IEEE Transactions on Parallel and Distributed Systems
Hi-index | 0.00 |
We explore the area efficiency of a class of stream-baseddataflow architectures as a function of the grain-size, for a givenset of applications. We believe the grain-size is a key parameter inbalancing flexibility and efficiency of this class ofarchitectures. We apply a clustering approach on a well-defined setof applications to derive a set of processing elements of varyinggrain-sizes. The resulting architectures are compared with respectto their silicon area. For a set of twenty-one industrially relevantvideo algorithms, we determined architectures with variousgrain-sizes. The results of this exercise indicate an improvementfactor of two for the silicon area, while changing the grain-sizefrom fine-grain to coarser-grain.