Synthesis-friendly techniques for tightly-coupled integration of hardware accelerators into shared-memory multi-core clusters

  • Authors:
  • Francesco Conti;Andrea Marongiu;Luca Benini

  • Affiliations:
  • DEI - Università di Bologna;DEI - Università di Bologna;DEI - Università di Bologna

  • Venue:
  • Proceedings of the Ninth IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Several many-core designs tackle scalability issues by leveraging tightly-coupled clusters as building blocks, where low-latency, high-bandwidth interconnection between a small/medium number of cores and L1 memory achieves high performance/watt. Tight coupling of hardware accelerators into these multi-core clusters constitutes a promising approach to further improve performance/area/watt. However, accelerators are often clocked at a lower frequency than processor clusters for energy efficiency reasons. In this paper, we propose a technique to integrate shared-memory accelerators within the tightly-coupled clusters of the STMicroelectronics STHORM architecture. Our methodology significantly relaxes timing constraints for tightly-coupled accelerators, while optimizing data bandwidth. In addition, our technique allows to operate the accelerator at an integer submultiple of the cluster frequency. Experimental results show that the proposed approach allows to recover up to 84% of the slow-down implied by reduced accelerator speed.