A Compact DSP Core with Static Floating-Point Arithmetic

  • Authors:
  • Tay-Jyi Lin;Hung-Yueh Lin;Chie-Min Chao;Chih-Wei Liu;Chih-Wei Jen

  • Affiliations:
  • Department of Electronics Engineering, National Chiao Tung University, Taiwan;Department of Electronics Engineering, National Chiao Tung University, Taiwan;Department of Electronics Engineering, National Chiao Tung University, Taiwan;Department of Electronics Engineering, National Chiao Tung University, Taiwan;SoC Technology Center, Industrial Technology Research Institute, Taiwan

  • Venue:
  • Journal of VLSI Signal Processing Systems
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

A multimedia system-on-a-chip (SoC) usually contains one or more programmable digital signal processors (DSP) to accelerate data-intensive computations. But most of these DSP cores are designed originally for standalone applications, and they must have some overlapped (and redundant) components with the host microprocessor. This paper presents a compact DSP for multi-core systems, which is fully programmable and has been optimized to execute a set of signal processing kernels very efficiently. The DSP core was designed concurrently with its automatic software generator based on high-level synthesis. Moreover, it performs lightweight arithmetic--the static floating-point (SFP), which approximates the quality of floating-point (FP) operations with the hardware similar to that of the integer arithmetic. In our simulations, the compact DSP and its auto-generated software can achieve 3X performance (estimated in cycles) of those DSP cores in the dual-core baseband processors with similar computing resources. Besides, the 16-bit SFP has above 40 dB signal to round-off noise ratio over the IEEE single-precision FP, and it even outperforms the hand-optimized programs based on the 32-bit integer arithmetic. The 24-bit SFP has above 64 dB quality, of which the maximum precision is identical to that of the single-precision FP. Finally, the DSP core has been implemented and fabricated in the UMC 0.18µm 1P6M CMOS technology. It can operate at 314.5 MHz while consuming 52mW average power. The core size is only 1.5 mm脳1.5 mm including the 16 KB on-chip memory and the AMBA AHB interface.