Data pipeline optimization for shared memory multiple-SIMD architecture

  • Authors:
  • Weihua Zhang;Tao Bao;Binyu Zang;Chuanqi Zhu

  • Affiliations:
  • Parallel Processing Institute, Fudan University, Shanghai, China and Key Laboratory of Computer System and Architecture, Institute of Computing Technology, Chinese Academy of Sciences;Parallel Processing Institute, Fudan University, Shanghai, China;Parallel Processing Institute, Fudan University, Shanghai, China;Parallel Processing Institute, Fudan University, Shanghai, China

  • Venue:
  • LCPC'06 Proceedings of the 19th international conference on Languages and compilers for parallel computing
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

The rapid growth of multimedia applications has been putting high pressure on the processing capability of modern processors, which leads to more and more modern multimedia processors employing parallel single instruction multiple data (SIMD) units to achieve high performance. In embedded system on chips (SOCs), shared memory multiple-SIMD architecture becomes popular because of its less power consumption and smaller chip size. In order to match the properties of some multimedia applications, there are interconnections among multiple SIMD units. In this paper, we present a novel program transformation technique to exploit parallel and pipelined computing power of modern shared-memory multiple-SIMD architecture. This optimizing technique can greatly reduce the conflict of shared data bus and improve the performance of applications with inherent data pipeline characteristics. Experimental results show that our method provides impressive speedup. For a shared memory multiple-SIMD architecture with 8 SIMD units, this method obtains more than 3.6X speedup for the multimedia programs.