Array control for high-performance SIMD systems

  • Authors:
  • Martin C. Herbordt;Jade Cravy;Honghai Zhang

  • Affiliations:
  • Department of Electrical and Computer Engineering, 336 Photonics Center, 8 Saint Mary's Street, Boston University, Boston, MA;GDA Technologies, 2071 Junction Ave., San Jose, CA;Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA

  • Venue:
  • Journal of Parallel and Distributed Computing
  • Year:
  • 2004

Quantified Score

Hi-index 0.03

Visualization

Abstract

Although arrays of SIMD processing elements can be built with very high operating frequencies, problems exist in keeping the array busy. The inherent mismatch between host and array makes it difficult to maintain high array utilization: either the rate of instruction issue is very low or PE data locality is compromised, having the same effect. Our solution is based on an array control unit (ACU) design that expands macroinstructions in two stages, first by data tile and then into microinstructions. The expansion itself solves the issue problem; decoupling the expansion modalities maintains data locality. Several issues involving host/ACU interaction need to be resolved to effect this solution. We present experimental results showing that our approach delivers substantial improvement in memory hierarchy performance: a cache of only one-fourth the size is sufficient to achieve the same performance as previous approaches. We also describe our implementations which demonstrate that achieving gigaherz operating frequencies with current technologies is plausible.