Exploiting the Cell/BE Architecture with the StarPU Unified Runtime System

  • Authors:
  • Cédric Augonnet;Samuel Thibault;Raymond Namyst;Maik Nijhuis

  • Affiliations:
  • INRIA Bordeaux Sud-Ouest --- LaBRI, University of Bordeaux,;INRIA Bordeaux Sud-Ouest --- LaBRI, University of Bordeaux,;INRIA Bordeaux Sud-Ouest --- LaBRI, University of Bordeaux,;Vrije Universiteit Amsterdam,

  • Venue:
  • SAMOS '09 Proceedings of the 9th International Workshop on Embedded Computer Systems: Architectures, Modeling, and Simulation
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Core specialization is currently one of the most promising ways for designing power-efficient multicore chips. However, approaching the theoretical peak performance of such heterogeneous multicore architectures with specialized accelerators, is a complex issue. While substantial effort has been devoted to efficiently offloading parts of the computation, designing an execution model that unifies all computing units is the main challenge. We therefore designed the StarPU runtime system for providing portable support for heterogeneous multicore processors to high performance applications and compiler environments. StarPU provides a high-level, unified execution model which is tightly coupled to an expressive data management library. In addition to our previous results on using multicore processors alongside with graphic processors, we show that StarPU is flexible enough to efficiently exploit the heterogeneous resources in the Cell processor. We present a scalable design supporting multiple different accelerators while minimizing the overhead on the overall system. Using experiments with classical linear algebra algorithms, we show that StarPU improves programmability and provides performance portability.