Dynamic Code Partitioning for Clustered Architectures

  • Authors:
  • Ramon Canal;Joan-Manuel Parcerisa;Antonio González

  • Affiliations:
  • Department d'Arquitectura de Computadors, Universitat Politècnica de Catalunya, Cr. Jordi Girona, 1–3 Mòdul D6, 08034 Barcelona, Spain. {rcanal, jmanel,antonio}@ac.upc.es;Department d'Arquitectura de Computadors, Universitat Politècnica de Catalunya, Cr. Jordi Girona, 1–3 Mòdul D6, 08034 Barcelona, Spain. {rcanal, jmanel,antonio}@ac.upc.es;Department d'Arquitectura de Computadors, Universitat Politècnica de Catalunya, Cr. Jordi Girona, 1–3 Mòdul D6, 08034 Barcelona, Spain. {rcanal, jmanel,antonio}@ac.upc.es

  • Venue:
  • International Journal of Parallel Programming
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

Recent works^(1) show that delays introduced in the issue and bypass logic will become critical for wide issue superscalar processors. One of the proposed solutions is clustering the processor core. Clustered architectures benefit from a less complex partitioned processor core and thus, incur in less critical delays. In this paper, we propose a dynamic instruction steering logic for these clustered architectures that decides at decode time the cluster where each instruction is executed. The performance of clustered architectures depends on the inter-cluster communication overhead and the workload balance. We present a scheme that uses runtime information to optimize the trade-off between these figures. The evaluation shows that this scheme can achieve an average speed-up of 35% over a conventional 8-way issue (4 int+4 fp) machine and that it outperforms other previous proposals, either static or dynamic.