An Exploration of Instruction Fetch Requirement in Out-of-Order Superscalar Processors

  • Authors:
  • Pierre Michaud;André/ Seznec;Sté/phan Jourdan

  • Affiliations:
  • IRISA/INRIA, Campus de Beaulieu, 35042 Rennes, France/ pmichaud@irisa.fr;IRISA/INRIA, Campus de Beaulieu, 35042 Rennes, France;Intel Corporation, MS: JF4-354, 2111 NE 25th Ave., Hillsboro, Oregon 97124

  • Venue:
  • International Journal of Parallel Programming
  • Year:
  • 2001

Quantified Score

Hi-index 0.01

Visualization

Abstract

The performance of superscalar processors depends on many parameters with correlated effects. This paper explores the relations between some of these parameters, and more particularly, the requirement in instruction fetch bandwidth. We introduce new enhancements to increase the bandwidth of conventional instruction fetch engines. However, experiments show that the performance does not increase proportionally to the fetch. Once the measured IPC is half the instruction fetch bandwidth, increasing the fetch bandwidth brings very little improvement. In order to better understand this behavior, we develop a model from the empirical observation that the available instruction parallelism grows as the square root of the instruction window size. From the model, we derive that the fetch bandwidth requirement grows as the square root of the distance between mispredicted branches. We also verify experimentally that, to double the IPC, one should both double the fetch bandwidth and decrease the number of mispredicted branches fourfold.