Scalable Load and Store Processing in Latency-Tolerant Processors

  • Authors:
  • Amit Gandhi;Haitham Akkary;Ravi Rajwar;Srikanth T. Srinivasan;Konrad Lai

  • Affiliations:
  • Intel Corp.;Intel Corp.;Intel Corp.;Intel Corp.;Intel Corp.

  • Venue:
  • IEEE Micro
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

New load and store processing algorithms let memory-latency-tolerant architectures sustain thousands of in-flight instructions without scaling cycle-critical fully-associative load and store queues. These algorithms rely on redoing some stores after fetching cache miss data from memory (to fix memory dependences). Doing so provides better power and area characteristics than constantly enforcing memory dependences among a several loads and stores, many of which have unknown addresses.