Direct Instruction Wakeup for Out-of-Order Processors

  • Authors:
  • Marco A. Ramírez; Adrian Cristal; Alexander V. Veidenbaum; Luis Villa; Mateo Valero

  • Affiliations:
  • U.P.C., Barcelona, Spain / National Polytechnic Institute, México; U.P.C., Barcelona, Spain; University of California, Irvine, USA; Mexican Petroleum Institute, Mexico; U.P.C., Barcelona, Spain

  • Venue:
  • IWIA '04 Proceedings of the Innovative Architecture for Future Generation High-Performance Processors and Systems
  • Year:
  • 2004

Abstract

Instruction queues consume a significant amount of power in high-performance processors, primarily because the instruction wakeup logic accesses the queue structures. The wakeup logic delay is also a critical timing parameter. This paper proposes a new queue organization that uses a small number of successor pointers per instruction, plus a small number of dynamically allocated full successor bit vectors for instructions with more successors than the pointers can hold. The details of the new organization are described, and it is shown to achieve the performance of CAM-based or full dependency matrix organizations using just one pointer per instruction plus eight full bit vectors. Only two full bit vectors are needed when two successor pointers are stored per instruction. Finally, a design and pre-layout of all critical structures in 70 nm technology were performed for the proposed organization as well as for a CAM-based baseline. The new design is shown to use one half to one fifth of the baseline instruction queue power, depending on queue size. It is also shown to use significantly less power than the full dependency matrix based design.
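
The idea of successor pointers backed by a small pool of full bit vectors can be illustrated with a short software model. This is only a sketch under stated assumptions, not the circuit described in the paper. The abstract does not say how bit vectors are allocated or what happens when the pool is empty, so the model below assumes that a vector is allocated the first time a producer runs out of pointers, that it is freed when the producer completes, and that dispatch stalls when no vector is free. The names `Entry`, `WakeupQueue`, `insert`, and `complete` are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Entry:
    pending: int                                        # producers still outstanding
    pointers: List[int] = field(default_factory=list)   # direct successor slots
    vector: Optional[int] = None                        # index of an allocated bit vector


class WakeupQueue:
    """Toy model: a few successor pointers per entry plus a shared pool of
    full successor bit vectors (allocation policy is an assumption)."""

    def __init__(self, size, ptrs_per_entry=1, num_vectors=8):
        self.size = size
        self.ptrs_per_entry = ptrs_per_entry
        self.entries = [None] * size
        self.vectors = [0] * num_vectors            # one bit per queue slot
        self.free_vectors = list(range(num_vectors))

    def insert(self, slot, producers):
        """Dispatch an instruction into `slot`. `producers` lists the slots of
        in-flight instructions it depends on. Returns False if it must stall."""
        producers = set(producers)
        # Count producers that would need a fresh bit vector before changing state.
        needed = sum(
            1 for p in producers
            if len(self.entries[p].pointers) >= self.ptrs_per_entry
            and self.entries[p].vector is None
        )
        if needed > len(self.free_vectors):
            return False                            # assumed policy: stall dispatch
        for p in producers:
            e = self.entries[p]
            if len(e.pointers) < self.ptrs_per_entry:
                e.pointers.append(slot)             # cheap case: a direct pointer
            else:
                if e.vector is None:
                    e.vector = self.free_vectors.pop()
                self.vectors[e.vector] |= 1 << slot  # overflow case: set a bit
        self.entries[slot] = Entry(pending=len(producers))
        return True

    def complete(self, slot):
        """Producer in `slot` finishes; wake its successors directly, with no
        tag broadcast. Returns the slots that became ready."""
        e = self.entries[slot]
        successors = list(e.pointers)
        if e.vector is not None:
            bits = self.vectors[e.vector]
            successors += [i for i in range(self.size) if (bits >> i) & 1]
            self.vectors[e.vector] = 0
            self.free_vectors.append(e.vector)      # return the vector to the pool
        self.entries[slot] = None
        woken = []
        for s in successors:
            self.entries[s].pending -= 1
            if self.entries[s].pending == 0:
                woken.append(s)
        return woken
```

In this model a completing instruction touches only the entries it names, which is the property the abstract contrasts with CAM-based wakeup. The pool size and pointer count are constructor parameters so that the two configurations mentioned in the abstract (one pointer with eight vectors, two pointers with two vectors) can both be instantiated.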