Reducing the branch power cost in embedded processors through static scheduling, profiling and superblock formation

  • Authors:
  • Michael Hicks;Colin Egan;Bruce Christianson;Patrick Quick

  • Affiliations:
  • Compiler Technology and Computer Architecture Group (CTCA), University of Hertfordshire, Hatfield, UK;Compiler Technology and Computer Architecture Group (CTCA), University of Hertfordshire, Hatfield, UK;Compiler Technology and Computer Architecture Group (CTCA), University of Hertfordshire, Hatfield, UK;Compiler Technology and Computer Architecture Group (CTCA), University of Hertfordshire, Hatfield, UK

  • Venue:
  • ACSAC'06 Proceedings of the 11th Asia-Pacific conference on Advances in Computer Systems Architecture
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Dynamic branch predictor logic alone accounts for approximately 10% of total processor power dissipation. Recent research indicates that the power cost of a large dynamic branch predictor is offset by the power savings created by its increased accuracy. We describe a method of reducing dynamic predictor power dissipation without degrading prediction accuracy by using a combination of local delay region scheduling and run time profiling of branches. Feedback into the static code is achieved with hint bits and avoids the need for dynamic prediction for some individual branches. This method requires only minimal hardware modifications and coexists with a dynamic predictor.