A technique to determine power-efficient, high-performance superscalar processors

Authors:
T. M. Conte;K. N. P. Menezes;S. W. Sathaye
Affiliations:
-;-;-
Venue:
HICSS '95 Proceedings of the 28th Hawaii International Conference on System Sciences
Year:
1995

Citing 0
Cited 8

An investigation of the performance of various instruction-issue buffer topologies

Proceedings of the 28th annual international symposium on Microarchitecture
Custom-fit processors: letting applications define architectures

Proceedings of the 29th annual ACM/IEEE international symposium on Microarchitecture
Media architecture: general purpose vs. multiple application-specific programmable processor

DAC '98 Proceedings of the 35th annual Design Automation Conference
System-level power optimization: techniques and tools

ISLPED '99 Proceedings of the 1999 international symposium on Low power electronics and design
System-level power optimization: techniques and tools

ACM Transactions on Design Automation of Electronic Systems (TODAES)
Exploring Hypermedia Processor Design Space

Journal of VLSI Signal Processing Systems - Special issue on multimedia signal processing
Power-Performance Modeling and Tradeoff Analysis for a High End Microprocessor

PACS '00 Proceedings of the First International Workshop on Power-Aware Computer Systems-Revised Papers
New methodology for early-stage, microarchitecture-level power-performance analysis of microprocessors

IBM Journal of Research and Development

Quantified Score

Hi-index	0.00

Visualization

Abstract

Processor performance advances are increasingly inhibited by limitations in thermal power dissipation. Part of the problem is the lack of architectural power estimates before implementation. Although high-performance designs exist that dissipate low power, the method for finding these designs has been through trial-and-error. The paper presents systematic techniques to find low-power, high-performance superscalar processors tailored to specific user benchmarks. The model of power is novel because it separates power into architectural and technology components. The architectural component is found via trace-driven simulation, which also produces performance estimates. An example technology model is presented that estimates the technology component, along with critical delay time and real estate usage. This model is based on case studies of actual designs. It is used to solve an important problem: increasing the duplication in superscalar execution units without excessive power consumption. Results are presented from runs using simulated annealing to maximize processor performance subject to power and area constraints. The major contributions of the paper are the separation of architectural and technology components of dynamic power, the use of trace-driven simulation for architectural power measurement, and the use of a near-optimal search to tailor a processor design to a benchmark.