Synctium: a Near-Threshold Stream Processor for Energy-Constrained Parallel Applications

Authors:
Evgeni Krimer;Robert Pawlowski;Mattan Erez;Patrick Chiang
Affiliations:
University of Texas at Austin, Austin;Oregon State University, Corvallis;University of Texas at Austin, Austin;Oregon State University, Corvallis
Venue:
IEEE Computer Architecture Letters
Year:
2010

Abstract

While Moore’s law scaling continues to double transistor density every technology generation, supply voltage reduction has essentially stopped, increasing both power density and total energy consumed in conventional microprocessors. Therefore, future processors will require an architecture that can: a) take advantage of the massive number of transistors that will be available; and b) operate these transistors in the near-threshold supply domain, thereby achieving near-optimal energy per computation by balancing leakage and dynamic energy consumption. Unfortunately, this optimality is typically achieved while running at very low frequencies (i.e., 0.1 to 10 MHz) and with only one computation executing per cycle, so performance is limited. Further, near-threshold designs suffer from severe process variability that can introduce extremely large delay variations. In this paper, we propose a near-energy-optimal stream processor family that relies on massively parallel, near-threshold VLSI circuits and interconnect, incorporating cooperative circuit/architecture techniques to tolerate the expected large delay variations. Initial estimates from circuit simulations show that it is possible to achieve greater than 1 giga-operation per second (1 GOP/s) with less than 1 mW total power consumption, enabling a new class of energy-constrained, high-throughput computing applications.
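
As a rough sanity check on the figures quoted in the abstract, the Python sketch below works out what they imply. It is illustrative arithmetic only, not the authors' model: the function names are hypothetical, and it assumes that every parallel lane completes exactly one operation per clock cycle, which the abstract does not state for the proposed design.

```python
def energy_per_op_joules(power_watts, ops_per_second):
    """Average energy spent per operation at a given power and throughput."""
    return power_watts / ops_per_second


def lanes_needed(target_ops_per_second, clock_hz, ops_per_lane_per_cycle=1):
    """Parallel lanes required to reach a target throughput.

    Assumption (not from the paper): each lane retires
    `ops_per_lane_per_cycle` operations every clock cycle.
    Integer inputs are expected so the ceiling division stays exact.
    """
    per_lane = clock_hz * ops_per_lane_per_cycle
    # Ceiling division using integer arithmetic only.
    return -(-target_ops_per_second // per_lane)


if __name__ == "__main__":
    # Abstract's target: more than 1 GOP/s at less than 1 mW.
    budget = energy_per_op_joules(1e-3, 1e9)
    print(f"energy budget per operation: {budget * 1e12:.2f} pJ")

    # Abstract's near-threshold clock range: 0.1 MHz to 10 MHz.
    for clock_hz in (100_000, 10_000_000):
        lanes = lanes_needed(1_000_000_000, clock_hz)
        print(f"{clock_hz / 1e6:g} MHz clock -> {lanes} lanes for 1 GOP/s")
```

Under these assumptions, the power and throughput targets correspond to a budget of about 1 pJ per operation, and reaching 1 GOP/s takes on the order of 100 lanes at 10 MHz or 10,000 lanes at 0.1 MHz. This illustrates why the abstract pairs near-threshold operation with massive parallelism.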