Quantifying the relationship between the power delivery network and architectural policies in a 3D-stacked memory device

Authors:
Manjunath Shevgoor;Jung-Sik Kim;Niladrish Chatterjee;Rajeev Balasubramonian;Al Davis;Aniruddha N. Udipi
Affiliations:
University Of Utah;Samsung Electronics;University Of Utah;University Of Utah;University Of Utah;ARM R&D
Venue:
Proceedings of the 46th Annual IEEE/ACM International Symposium on Microarchitecture
Year:
2013

Citing 24
Cited 0

Scheduling and page migration for multiprocessor compute servers

ASPLOS VI Proceedings of the sixth international conference on Architectural support for programming languages and operating systems
Reducing cache misses using hardware and software page placement

ICS '99 Proceedings of the 13th international conference on Supercomputing
Power aware page allocation

ASPLOS IX Proceedings of the ninth international conference on Architectural support for programming languages and operating systems
Control Techniques to Eliminate Voltage Emergencies in High Performance Processors

HPCA '03 Proceedings of the 9th International Symposium on High-Performance Computer Architecture
Pipeline damping: a microarchitectural technique to reduce inductive noise in supply voltage

Proceedings of the 30th annual international symposium on Computer architecture
Exploiting Resonant Behavior to Reduce Inductive Noise

Proceedings of the 31st annual international symposium on Computer architecture
Eliminating voltage emergencies via microarchitectural voltage control feedback and dynamic optimization

Proceedings of the 2004 international symposium on Low power electronics and design
Page migration with dynamic space-sharing scheduling policies: the case of the SGI 02000

International Journal of Parallel Programming - Special issue II: The 17th annual international conference on supercomputing (ICS'03)
The importance of adopting a package-aware chip design flow

Proceedings of the 43rd annual Design Automation Conference
Managing Distributed, Shared L2 Caches through OS-Level Page Allocation

Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture
Understanding voltage variations in chip multiprocessors using a distributed power-delivery network

Proceedings of the conference on Design, automation and test in Europe
Towards a software approach to mitigate voltage emergencies

ISLPED '07 Proceedings of the 2007 international symposium on Low power electronics and design
Power grid analysis benchmarks

Proceedings of the 2008 Asia and South Pacific Design Automation Conference
DRAM Circuit Design: Fundamental and High-Speed Topics

DRAM Circuit Design: Fundamental and High-Speed Topics
PDRAM: a hybrid PRAM and DRAM main memory system

Proceedings of the 46th Annual Design Automation Conference
Rethinking DRAM design and organization for energy-constrained multi-cores

Proceedings of the 37th annual international symposium on Computer architecture
Handling the problems and opportunities posed by multiple on-chip memory controllers

Proceedings of the 19th international conference on Parallel architectures and compilation techniques
Fabrication cost analysis and cost-aware design space exploration for 3-D ICs

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Page placement in hybrid memory systems

Proceedings of the international conference on Supercomputing
Preventing PCM banks from seizing too much power

Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture
Design Techniques to Facilitate Processor Power Delivery in 3-D Processor-DRAM Integrated Systems

IEEE Transactions on Very Large Scale Integration (VLSI) Systems
A case for exploiting subarray-level parallelism (SALP) in DRAM

Proceedings of the 39th Annual International Symposium on Computer Architecture
Buffer-on-board memory systems

Proceedings of the 39th Annual International Symposium on Computer Architecture
FPB: Fine-grained Power Budgeting to Improve Write Throughput of Multi-level Cell Phase Change Memory

MICRO-45 Proceedings of the 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture

Quantified Score

Hi-index	0.00

Visualization

Abstract

Many of the pins on a modern chip are used for power delivery. If fewer pins were used to supply the same current, the wires and pins used for power delivery would have to carry larger currents over longer distances. This results in an "IR-drop" problem, where some of the voltage is dropped across the long resistive wires making up the power delivery network, and the eventual circuits experience fluctuations in their supplied voltage. The same problem also manifests if the pin count is the same, but the current draw is higher. IR-drop can be especially problematic in 3D DRAM devices because (i) low cost (few pins and TSVs) is a high priority, (ii) 3D-stacking increases current draw within the package without providing proportionate room for more pins, and (iii) TSVs add to the resistance of the power delivery network. This paper is the first to characterize the relationship between the power delivery network and the maximum supported activity in a 3D-stacked DRAM memory device. The design of the power delivery network determines if some banks can handle less activity than others. It also determines the combinations of bank activities that are permissible. Both of these attributes can feed into architectural policies. For example, if some banks can handle more activities than others, the architecture benefits by placing data from high-priority threads or data from frequently accessed pages into those banks. The memory controller can also derive higher performance if it schedules requests to specific combinations of banks that do not violate the IR-drop constraint. We first define an IR-drop-aware scheduler that encodes a number of activity constraints. This scheduler, however, falls short of the performance of an unrealistic ideal PDN that imposes no scheduling constraints by 4.6x. By addressing starvation phenomena in the scheduler, the gap is reduced to only 1.47x. Finally, by adding a dynamic page placement policy, performance is within 1.2x of the unrealistic ideal PDN. We thus show that architectural polices can help mitigate the limitations imposed by a cost constrained design.