Exploring hardware overprovisioning in power-constrained, high performance computing

Authors:
Tapasya Patki;David K. Lowenthal;Barry Rountree;Martin Schulz;Bronis R. de Supinski
Affiliations:
The University of Arizona, Tucson, AZ, USA;The University of Arizona, Tucson, AZ, USA;Lawrence Livermore National Laboratory, Livermore, CA, USA;Lawrence Livermore National Laboratory, Livermore, CA, USA;Lawrence Livermore National Laboratory, Livermore, CA, USA
Venue:
Proceedings of the 27th ACM International Conference on Supercomputing
Year:
2013

Abstract

Most recent research in power-aware supercomputing has focused on making individual nodes more efficient and measuring the results in terms of flops per watt. While this work is vital in order to reach exascale computing at 20 megawatts, there has been a dearth of work that explores efficiency at the whole system level. Traditional approaches in supercomputer design use worst-case power provisioning: the total power allocated to the system is determined by the maximum power draw possible per node. In a world where power is plentiful and nodes are scarce, this solution is optimal. However, as power becomes the limiting factor in supercomputer design, worst-case provisioning becomes a drag on performance. In this paper we demonstrate how a policy of overprovisioning hardware with respect to power combined with intelligent, hardware-enforced power bounds consistently leads to greater performance across a range of standard benchmarks. In particular, leveraging overprovisioning requires that applications use effective configurations; the best configuration depends on application scalability and memory contention. We show that using overprovisioning leads to an average speedup of more than 50% over worst-case provisioning.
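The provisioning arithmetic described in the abstract can be illustrated with a minimal sketch. It is not the authors' method or model: the function name `nodes_within_bound` and every wattage below are hypothetical values chosen only to show how a lower per-node power cap lets more nodes fit under the same system-wide power bound.

```python
def nodes_within_bound(system_power_w, per_node_w):
    """Return how many nodes fit under a system power bound.

    system_power_w: total power available to the machine, in watts.
    per_node_w: power budgeted to each node, in watts. Under worst-case
        provisioning this is the node's maximum possible draw; under
        overprovisioning it is a lower, hardware-enforced cap.
    """
    if system_power_w < 0 or per_node_w <= 0:
        raise ValueError("power values must be positive")
    return system_power_w // per_node_w


if __name__ == "__main__":
    # All numbers are hypothetical and for illustration only.
    system_bound_w = 10_000   # assumed system-wide power bound
    node_peak_w = 200         # assumed maximum draw of one node
    node_cap_w = 125          # assumed hardware-enforced per-node cap

    worst_case = nodes_within_bound(system_bound_w, node_peak_w)
    overprovisioned = nodes_within_bound(system_bound_w, node_cap_w)

    print("worst-case provisioning:", worst_case, "nodes at full power")
    print("overprovisioning:", overprovisioned, "nodes at capped power")
```

The sketch only counts nodes. Whether the larger number of capped nodes actually runs an application faster depends, as the abstract notes, on the chosen configuration, application scalability, and memory contention.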