Energy efficiency in high-performance computing with and without knowledge of applications and services

Authors:
Mohammed Em Diouri;Ghislain L. Tsafack Chetsa;Olivier Glück;Laurent Lefèvre;Jean-Marc Pierson;Patricia Stolf;Georges Da Costa
Affiliations:
INRIA Avalon Team, LIP Laboratory (UMR CNRS, ENS, INRIA, UCB), Ecole Normale Supérieure de Lyon, Université de Lyon, France;INRIA Avalon Team, LIP Laboratory (UMR CNRS, ENS, INRIA, UCB), Ecole Normale Supérieure de Lyon, Université de Lyon, France, IRIT (UMR CNRS), University of Toulouse, Toulouse, France;INRIA Avalon Team, LIP Laboratory (UMR CNRS, ENS, INRIA, UCB), Ecole Normale Supérieure de Lyon, Université de Lyon, France;INRIA Avalon Team, LIP Laboratory (UMR CNRS, ENS, INRIA, UCB), Ecole Normale Supérieure de Lyon, Université de Lyon, France;IRIT (UMR CNRS), University of Toulouse, Toulouse, France;IRIT (UMR CNRS), University of Toulouse, Toulouse, France;IRIT (UMR CNRS), University of Toulouse, Toulouse, France
Venue:
International Journal of High Performance Computing Applications
Year:
2013

Citing 10
Cited 0

The NAS parallel benchmarks—summary and preliminary results

Proceedings of the 1991 ACM/IEEE conference on Supercomputing
Distributed snapshots: determining global states of distributed systems

ACM Transactions on Computer Systems (TOCS)
Live, Runtime Phase Monitoring and Prediction on Real Systems with Application to Dynamic Power Management

Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture
Just-in-time dynamic voltage scaling: Exploiting inter-node slack to save energy in MPI programs

Journal of Parallel and Distributed Computing
Adagio: making DVS practical for complex HPC applications

Proceedings of the 23rd international conference on Supercomputing
Energy and performance characteristics of different parallel implementations of scientific applications on multicore systems

International Journal of High Performance Computing Applications
On the use of cluster-based partial message logging to improve fault tolerance for MPI HPC applications

Euro-Par'11 Proceedings of the 17th international conference on Parallel processing - Volume Part I
Uncoordinated Checkpointing Without Domino Effect for Send-Deterministic MPI Applications

IPDPS '11 Proceedings of the 2011 IEEE International Parallel & Distributed Processing Symposium
Fine-grained dynamic voltage and frequency scaling for precise energy and performance tradeoff based on the ratio of off-chip access to on-chip computation times

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Towards a novel smart and energy-aware service-oriented manager for extreme-scale applications

IGCC '12 Proceedings of the 2012 International Green Computing Conference (IGCC)

Quantified Score

Hi-index	0.00

Visualization

Abstract

The constant demand of raw performance in high-performance computing (HPC) often leads to over-provisioning in high-performance systems which in turn can result in a colossal energy waste due to workload/application variation over time. Proposing energy efficient solutions in the context of large-scale HPC is a real, unavoidable challenge. This article explores two alternative approaches (with or without knowledge of applications and services) dealing with the same goal: reducing the energy usage of large-scale infrastructures which support HPC applications. This article describes the first approach, with knowledge of applications and services, which enables users to choose the less consuming implementation of services. Based on the energy consumption estimation of the different implementations (protocols) for each service, this approach is validated on the case of fault tolerance service in HPC. The 'without knowledge' approach allows some intelligent framework to observe the life of HPC systems and proposes some energy reduction schemes. This framework automatically estimates the energy consumption of the HPC system in order to apply power saving schemes. Both approaches are experimentally evaluated and analysed in terms of energy efficiency.