The NAS parallel benchmarks—summary and preliminary results
Proceedings of the 1991 ACM/IEEE conference on Supercomputing
Distributed snapshots: determining global states of distributed systems
ACM Transactions on Computer Systems (TOCS)
Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture
Just-in-time dynamic voltage scaling: Exploiting inter-node slack to save energy in MPI programs
Journal of Parallel and Distributed Computing
Adagio: making DVS practical for complex HPC applications
Proceedings of the 23rd international conference on Supercomputing
International Journal of High Performance Computing Applications
Euro-Par'11 Proceedings of the 17th international conference on Parallel processing - Volume Part I
Uncoordinated Checkpointing Without Domino Effect for Send-Deterministic MPI Applications
IPDPS '11 Proceedings of the 2011 IEEE International Parallel & Distributed Processing Symposium
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Towards a novel smart and energy-aware service-oriented manager for extreme-scale applications
IGCC '12 Proceedings of the 2012 International Green Computing Conference (IGCC)
Hi-index | 0.00 |
The constant demand of raw performance in high-performance computing (HPC) often leads to over-provisioning in high-performance systems which in turn can result in a colossal energy waste due to workload/application variation over time. Proposing energy efficient solutions in the context of large-scale HPC is a real, unavoidable challenge. This article explores two alternative approaches (with or without knowledge of applications and services) dealing with the same goal: reducing the energy usage of large-scale infrastructures which support HPC applications. This article describes the first approach, with knowledge of applications and services, which enables users to choose the less consuming implementation of services. Based on the energy consumption estimation of the different implementations (protocols) for each service, this approach is validated on the case of fault tolerance service in HPC. The 'without knowledge' approach allows some intelligent framework to observe the life of HPC systems and proposes some energy reduction schemes. This framework automatically estimates the energy consumption of the HPC system in order to apply power saving schemes. Both approaches are experimentally evaluated and analysed in terms of energy efficiency.