A column pre-ordering strategy for the unsymmetric-pattern multifrontal method
ACM Transactions on Mathematical Software (TOMS)
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
High Resolution Forward And Inverse Earthquake Modeling on Terascale Computers
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
A Portable Programming Interface for Performance Evaluation on Modern Processors
International Journal of High Performance Computing Applications
International Journal of Parallel, Emergent and Distributed Systems
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Computer
High Performance Computing for Computational Science - VECPAR 2008
Performance evaluation of energy efficient ethernet
IEEE Communications Letters
Hardware-oriented numerics and concepts for PDE software
Future Generation Computer Systems
Energy-performance tradeoffs in processor architecture and circuit design: a marginal cost analysis
Proceedings of the 37th annual international symposium on Computer architecture
High-order finite-element seismic wave propagation modeling with MPI on a large GPU cluster
Journal of Computational Physics
Journal of Computational Physics
FEAST—realization of hardware-oriented numerics for HPC simulations with finite elements
Concurrency and Computation: Practice & Experience - International Supercomputing Conference
Making a case for a green500 list
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
The International Exascale Software Project roadmap
International Journal of High Performance Computing Applications
Towards energy efficient parallel computing on consumer electronic devices
ICT-GLOW'11 Proceedings of the First international conference on Information and communication on technology for the fight against global warming
Power Consumption of Mixed Precision in the Iterative Solution of Sparse Linear Systems
IPDPSW '11 Proceedings of the 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and PhD Forum
FTI: high performance fault tolerance interface for hybrid systems
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
GPUs and the Future of Parallel Computing
IEEE Micro
The low-power architecture approach towards exascale computing
Proceedings of the second workshop on Scalable algorithms for large-scale systems
Understanding the future of energy-performance trade-off via DVFS in HPC environments
Journal of Parallel and Distributed Computing
High-Performance Energy-Efficient Multicore Embedded Computing
IEEE Transactions on Parallel and Distributed Systems
Energy efficient scheduling of parallel tasks on multiprocessor computers
The Journal of Supercomputing
Energy efficient utilization of resources in cloud computing systems
The Journal of Supercomputing
Proactive thermal management in green datacenters
The Journal of Supercomputing
Energy-efficient high-performance parallel and distributed computing
The Journal of Supercomputing
Full-system analysis and characterization of interactive smartphone applications
IISWC '11 Proceedings of the 2011 IEEE International Symposium on Workload Characterization
Computer Science - Research and Development
Supercomputing with commodity CPUs: are mobile SoCs ready for HPC?
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
IA^3 '13 Proceedings of the 3rd Workshop on Irregular Applications: Architectures and Algorithms
Hi-index | 31.45 |
Power consumption and energy efficiency are becoming critical aspects in the design and operation of large scale HPC facilities, and it is unanimously recognised that future exascale supercomputers will be strongly constrained by their power requirements. At current electricity costs, operating an HPC system over its lifetime can already be on par with the initial deployment cost. These power consumption constraints, and the benefits a more energy-efficient HPC platform may have on other societal areas, have motivated the HPC research community to investigate the use of energy-efficient technologies originally developed for the embedded and especially mobile markets. However, lower power does not always mean lower energy consumption, since execution time often also increases. In order to achieve competitive performance, applications then need to efficiently exploit a larger number of processors. In this article, we discuss how applications can efficiently exploit this new class of low-power architectures to achieve competitive performance. We evaluate if they can benefit from the increased energy efficiency that the architecture is supposed to achieve. The applications that we consider cover three different classes of numerical solution methods for partial differential equations, namely a low-order finite element multigrid solver for huge sparse linear systems of equations, a Lattice-Boltzmann code for fluid simulation, and a high-order spectral element method for acoustic or seismic wave propagation modelling. We evaluate weak and strong scalability on a cluster of 96 ARM Cortex-A9 dual-core processors and demonstrate that the ARM-based cluster can be more efficient in terms of energy to solution when executing the three applications compared to an x86-based reference machine.