Wattch: a framework for architectural-level power analysis and optimizations
Proceedings of the 27th annual international symposium on Computer architecture
Process cruise control: event-driven clock scaling for dynamic power management
CASES '02 Proceedings of the 2002 international conference on Compilers, architecture, and synthesis for embedded systems
The case for power management in web servers
Power aware computing
PowerScope: A Tool for Profiling the Energy Usage of Mobile Applications
WMCSA '99 Proceedings of the Second IEEE Workshop on Mobile Computer Systems and Applications
IPDPS '03 Proceedings of the 17th International Symposium on Parallel and Distributed Processing
Computer Architecture: A Quantitative Approach
Computer Architecture: A Quantitative Approach
Runtime Power Monitoring in High-End Processors: Methodology and Empirical Data
Proceedings of the 36th annual IEEE/ACM International Symposium on Microarchitecture
A Power-Aware Run-Time System for High-Performance Computing
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
MegaProto: 1 TFlops/10kW Rack Is Feasible Even with Only Commodity Technology
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
Power phase variation in a commercial server workload
Proceedings of the 2006 international symposium on Low power electronics and design
Memory-miser: a performance-constrained runtime system for power-scalable clusters
Proceedings of the 4th international conference on Computing frontiers
Analyzing the Energy-Time Trade-Off in High-Performance Computing Applications
IEEE Transactions on Parallel and Distributed Systems
Effective runtime scalability metric to measure productivity in high performance computing systems
Proceedings of the 5th conference on Computing frontiers
Analysis of dynamic power management on multi-core processors
Proceedings of the 22nd annual international conference on Supercomputing
Just-in-time dynamic voltage scaling: Exploiting inter-node slack to save energy in MPI programs
Journal of Parallel and Distributed Computing
Peak power control algorithm for multi-processor SoC
Proceedings of the 3rd International Conference on Ubiquitous Information Management and Communication
Energy Profiling and Analysis of the HPC Challenge Benchmarks
International Journal of High Performance Computing Applications
Multi-facet approach to reduce energy consumption in clouds and grids: the GREEN-NET framework
Proceedings of the 1st International Conference on Energy-Efficient Computing and Networking
Runtime Energy Adaptation with Low-Impact Instrumented Code in a Power-Scalable Cluster System
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
ACM Transactions on Reconfigurable Technology and Systems (TRETS)
Conjugate gradient sparse solvers: performance-power characteristics
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Making a case for a green500 list
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
Evaluating Parallel I/O Energy Efficiency
GREENCOM-CPSCOM '10 Proceedings of the 2010 IEEE/ACM Int'l Conference on Green Computing and Communications & Int'l Conference on Cyber, Physical and Social Computing
Designing Energy Efficient Communication Runtime Systems for Data Centric Programming Models
GREENCOM-CPSCOM '10 Proceedings of the 2010 IEEE/ACM Int'l Conference on Green Computing and Communications & Int'l Conference on Cyber, Physical and Social Computing
Understanding Power Measurement Implications in the Green500 List
GREENCOM-CPSCOM '10 Proceedings of the 2010 IEEE/ACM Int'l Conference on Green Computing and Communications & Int'l Conference on Cyber, Physical and Social Computing
Environment-conscious scheduling of HPC applications on distributed Cloud-oriented data centers
Journal of Parallel and Distributed Computing
An effective speedup metric for measuring productivity in large-scale parallel computer systems
The Journal of Supercomputing
The Survey and Future Evolution of Green Computing
GREENCOM '11 Proceedings of the 2011 IEEE/ACM International Conference on Green Computing and Communications
A generic software partitioning algorithm for pervasive computing
WASA'06 Proceedings of the First international conference on Wireless Algorithms, Systems, and Applications
Characterizing the performance and energy attributes of scientific simulations
ICCS'06 Proceedings of the 6th international conference on Computational Science - Volume Part I
Compiler-Directed energy-time tradeoff in MPI programs on DVS-Enabled parallel systems
ISPA'06 Proceedings of the 4th international conference on Parallel and Distributed Processing and Applications
From the origins of performance evaluation to new green ICT performance engineering
PERFORM'10 Proceedings of the 2010 IFIP WG 6.3/7.3 international conference on Performance Evaluation of Computer and Communication Systems: milestones and future challenges
Energy based performance tuning for large scale high performance computing systems
Proceedings of the 2012 Symposium on High Performance Computing
Flexible workload generation for HPC cluster efficiency benchmarking
Computer Science - Research and Development
Determine energy-saving potential in wait-states of large-scale parallel programs
Computer Science - Research and Development
Designing energy efficient communication runtime systems: a view from PGAS models
The Journal of Supercomputing
GBench: benchmarking methodology for evaluating the energy efficiency of supercomputers
Computer Science - Research and Development
Application-level voltage and frequency tuning of multi-phase program on the SCC
Proceedings of the 3rd International Workshop on Adaptive Self-Tuning Computing Systems
Hi-index | 0.00 |
Power consumption is a troublesome design constraint for emergent systems such as IBM's BlueGene /L. If current trends continue, future petaflop systems will require 100 megawatts of power to maintain high-performance. To address this problem the power and energy characteristics of highperformance systems must be characterized. To date, power-performance profiles for distributed systems have been limited to interactive commercial workloads. However, scientific workloads are typically non-interactive (batched) processes riddled with interprocess dependences and communication. We present a framework for direct, automatic profiling of power consumption for non-interactive, parallel scientific applications on high-performance distributed systems. Though our approach is general, we use our framework to study the power-performance efficiency of the NAS parallel benchmarks on a 32-node Beowulf cluster. We provide profiles by component (CPU, memory, disk, and NIC), by node (for each of 32 nodes), and by system scale (2, 4, 8, 16, and 32 nodes). Our results indicate power profiles are often regular corresponding to application characteristics and for fixed problem size increasing the number of nodes always increases energy consumption but does not always improve performance. This finding suggests smart schedulers could be used to optimize for energy while maintaining performance.