Power and Energy Profiling of Scientific Applications on Distributed Systems

Authors:
Xizhou Feng;Rong Ge;Kirk W. Cameron
Affiliations:
University of South Carolina, Columbia, SC;University of South Carolina, Columbia, SC;University of South Carolina, Columbia, SC
Venue:
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers - Volume 01
Year:
2005

Abstract

Power consumption is a troublesome design constraint for emergent systems such as IBM's BlueGene/L. If current trends continue, future petaflop systems will require 100 megawatts of power to maintain high performance. To address this problem, the power and energy characteristics of high-performance systems must be characterized. To date, power-performance profiles for distributed systems have been limited to interactive commercial workloads. However, scientific workloads are typically non-interactive (batched) processes riddled with interprocess dependences and communication. We present a framework for direct, automatic profiling of power consumption for non-interactive, parallel scientific applications on high-performance distributed systems. Though our approach is general, we use our framework to study the power-performance efficiency of the NAS parallel benchmarks on a 32-node Beowulf cluster. We provide profiles by component (CPU, memory, disk, and NIC), by node (for each of 32 nodes), and by system scale (2, 4, 8, 16, and 32 nodes). Our results indicate that power profiles are often regular, corresponding to application characteristics, and that for a fixed problem size, increasing the number of nodes always increases energy consumption but does not always improve performance. This finding suggests smart schedulers could be used to optimize for energy while maintaining performance.
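
The relationship between a per-component power profile and total energy can be illustrated with a minimal sketch. It is not the authors' framework: the function names (`energy_joules`, `node_energy`, `system_energy`), the trace layout, and every number in the sample trace are hypothetical and chosen only to make the arithmetic easy to follow. It assumes each component's power is sampled as (time, watts) pairs and that energy is the time integral of power, approximated here with the trapezoidal rule.

```python
from typing import Dict, List, Tuple

# One sample is (timestamp in seconds, power in watts). Hypothetical layout.
Sample = Tuple[float, float]


def energy_joules(samples: List[Sample]) -> float:
    """Approximate energy as the integral of power over time (trapezoidal rule)."""
    total = 0.0
    for (t0, p0), (t1, p1) in zip(samples, samples[1:]):
        total += 0.5 * (p0 + p1) * (t1 - t0)
    return total


def node_energy(traces: Dict[str, List[Sample]]) -> Dict[str, float]:
    """Energy per component (for example CPU, memory, disk, NIC) on one node."""
    return {name: energy_joules(trace) for name, trace in traces.items()}


def system_energy(nodes: List[Dict[str, List[Sample]]]) -> float:
    """Total energy across all nodes and all components."""
    return sum(sum(node_energy(node).values()) for node in nodes)


# Made-up trace for a single node; these are NOT measurements from the paper.
example_node = {
    "cpu": [(0.0, 20.0), (1.0, 30.0), (2.0, 30.0)],
    "memory": [(0.0, 5.0), (2.0, 5.0)],
    "disk": [(0.0, 2.0), (2.0, 2.0)],
    "nic": [(0.0, 1.0), (2.0, 3.0)],
}

if __name__ == "__main__":
    print(node_energy(example_node))
    print(system_energy([example_node, example_node]))
```

Because total energy sums over every node for the whole run time, adding nodes lowers it only if run time shrinks faster than the node count grows, which is the trade-off the abstract's finding concerns.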