An Application-Based Performance Characterization of the Columbia Supercluster
SC '05 Proceedings of the 2005 ACM/IEEE conference on Supercomputing
The Cosmic Microwave Background (CMB) is an exquisitely sensitive probe of the fundamental parameters of cosmology. Extracting this information is computationally intensive, requiring massively parallel computing and sophisticated numerical algorithms. In this work we present MADbench, a lightweight version of the MADCAP CMB power spectrum estimation code that retains the operational complexity and integrated system requirements. In addition, to quantify communication behavior across a variety of architectural platforms, we introduce the Integrated Performance Monitoring (IPM) package: a portable, lightweight, and scalable tool for effectively extracting MPI message-passing overheads. A performance characterization study is conducted on some of the world's most powerful supercomputers, including the superscalar Seaborg (IBM Power3+) and CC-NUMA Columbia (SGI Altix), as well as the vector-based Earth Simulator (NEC SX-6 enhanced) and Phoenix (Cray X1) systems. In-depth analysis shows that in order to bridge the gap between theoretical and sustained system performance, it is critical to gain a clear understanding of how the distinct parts of large-scale parallel applications interact with the individual subcomponents of HEC platforms.