PVM: Parallel virtual machine: a users' guide and tutorial for networked parallel computing
PVM: Parallel virtual machine: a users' guide and tutorial for networked parallel computing
Communications of the ACM
Review of Performance Analysis Tools for MPI Parallel Programs
Proceedings of the 8th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
The unicore grid and its options for performance analysis
Performance analysis and grid computing
Productivity in High Performance Computing
International Journal of High Performance Computing Applications
Compressible memory data structures for event-based trace analysis
Future Generation Computer Systems
Monitoring cache behavior on parallel SMP architectures and related programming tools
Future Generation Computer Systems
Preserving time in large-scale communication traces
Proceedings of the 22nd annual international conference on Supercomputing
Scalable load-balance measurement for SPMD codes
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
An Interactive Graphical Environment for Code Optimization
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part II
Euro-Par 2008 Workshops - Parallel Processing
ScalaTrace: Scalable compression and replay of communication traces for high-performance computing
Journal of Parallel and Distributed Computing
Tools for scalable parallel program analysis: Vampir NG, MARMOT, and DeWiz
International Journal of Computational Science and Engineering
CCGRID '09 Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid
Comprehensive cache performance tuning with a toolset
Future Generation Computer Systems
Monitoring cache behavior on parallel SMP architectures and related programming tools
Future Generation Computer Systems
Compressible memory data structures for event-based trace analysis
Future Generation Computer Systems
Performance analysis of a parallel application in the GRID
ICCS'03 Proceedings of the 2003 international conference on Computational science: PartII
A new data compression technique for event based program traces
ICCS'03 Proceedings of the 2003 international conference on Computational science: PartIII
Software development in the grid: the DAMIEN tool-set
ICCS'03 Proceedings of the 1st international conference on Computational science: PartI
High Resolution Program Flow Visualization of Hardware Accelerated Hybrid Multi-core Applications
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Automatic generation of executable communication specifications from parallel applications
Proceedings of the international conference on Supercomputing
A scalable eigensolver for large scale-free graphs using 2D graph partitioning
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Implementation and usage of the PERUSE-Interface in open MPI
EuroPVM/MPI'06 Proceedings of the 13th European PVM/MPI User's Group conference on Recent advances in parallel virtual machine and message passing interface
New algorithms for performance trace analysis based on compressed complete call graphs
ICCS'05 Proceedings of the 5th international conference on Computational Science - Volume Part II
PARADIS: analysis of transaction-based applications in distributed environments
ICCS'05 Proceedings of the 5th international conference on Computational Science - Volume Part II
ScalaTrace: tracing, analysis and modeling of HPC codes at scale
PARA'10 Proceedings of the 10th international conference on Applied Parallel and Scientific Computing - Volume 2
Development process for clusters on a reconfigurable chip
Computers and Electrical Engineering
Auto-generation of communication benchmark traces
ACM SIGMETRICS Performance Evaluation Review
Alignment-Based metrics for trace comparison
Euro-Par'13 Proceedings of the 19th international conference on Parallel Processing
Hi-index | 0.00 |
Performance optimization remains one of the key issues in parallel computing. Many parallel applications do not benefit from the continually increasing peak performance of todays massively parallel computers, mainly because they have not been designed to operate efficiently on the 1000s of processors of todays top of the range systems. Conventional performance analysis is typically restricted to accumulated data on such large systems, severely limiting its use when dealing with real-world performance bottlenecks. Event based performance analysis can give the detailed insight required, but has to deal with extreme amounts of data, severely limiting its scalability. In this paper, we present an approach for scalable event-driven performance analysis that combines proven tool technology with novel concepts for hierarchical data layout and visualization. This evolutionary approach is being validated by implementing extensions to the performance analysis tool Vampir.