Automatic trace analysis is an effective method for identifying complex performance phenomena in parallel applications. However, as parallel systems grow and individual applications use ever more processors, the traditional approach of analyzing a single global trace file, as done by KOJAK's EXPERT trace analyzer, becomes increasingly constrained by the large number of events. In this article, we present a scalable version of the EXPERT analysis that analyzes separate local trace files with a parallel tool which 'replays' the target application's communication behavior. We describe the new parallel analyzer architecture and discuss first empirical results.
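The replay idea can be illustrated with a small sketch. It is not the analyzer described in the article: the `Event` record, the `replay_late_sender` function, and the choice of a "late sender" waiting-time check are assumptions made for illustration, and Python threads with in-memory queues stand in for the analysis processes and the re-enacted messages. Each worker reads only its own local trace; at a recorded send it forwards the send timestamp to the receiver's worker, and at a recorded receive it compares that timestamp with its own.

```python
import queue
import threading
from collections import namedtuple

# Hypothetical event record: kind is "send" or "recv", peer is the partner
# rank, enter is the time at which the process entered the call.
Event = namedtuple("Event", "kind peer enter")


def replay_late_sender(local_traces):
    """Return, per rank, the receive waiting time caused by late senders.

    local_traces[r] is the local event list of rank r. Each rank is analyzed
    by its own worker, which never reads another rank's trace.
    """
    n = len(local_traces)
    # One channel per ordered (sender, receiver) pair stands in for the
    # point-to-point messages that the replay re-enacts.
    channels = {
        (s, r): queue.Queue() for s in range(n) for r in range(n) if s != r
    }
    waiting = [0.0] * n

    def analyze(rank):
        for ev in local_traces[rank]:
            if ev.kind == "send":
                # Re-enact the send: ship the local timestamp to the peer.
                channels[(rank, ev.peer)].put(ev.enter)
            elif ev.kind == "recv":
                # Re-enact the receive: fetch the matching send timestamp.
                send_enter = channels[(ev.peer, rank)].get()
                # The receiver waited if it entered before the sender did.
                waiting[rank] += max(0.0, send_enter - ev.enter)

    workers = [threading.Thread(target=analyze, args=(r,)) for r in range(n)]
    for w in workers:
        w.start()
    for w in workers:
        w.join()
    return waiting


if __name__ == "__main__":
    traces = [
        [Event("send", 1, 5.0), Event("recv", 1, 6.0)],  # rank 0
        [Event("recv", 0, 2.0), Event("send", 0, 7.0)],  # rank 1
    ]
    print(replay_late_sender(traces))
```

In this toy input, rank 1 enters its receive at 2.0 while rank 0 sends at 5.0, and rank 0 enters its receive at 6.0 while rank 1 sends at 7.0. The sketch assumes timestamps from different ranks are directly comparable and that every recorded receive has a matching send in the peer's trace.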