Improved algorithms for synchronizing computer network clocks
IEEE/ACM Transactions on Networking (TON)
Experience with an adaptive globally-synchronizing clock algorithm
Proceedings of the eleventh annual ACM symposium on Parallel algorithms and architectures
Time, clocks, and the ordering of events in a distributed system
Communications of the ACM
A Unified Trace Environment for IBM SP Systems
IEEE Parallel & Distributed Technology: Systems & Technology
Toward Scalable Performance Visualization with Jumpshot
International Journal of High Performance Computing Applications
Dynamic statistical profiling of communication activity in distributed applications
SIGMETRICS '02 Proceedings of the 2002 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
On the Scalability of Tracing Mechanisms
Euro-Par '02 Proceedings of the 8th International Euro-Par Conference on Parallel Processing
Vertical profiling: understanding the behavior of object-priented applications
OOPSLA '04 Proceedings of the 19th annual ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications
OOPSLA '05 Proceedings of the 20th annual ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications
The Tau Parallel Performance System
International Journal of High Performance Computing Applications
Using hardware performance monitors to understand the behavior of java applications
VM'04 Proceedings of the 3rd conference on Virtual Machine Research And Technology Symposium - Volume 3
Scalability analysis of SPMD codes using expectations
Proceedings of the 21st annual international conference on Supercomputing
Scalable event-based performance measurement in high-end environments
ACM SIGMETRICS Performance Evaluation Review
An efficient format for nearly constant-time access to arbitrary time intervals in large trace files
Scientific Programming - Large-Scale Programming Tools and Environments
Visualizing the runtime behavior of embedded network systems: A toolkit for TinyOS
Science of Computer Programming
Traces synchronization in distributed networks
Journal of Computer Systems, Networks, and Communications
Log file formats for parallel applications: a review
International Journal of Parallel Programming
A parallel trace-data interface for scalable performance analysis
PARA'06 Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing
Proceedings of the 24th ACM International Conference on Supercomputing
The Cilkview scalability analyzer
Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures
Software—Practice & Experience
Visual analysis of I/O system behavior for high-end computing
Proceedings of the third international workshop on Large-scale system and application performance
Scalable parallel trace-based performance analysis
EuroPVM/MPI'06 Proceedings of the 13th European PVM/MPI User's Group conference on Recent advances in parallel virtual machine and message passing interface
A performance measurement infrastructure for co-array fortran
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
Detecting application load imbalance on high end massively parallel systems
Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
Traces generation to simulate large-scale distributed applications
Proceedings of the Winter Simulation Conference
Energy saving strategies for parallel applications with point-to-point communication phases
Journal of Parallel and Distributed Computing
Hi-index | 0.00 |
In this paper we describe a trace analysis framework, from trace generation to visualization. It includes a unified tracing facility on IBM SP systems, a self-defining interval file format, an API for framework extensions, utilities for merging and statistics generation, and a visualization tool with preview and multiple time-space diagrams. The trace environment is extremely scalable, and combines MPI events with system activities in the same set of trace files, one for each SMP node. Since the amount of trace data may be very large, utilities are developed to convert and merge individual trace files into a self-defining interval trace file with multiple frame directories. The interval format allows the development of multiple time-space diagrams, such as thread-activity view, processor-activity view, etc., from the same interval file. A visualization tool, Jumpshot, is modified to visualize these views. A statistics utility is developed using the API along with its graphics viewer.