This paper addresses the use of traces taken from MPI applications in simulation-based performance studies of parallel computing systems. Different mechanisms for capturing traces are discussed, and important limitations of some of them are pointed out. One of these limitations is the invisibility of the message exchanges that take place inside collective operations, which is circumvented by modifying a trace-capturing library. During a simulation, trace records must be simulated in causal order to fully comply with application semantics. Alternatives for following this order, and the risks of not following it, are presented and discussed. The techniques introduced in this paper have been implemented in an in-house simulation environment, which is used in two example studies to show its usefulness: an evaluation of alternatives for interconnection network design, and a performance prediction study in which traces from one machine are used to estimate the execution times of applications running on a different machine.
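As an illustration of what simulating trace records in causal order can mean, the following minimal sketch replays per-rank traces so that a receive is never processed before its matching send has been simulated. It is not the paper's simulation environment: the record format (`"compute"`, `"send"`, `"recv"` tuples), the `replay` function, and the fixed `latency` network model are all assumptions made for this example.

```python
from collections import deque


def replay(traces, latency=1.0):
    """Replay hypothetical per-rank traces; return each rank's finish time.

    Record formats (assumed for this sketch):
      ("compute", seconds), ("send", dst_rank), ("recv", src_rank)
    """
    n = len(traces)
    clock = [0.0] * n   # local simulated time of each rank
    pos = [0] * n       # index of the next record to process per rank
    in_flight = {}      # (src, dst) -> FIFO deque of message arrival times

    progressed = True
    while progressed:
        progressed = False
        for rank in range(n):
            while pos[rank] < len(traces[rank]):
                rec = traces[rank][pos[rank]]
                kind = rec[0]
                if kind == "compute":
                    clock[rank] += rec[1]
                elif kind == "send":
                    arrival = clock[rank] + latency
                    in_flight.setdefault((rank, rec[1]), deque()).append(arrival)
                else:  # "recv"
                    queue = in_flight.get((rec[1], rank))
                    if not queue:
                        # Matching send not simulated yet: this rank must
                        # wait, otherwise causality would be violated.
                        break
                    clock[rank] = max(clock[rank], queue.popleft())
                pos[rank] += 1
                progressed = True

    if any(pos[r] < len(traces[r]) for r in range(n)):
        raise RuntimeError("trace cannot complete: a receive has no matching send")
    return clock


traces = [
    [("compute", 2.0), ("send", 1), ("recv", 1)],   # rank 0
    [("recv", 0), ("compute", 3.0), ("send", 0)],   # rank 1
]
finish_times = replay(traces)
print(finish_times)  # [7.0, 6.0]
```

In this example rank 1 cannot start computing until the message from rank 0 arrives at time 3.0, and rank 0 in turn waits until time 7.0 for the reply. A simulator that processed records purely by their recorded timestamps, ignoring this dependency, could deliver a message before it was sent.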