The paragon performance monitoring environment
Proceedings of the 1993 ACM/IEEE conference on Supercomputing
A Measurement-Based Model to Predict the Performance Impact of System Modifications: A Case Study
IEEE Transactions on Parallel and Distributed Systems
Software—Practice & Experience
A structured approach to instrumentation system development and evaluation
Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
Middleware: a model for distributed system services
Communications of the ACM
Modeling, evaluation, and testing of paradyn instrumentation system
Supercomputing '96 Proceedings of the 1996 ACM/IEEE conference on Supercomputing
Simulation Modeling and Analysis
Simulation Modeling and Analysis
The Art of Computer Programming, 2nd Ed. (Addison-Wesley Series in Computer Science and Information
The Art of Computer Programming, 2nd Ed. (Addison-Wesley Series in Computer Science and Information
Parallel performance prediction using lost cycles analysis
Proceedings of the 1994 ACM/IEEE conference on Supercomputing
IPS-2: The Second Generation of a Parallel Program Measurement System
IEEE Transactions on Parallel and Distributed Systems
Performance Measurement Intrusion and Perturbation Analysis
IEEE Transactions on Parallel and Distributed Systems
JEWEL: Design and Implementation of a Distributed Measurement System
IEEE Transactions on Parallel and Distributed Systems
SPI: an instrumentation development environment for parallel/distributed systems
IPPS '95 Proceedings of the 9th International Symposium on Parallel Processing
VIZIR: An Integrated Environment for Distributed Program Visualization
MASCOTS '95 Proceedings of the 3rd International Workshop on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems
An Adaptive Cost System for Parallel Program Instrumentation
Euro-Par '96 Proceedings of the Second International Euro-Par Conference on Parallel Processing - Volume I
An interactive interface and RT-Mach support for monitoring and controlling resource management
RTAS '95 Proceedings of the Real-Time Technology and Applications Symposium
Modeling, Evaluation, and Adaptive Control of an Instrumentation System
RTAS '97 Proceedings of the 3rd IEEE Real-Time Technology and Applications Symposium (RTAS '97)
Dynamic analysis for locating product features in Ada code
Proceedings of the 2001 annual ACM SIGAda international conference on Ada
MRNet: A Software-Based Multicast/Reduction Network for Scalable Tools
Proceedings of the 2003 ACM/IEEE conference on Supercomputing
Ygdrasil: aggregator network toolkit for large scale systems and the grid
PARA'04 Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing
Towards scalable event tracing for high end systems
HPCC'07 Proceedings of the Third international conference on High Performance Computing and Communications
Hi-index | 0.00 |
This paper demonstrates the use of a model-based evaluation approach for instrumentation systems (ISs). The overall objective of this study is to provide early feedback to tool developers regarding IS overhead and performance; such feedback helps developers make appropriate design decisions about alternative system configurations and task scheduling policies. We consider three types of system architectures: network of workstations (NOW), symmetric multiprocessors (SMP), and massively parallel processing (MPP) systems. We develop a Resource OCCupancy (ROCC) model for an on-line IS for an existing tool and parameterize it for an IBM SP-2 platform. This model is simulated to answer several 'what if' questions regarding two policies to schedule instrumentation data forwarding: collect-and-forward (CF) and batch-and-forward (BF). In addition, this study investigates two alternatives for forwarding the instrumentation data: direct and binary tree forwarding for an MPP system. Simulation results indicate that the BF policy can significantly reduce the overhead and that the tree forwarding configuration exhibits desirable scalability characteristics for MPP systems. Initial measurement-based testing results indicate more than 60 percent reduction in the direct IS overhead when the BF policy was added to Paradyn parallel performance measurement tool.