Dynamic Configuration for Distributed Systems
IEEE Transactions on Software Engineering
PECAN: Program Development Systems that Support Multiple Views
IEEE Transactions on Software Engineering
MULTILISP: a language for concurrent symbolic computation
ACM Transactions on Programming Languages and Systems (TOPLAS)
Graphical views of parallel programs
ACM SIGSOFT Software Engineering Notes
Parallel processing: the Cm* experience
Parallel processing: the Cm* experience
Monitoring distributed systems
ACM Transactions on Computer Systems (TOCS)
The temporal query language TQuel
ACM Transactions on Database Systems (TODS)
High-performance operating system primitives for robotics and real-time control systems
ACM Transactions on Computer Systems (TOCS)
Debugging Parallel Programs with Instant Replay
IEEE Transactions on Computers
A system for parallel programming
ICSE '87 Proceedings of the 9th international conference on Software Engineering
A relational approach to monitoring complex systems
ACM Transactions on Computer Systems (TOCS)
Solving problems on concurrent processors. Vol. 1: General techniques and regular problems
Solving problems on concurrent processors. Vol. 1: General techniques and regular problems
A Language and System for the Construction and Tuning of Parallel Programs
IEEE Transactions on Software Engineering
An open enviornment for building parallel programming systems
PPEALS '88 Proceedings of the ACM/SIGPLAN conference on Parallel programming: experience with applications, languages and systems
Foundations for the Arcadia environment architecture
SDE 3 Proceedings of the third ACM SIGSOFT/SIGPLAN software engineering symposium on Practical software development environments
Debugging heterogeneous distributed systems using event-based models of behavior
PADD '88 Proceedings of the 1988 ACM SIGPLAN and SIGOPS workshop on Parallel and distributed debugging
CHAOS: why one cannot have only an operating system for real-time applications
ACM SIGOPS Operating Systems Review
The Amber system: parallel programming on a network of multiprocessors
SOSP '89 Proceedings of the twelfth ACM symposium on Operating systems principles
ACM Computing Surveys (CSUR)
Models for monitoring and debugging tools for parallel and distributed software
Journal of Parallel and Distributed Computing - Special issue: software tools for parallel programming and visualization
A Hybrid Monitor for Behavior and Performance Analysis of Distributed Systems
IEEE Transactions on Software Engineering
Concurrency: Practice and Experience
Dynamic adaptation of real-time software
ACM Transactions on Computer Systems (TOCS)
Making real-time reactive systems reliable
ACM SIGOPS Operating Systems Review
PADD '91 Proceedings of the 1991 ACM/ONR workshop on Parallel and distributed debugging
Parallel program visualization using SIEVE.1
ICS '92 Proceedings of the 6th international conference on Supercomputing
Data base design for real-time adaptations
Journal of Systems and Software
The entity-relationship model—toward a unified view of data
ACM Transactions on Database Systems (TODS) - Special issue: papers from the international conference on very large data bases: September 22–24, 1975, Framingham, MA
A methodology for interactive computer service measurement
Communications of the ACM
Principles of Database Systems
Principles of Database Systems
Visualizing the Performance of Parallel Programs
IEEE Software
IPS-2: The Second Generation of a Parallel Program Measurement System
IEEE Transactions on Parallel and Distributed Systems
Performance Measurement Intrusion and Perturbation Analysis
IEEE Transactions on Parallel and Distributed Systems
The Cactis Project: Database Support for Software Environments
IEEE Transactions on Software Engineering
Performance evaluation of communicating processes
Performance evaluation of communicating processes
Ice--an integrated environment for constructing and tuning parallel programs
Ice--an integrated environment for constructing and tuning parallel programs
Topologies: distributed objects in multicomputers
Topologies: distributed objects in multicomputers
An annotated bibliography of interactive program steering
ACM SIGPLAN Notices
IEEE Transactions on Software Engineering
A structured approach to instrumentation system development and evaluation
Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
Investigating monitoring configurations
SAC '96 Proceedings of the 1996 ACM symposium on Applied Computing
Techniques for High-Performance Computational Steering
IEEE Concurrency
On-Line Monitoring: A Tutorial
Computer
High Performance Computational Steering of Physical Simulations
IPPS '97 Proceedings of the 11th International Symposium on Parallel Processing
Dynamic Querying of Streaming Data with the dQUOB System
IEEE Transactions on Parallel and Distributed Systems
An investigation of monitoring configurations
CASCON '95 Proceedings of the 1995 conference of the Centre for Advanced Studies on Collaborative research
Monitoring overhead in distributed systems: visualization and estimation techniques
CASCON '96 Proceedings of the 1996 conference of the Centre for Advanced Studies on Collaborative research
Toward distributed applications management using the OSI management framework
CASCON '94 Proceedings of the 1994 conference of the Centre for Advanced Studies on Collaborative research
Distributed supercomputing using ACTS
HPDC '96 Proceedings of the 5th IEEE International Symposium on High Performance Distributed Computing
HPDC '02 Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing
Agent construction in Mobile Surveyor
ACM-SE 42 Proceedings of the 42nd annual Southeast regional conference
Automated response using system-call delays
SSYM'00 Proceedings of the 9th conference on USENIX Security Symposium - Volume 9
The role of event description in architecting dependable systems
Architecting dependable systems
Trace profiling: Scalable event tracing on high-end parallel systems
Parallel Computing
Towards scalable event tracing for high end systems
HPCC'07 Proceedings of the Third international conference on High Performance Computing and Communications
Hi-index | 0.00 |
Achieving high performance for parallel or distributed programs often requires substantial amounts of information about the programs themselves, about the systems on which they are executing, and about specific program runs. The monitoring system that collects, analyzes, and makes application-dependent monitoring information available to the programmer and to the executing program is presented. The system may be used foroff-line program analysis, for on-line debugging, and for making on-line, dynamic changes to parallel or distributed programs to enhance their performance. The authors use a high-level, uniform data model for the representation of program information andmonitoring data. They show how this model may be used for the specification of programviews and attributes for monitoring, and demonstrate how such specifications can betranslated into efficient, program-specific monitoring code that uses alternativemechanisms for the distributed analysis and collection to be performed for the specifiedviews. The model's utility has been demonstrated on a wide variety of parallel machines.