Communications of the ACM - Special section on computer architecture
Evaluating two massively parallel machines
Communications of the ACM
Monitoring distributed systems
ACM Transactions on Computer Systems (TOCS)
Clock synchronization of a large multiprocessor system in the presence of malicious faults
IEEE Transactions on Computers
Key Concepts of the INCAS Multicomputer Project
IEEE Transactions on Software Engineering
A relational approach to monitoring complex systems
ACM Transactions on Computer Systems (TOCS)
Global events and global breakpoints in distributed systems
Proceedings of the Twenty-First Annual Hawaii International Conference on Software Track
Monitoring and performance measuring distributed systems during operation
SIGMETRICS '88 Proceedings of the 1988 ACM SIGMETRICS conference on Measurement and modeling of computer systems
The Accuracy of the Clock Synchronization Achieved by TEMPO in Berkeley UNIX 4.3BSD
IEEE Transactions on Software Engineering
Time, clocks, and the ordering of events in a distributed system
Communications of the ACM
Monitoring and Management-Support of Distributed Systems
Proceedings of the European Workshop on Process in Distributed Operating Systems and Distributed Systems Management
The Lady Programming Environment For Distributed Operating Systems
PARLE '89 Proceedings of the Parallel Architectures and Languages Europe, Volume I: Parallel Architectures
Online system performance measurements with software and hybrid monitors
SOSP '73 Proceedings of the fourth ACM symposium on Operating system principles
Application of Real-Time Monitoring to Scheduling Tasks with Random Execution Times
IEEE Transactions on Software Engineering
A Noninterference Monitoring and Replay Mechanism for Real-Time Software Testing and Debugging
IEEE Transactions on Software Engineering
Run-time monitoring of concurrent programs on the Cedar multiprocessor
Proceedings of the 1990 ACM/IEEE conference on Supercomputing
The flight recorder: an architectural aid for system monitoring
PADD '91 Proceedings of the 1991 ACM/ONR workshop on Parallel and distributed debugging
A portable platform for distributed event environments
PADD '91 Proceedings of the 1991 ACM/ONR workshop on Parallel and distributed debugging
A bibliography of parallel debuggers, 1993 edition
PADD '93 Proceedings of the 1993 ACM/ONR workshop on Parallel and distributed debugging
An annotated bibliography of interactive program steering
ACM SIGPLAN Notices
DeeDS towards a distributed and active real-time database system
ACM SIGMOD Record
High-Level Views of Distributed Executions: Convex Abstract Events
Automated Software Engineering
In Search of a Standards-Based Approach to Hybrid Performance Monitoring
IEEE Parallel & Distributed Technology: Systems & Technology
Application-Dependent Dynamic Monitoring of Distributed and Parallel Systems
IEEE Transactions on Parallel and Distributed Systems
Supporting System-Level Testing of Applications by Active Real-Time Database Systems
ARTDB '97 Proceedings of the Second International Workshop on Active, Real-Time, and Temporal Database Systems
A Taxonomy and Catalog of Runtime Software-Fault Monitoring Tools
IEEE Transactions on Software Engineering
A softerware monitor for shared-memory multiprocessor computers
Software—Practice & Experience
An empirical study of hierarchical division for mesh-structured networks
Journal of Computational Methods in Sciences and Engineering - Selected papers from the International Conference on Computer Science, Software Engineering, Information Technology, e-Business, and Applications, 2004
Hi-index | 0.00 |
The authors describe a hybrid monitor for measuring the performance and observing the behavior of distributed systems during execution. They emphasize data collection, analysis and presentation of execution data. A special hardware support, which consists of a test and measurement processor (TMP), was designed and has been implemented in the nodes of experimental multicomputer system consisting of eleven nodes. The operations of the TMP are completely transparent with a minimal, less than 0.1%, overhead to the measured system. In the experimental system, all the TMPs were connected with a central monitoring station, using an independent communication network, in order to provide a global view of the monitored system. The central monitoring station displayed the resulting information in easy-to-read charts and graphs. Experience with the TMP shows that it promotes an improved understanding of run-time behavior and performance measurements, which aids in deriving qualitative and quantitative assessments of distributed systems.