A relational approach to monitoring complex systems
ACM Transactions on Computer Systems (TOCS)
A real-time monitor for a distributed real-time operating system
PADD '88 Proceedings of the 1988 ACM SIGPLAN and SIGOPS workshop on Parallel and distributed debugging
A bibliography of parallel debuggers, 1990 edition
ACM SIGPLAN Notices
Hi-index | 0.00 |
Writing and debugging distributed programs can be difficult. When a program is working, it may be difficult to achieve reasonable execution performance. A cause of these difficulties is a lack of tools for the programmer. We use a model of distributed computation and measurement to implement a program monitoring system for programs running on the Berkeley UNIX 4.2BSD operating system. The model of distributed computation describes the activities of the processes within a distributed program in terms of computation (internal events) and communication (external events). The measurement model separates the detection of external events, event record selection, and data analysis. The implementation of the measurement tools involved changes to the Berkeley UNIX kernel, and the addition daemon processes to allow the monitoring activity to take place across machine boundaries. A user interface has also been implemented. We present a users'' manual and an example of the use of the measurement system.