The monitoring of distributed systems involves the collection, interpretation, and display of information concerning the interactions among concurrently executing processes. This information and its display can support the debugging, testing, performance evaluation, and dynamic documentation of distributed systems. General problems associated with monitoring are outlined in this paper, and the architecture of a general-purpose, extensible, distributed monitoring system is presented. Three approaches to the display of process interactions are described: textual traces, animated graphical traces, and a combination of aspects of the textual and graphical approaches. The roles that each of these approaches fulfills in monitoring and debugging distributed systems are identified and compared. Monitoring tools for collecting communication statistics, detecting deadlock, controlling the non-deterministic execution of distributed systems, and using protocol specifications in monitoring are also described.

Our discussion is based on experience in the development and use of a monitoring system within a distributed programming environment called Jade. Jade was developed within the Computer Science Department of the University of Calgary and is now being used to support teaching and research at a number of universities and research organizations.