Group communication specifications: a comprehensive study
ACM Computing Surveys (CSUR)
QoS Path Monitoring for Multicast Networks
Journal of Network and Systems Management
Active Management Framework for Distributed Multimedia Systems
Journal of Network and Systems Management
Programmable Agents for Active Distributed Monitoring
DSOM '99 Proceedings of the 10th IFIP/IEEE International Workshop on Distributed Systems: Operations and Management: Active Technologies for Network and Service Management
Efficient Random Process Generation for Reliable Simulation of Complex Systems
ICCS '01 Proceedings of the International Conference on Computational Science-Part II
GBF: a grammar based filter for internet applications
Journal of Network and Computer Applications
Edge-to-edge measurement-based distributed network monitoring
Computer Networks: The International Journal of Computer and Telecommunications Networking
Magpie: online modelling and performance-aware systems
HOTOS'03 Proceedings of the 9th conference on Hot Topics in Operating Systems - Volume 9
Using magpie for request extraction and workload modelling
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
A hierarchical Quality of Service control architecture for configurable multimedia applications
Journal of High Speed Networks
Visualizing processes on the web
Journal of Visual Languages and Computing
Efficient filtering of composite events
BNCOD'03 Proceedings of the 20th British national conference on Databases
The role of event description in architecting dependable systems
Architecting dependable systems
Self-organizing monitoring agents for hierarchical event correlation
DSOM'07 Proceedings of the Distributed systems: operations and management 18th IFIP/IEEE international conference on Managing virtualization of networks and services
GLIMPSE: a generic and flexible monitoring infrastructure
EWDC '11 Proceedings of the 13th European Workshop on Dependable Computing
Towards a model-driven infrastructure for runtime monitoring
SERENE'11 Proceedings of the Third international conference on Software engineering for resilient systems
Toward integrating IP multicasting in internet network management protocols
Computer Communications
A decentralized approach for mining event correlations in distributed system monitoring
Journal of Parallel and Distributed Computing
Hi-index | 0.00 |
With the increasing complexity of large-scale distributed (LSD) systems, an efficient monitoring mechanism has become an essential service for improving the performance and reliability of such complex applications.This paper presents a {\em scalable, dynamic, flexible} and {\em non-intrusive} monitoring architecture for managing large-scale distributed (LSD) systems. This architecture, which is referred to as the HiFi monitoring system, detects and classifies interesting primitive and composite events and performs either a corrective or steering action. When appropriate, information is also disseminated to management applications, such reactive control tools.The outlined solution offers improvements over related works by supporting new monitoring techniques such as hierarchical filtering-based monitoring and filter incarnation that improve the monitoring scalability and dynamism which are required for managing large-scale distributed systems. The HiFi monitoring system has been implemented and used at the Old Dominion University for monitoring and steering Interactive Remote Instruction (IRI) which is a large-scale distributed multimedia system for distance learning.