Distributed systems in enterprises as well as telecommunication environments strongly demand more automated fault management. A single fault in these complex systems might cause a huge number of symptomatic error messages and side effects to occur. The common root faults for these symptoms have to be identified to start fault removal procedures as soon as possible and to decrease system downtime. This paper presents a methodology for fault isolation in integrated management systems. A generic model is described that unifies the view of the management system on the managed environment. It integrates the relevant aspects of network, system, and service management layers in order to perform integrated fault isolation. Our approach is based on a general dependency graph model. It captures the information that is required to determine the root cause of a fault on the one hand, and the set of fault-affected services and customers on the other hand. The layered TMN architecture serves as an example for an integrated management environment throughout this paper.
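The two uses of a dependency graph named in the abstract can be illustrated with a minimal sketch. It is not the paper's model: the class `DependencyGraph`, its methods, and all object names (`router-1`, `mail-service`, and so on) are hypothetical, and the root-cause step assumes the simplest possible rule, namely that a candidate root is any object on which every symptomatic object depends.

```python
from collections import defaultdict


class DependencyGraph:
    """Toy dependency graph: an edge A -> B means 'A depends on B'."""

    def __init__(self):
        self.depends_on = defaultdict(set)  # object -> objects it depends on
        self.dependents = defaultdict(set)  # object -> objects depending on it

    def add_dependency(self, obj, dependency):
        self.depends_on[obj].add(dependency)
        self.dependents[dependency].add(obj)

    @staticmethod
    def _reach(start, edges):
        """Return every object reachable from start by following edges."""
        seen, stack = set(), [start]
        while stack:
            current = stack.pop()
            for nxt in edges.get(current, ()):
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append(nxt)
        return seen

    def root_cause_candidates(self, symptoms):
        """Objects that every symptomatic object is, or depends on (assumed rule)."""
        per_symptom = [self._reach(s, self.depends_on) | {s} for s in symptoms]
        return set.intersection(*per_symptom) if per_symptom else set()

    def affected(self, fault):
        """Services and customers that directly or indirectly depend on fault."""
        return self._reach(fault, self.dependents)


# Hypothetical layered environment: customers -> services -> systems -> network.
graph = DependencyGraph()
graph.add_dependency("customer-a", "mail-service")
graph.add_dependency("customer-b", "web-service")
graph.add_dependency("mail-service", "mail-server")
graph.add_dependency("web-service", "web-server")
graph.add_dependency("mail-server", "router-1")
graph.add_dependency("web-server", "router-1")

candidates = graph.root_cause_candidates({"mail-service", "web-service"})
impacted = graph.affected("router-1")
```

In this toy environment, alarms on both services share only `router-1` as a common dependency, so it is the single root-cause candidate, and walking the graph in the opposite direction from `router-1` yields the servers, services, and customers it affects.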