Fault detection with multiple observers
IEEE/ACM Transactions on Networking (TON)
ECBS '04 Proceedings of the 11th IEEE International Conference and Workshop on Engineering of Computer-Based Systems
Basic Concepts and Taxonomy of Dependable and Secure Computing
IEEE Transactions on Dependable and Secure Computing
Probabilistic fault localization in communication systems using belief networks
IEEE/ACM Transactions on Networking (TON)
Assured service quality by improved fault management
Proceedings of the 2nd international conference on Service oriented computing
Sympathy for the sensor network debugger
Proceedings of the 3rd international conference on Embedded networked sensor systems
Towards a Complete SNMP-Based Supervision of System-on-Chips
Journal of Network and Systems Management
Autonomic network management: some pragmatic considerations
Proceedings of the 2006 SIGCOMM workshop on Internet network management
Proceedings of the 44th annual Southeast regional conference
FUSE: lightweight guaranteed distributed failure notification
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
ICSOC/ServiceWave'09 Proceedings of the 2009 international conference on Service-oriented computing
The semantics of alarm definitions: enabling systematic reasoning about alarms
International Journal of Network Management
Design and Evaluation of Techniques for Resilience and Survivability of the Routing Node
International Journal of Adaptive, Resilient and Autonomic Systems
Hi-index | 0.00 |
Today's network management, as known within the Fault, Configuration, Accounting, Performance, Security (FCAPS) management framework, is moving towards the definition and implementation of 'self-managing' network functions, with the aim of eliminating or drastically reducing human intervention in some of the complex aspects or daunting tasks of network management. The fault management plane of the FCAPS framework deals with the following functions: fault detection, fault diagnosis, localization or isolation, and fault removal. Task automation is at the very heart of self-managing (autonomic) nodes and networks, meaning that all functions and processes related to fault management must be automated as much as possible within the functionalities of self-managing (autonomic) nodes and networks, in order for us to talk about autonomic fault management. At this point in time there are projects calling for implementing new network architectures that are flexible to support on-demand functional composition for context- or situation-aware networking. A number of such projects have started, under the umbrella of the so-called clean-slate network designs. Therefore, this calls for open frameworks for implementing self-managing (autonomic) functions across each of the traditional FCAPS management planes. This paper presents a unified framework for implementing autonomic fault management and failure detection for self-managing networks, a framework we are calling UniFAFF.