Communicating sequential processes
Communicating sequential processes
The need for network management
Computer Communications
Artificial intelligence: a modern approach
Artificial intelligence: a modern approach
Schemes for fault identification in communication networks
IEEE/ACM Transactions on Networking (TON)
SNMP, SNMPv2, and RMON (2nd ed.): practical network management
SNMP, SNMPv2, and RMON (2nd ed.): practical network management
Divide and conquer technique for network fault management
Proceedings of the fifth IFIP/IEEE international symposium on Integrated network management V : integrated management in a virtual world: integrated management in a virtual world
A coding approach to event correlation
Proceedings of the fourth international symposium on Integrated network management IV
A Generic Model for Fault Isolation in IntegratedManagement Systems
Journal of Network and Systems Management
Computing MIB Views via Delegated Agents
SMW '98 Proceedings of the IEEE Third International Workshop on Systems Management
High speed and robust event correlation
IEEE Communications Magazine
Network management research in ATDNet
IEEE Network: The Magazine of Global Internetworking
Probabilistic fault diagnosis in communication systems through incremental hypothesis updating
Computer Networks: The International Journal of Computer and Telecommunications Networking
Probabilistic fault localization in communication systems using belief networks
IEEE/ACM Transactions on Networking (TON)
IP fault localization via risk modeling
NSDI'05 Proceedings of the 2nd conference on Symposium on Networked Systems Design & Implementation - Volume 2
A graph-based proactive fault identification approach in computer networks
Computer Communications
Methodological Review: A review of causal inference for biomedical informatics
Journal of Biomedical Informatics
Root cause detection in a service-oriented architecture
Proceedings of the ACM SIGMETRICS/international conference on Measurement and modeling of computer systems
Nail-it-down: nailing and fixing configuration faults in cloud environments
Proceedings of the ACM International Conference on Computing Frontiers
Hidden anomaly detection in telecommunication networks
Proceedings of the 8th International Conference on Network and Service Management
Hi-index | 0.00 |
The increasing importance of computer networks in this information age demands a high level of network availability and reliability. As we become more dependent on networks in our so-called cyber-world, network faults and downtime become very costly. Sometimes, a slight fault may cause critical disruptions or remediless damages to the network while the network manager is lost among a large amount of alarm messages. Therefore, the development of a practical and effective system for network fault diagnosis becomes an imperative and critical task. In this paper, we develop a hierarchical domain-oriented reasoning mechanism suitable for the delegated management architecture. It is based on the causality graph of a refined network fault propagation model as a result of our empirical study. An automated fault diagnosis system called Alarm Correlation View (or ACView) for isolating network faults in a multi-domain environment is proposed according to the hierarchical reasoning mechanism. This diagnosis system not only provides the process of automated alarm collection and correlation, but also serves the function of efficient fault localization and identification. Furthermore, an alarm-to-fault mapping strategy is used to enhance the fault reasoning capability for uncertain network fault propagation.