Schemes for fault identification in communication networks
IEEE/ACM Transactions on Networking (TON)
End-to-end routing behavior in the Internet
IEEE/ACM Transactions on Networking (TON)
Distributed fault location in networks using mobile agents
IATA '98 Proceedings of the second international workshop on Intelligent agents for telecommunication applications
Journal of the ACM (JACM)
Smart packets: applying active networks to network management
ACM Transactions on Computer Systems (TOCS)
Coding-based schemes for fault identification in communication networks
International Journal of Network Management
An Automated Fault Diagnosis System Using Hierarchical Reasoning and Alarm Correlation
Journal of Network and Systems Management
ICATPN '97 Proceedings of the 18th International Conference on Application and Theory of Petri Nets
Mobile agents for network management
IEEE Communications Surveys & Tutorials
Research: A LAN fault diagnosis system
Computer Communications
JTMN: a Java-based TMN development and experimentation environment
IEEE Journal on Selected Areas in Communications
IEEE Journal on Selected Areas in Communications
NESTOR: an architecture for network self-management and organization
IEEE Journal on Selected Areas in Communications
Nail-it-down: nailing and fixing configuration faults in cloud environments
Proceedings of the ACM International Conference on Computing Frontiers
Hi-index | 0.24 |
In large-scale computer networks, the isolation of the primary failure source is a challenging task. This article presents a proactive network fault diagnosis approach based on graph theory. Compared with other approaches, the manager of network management system checks the status of the managed devices actively rather than receive messages from those objects passively. The salient feature of this approach is that the possible failure sources, including the real one, can be computed precisely and quickly without any alarm historical information or strict assumptions. This approach does not introduce much processing complexity by taking full use of matrix and Boolean operations. To test and evaluate our proposed algorithm, it is implemented in Java and tested in a real large network environment. The experiment results show that this approach is not only efficient but also scalable on fault identification in large-scale computer networks.