Implementing fault-tolerant services using the state machine approach: a tutorial
ACM Computing Surveys (CSUR)
Horus: a flexible group communication system
Communications of the ACM
Network management (2nd ed.): a practical perspective
Network management (2nd ed.): a practical perspective
A Hierarchical Adaptive Distributed System-Level Diagnosis Algorithm
IEEE Transactions on Computers
A group communication protocol for distributed network management systems
ICCC '95 Proceedings of the 12th international conference on computer communication on Information highways : for a smaller world and better living: for a smaller world and better living
A Configurable Membership Service
IEEE Transactions on Computers
Fault isolation and event correlation for integrated fault management
Proceedings of the fifth IFIP/IEEE international symposium on Integrated network management V : integrated management in a virtual world: integrated management in a virtual world
SNMP,SNMPV2,Snmpv3,and RMON 1 and 2
SNMP,SNMPV2,Snmpv3,and RMON 1 and 2
Building Secure and Reliable Network Applications
Building Secure and Reliable Network Applications
Towards Fault Recovery and Management inCommunication Networks
Journal of Network and Systems Management
Fault-Tolerance by Replication in Distributed Systems
Ada-Europe '96 Proceedings of the 1996 Ada-Europe International Conference on Reliable Software Technologies
A Fault Tolerance Framework for CORBA
FTCS '99 Proceedings of the Twenty-Ninth Annual International Symposium on Fault-Tolerant Computing
Using Multicast-SNMP to Coordinate Distributed Management Agents
SMW '96 Proceedings of the 2nd IEEE International Workshop on Systems Management (SMW'96)
SRDS '98 Proceedings of the The 17th IEEE Symposium on Reliable Distributed Systems
Proactive Network Fault Detection
INFOCOM '97 Proceedings of the INFOCOM '97. Sixteenth Annual Joint Conference of the IEEE Computer and Communications Societies. Driving the Information Revolution
A Distributed Network Connectivity Algorithm
ISADS '03 Proceedings of the The Sixth International Symposium on Autonomous Decentralized Systems (ISADS'03)
Network Fault Management Based on SNMP Agent Groups
ICDCSW '01 Proceedings of the 21st International Conference on Distributed Computing Systems
The ensemble system
ANMP: ad hoc network management protocol
IEEE Journal on Selected Areas in Communications
A Survey of Fault Management in Wireless Sensor Networks
Journal of Network and Systems Management
Hi-index | 0.00 |
This paper presents a new clustering architecture for SNMP agents that supports semi-active replication of managed objects. A cluster of agents provides fault-tolerant object functionality: replicated managed objects of a crashed agent of a given cluster may be accessed through a peer cluster. The proposed architecture is structured in three layers. The lower layer corresponds to the managed objects at the network elements. The middle layer contains management entities called clusters that monitor and replicate managed objects. The upper layer allows the definition of management clusters as well as the relationship between clusters. A practical tool was implemented and is presented. The impact of replication on network performance is evaluated as well as a probabilistic analysis of replicated object consistency.