Design and Analysis of Dynamic Redundancy Networks
IEEE Transactions on Computers
Cisco IOS Configuration Fundamentals
Cisco IOS Configuration Fundamentals
Pinpoint: Problem Determination in Large, Dynamic Internet Services
DSN '02 Proceedings of the 2002 International Conference on Dependable Systems and Networks
Experimental Study of Internet Stability and Backbone Failures
FTCS '99 Proceedings of the Twenty-Ninth Annual International Symposium on Fault-Tolerant Computing
Recovery Oriented Computing (ROC): Motivation, Definition, Techniques,
Recovery Oriented Computing (ROC): Motivation, Definition, Techniques,
Undo for operators: building an undoable e-mail store
ATEC '03 Proceedings of the annual conference on USENIX Annual Technical Conference
Why do internet services fail, and what can be done about it?
USITS'03 Proceedings of the 4th conference on USENIX Symposium on Internet Technologies and Systems - Volume 4
Proposal on network-wide rollback scheme for fast recovery from operator errors
DSOM'07 Proceedings of the Distributed systems: operations and management 18th IFIP/IEEE international conference on Managing virtualization of networks and services
Detecting application-level failures in component-based Internet services
IEEE Transactions on Neural Networks
Hi-index | 0.00 |
Network failures may have a major impact on our society. There are many possible causes of network failures, of which the most significant is operator errors. Consequently, the development of new network management schemes to tackle operator errors is important. We have already proposed a basic idea of a new network-wide rollback scheme to tackle operator errors. In the proposed scheme, we introduce a server to manage historical versions of sets of device configuration. An operator rolls back a set of device configuration via the server when the operator detects a network failure. In this paper, we present a detail of the network-wide rollback scheme. In addition, we provide three rollback procedures, and implement a prototype system to evaluate their rollback time. The proposed scheme will serve for fast recovery from operator errors, as the minimum rollback time is about 41 seconds, when 50 routers are rolled back.