Architecture and Dependability of Large-Scale Internet Services
IEEE Internet Computing
Software Dependability in the Tandem GUARDIAN System
IEEE Transactions on Software Engineering
Pinpoint: Problem Determination in Large, Dynamic Internet Services
DSN '02 Proceedings of the 2002 International Conference on Dependable Systems and Networks
Alert Correlation in a Cooperative Intrusion Detection Framework
SP '02 Proceedings of the 2002 IEEE Symposium on Security and Privacy
Performance debugging for distributed systems of black boxes
SOSP '03 Proceedings of the nineteenth ACM symposium on Operating systems principles
Autonomous recovery in componentized Internet applications
Cluster Computing
Why do internet services fail, and what can be done about it?
USITS'03 Proceedings of the 4th conference on USENIX Symposium on Internet Technologies and Systems - Volume 4
Distributed Diagnosis of Failures in a Three Tier E-Commerce System
SRDS '07 Proceedings of the 26th IEEE International Symposium on Reliable Distributed Systems
High speed and robust event correlation
IEEE Communications Magazine
Adaptive diagnosis in distributed systems
IEEE Transactions on Neural Networks
In fault diagnosis for Internet services, detecting and localizing high-level failures is both important and challenging. Diagnosis methods that passively collect information have two drawbacks: 1) they require the target system to report its internal messages; 2) they cannot detect and locate faults before users notice them. This paper proposes an active diagnosis method that tests an Internet service with probes and infers faults from the probe results. The probing method is proactive, adaptive, and low-cost. We evaluate it by applying it to the J2EE application "Pet Store" and comparing it with Pinpoint, an existing passive method, and show that our method outperforms Pinpoint.
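The abstract does not specify how faults are inferred from probe results, so the following is only a minimal sketch of one simple possibility: set-based elimination under a single-fault assumption. The probe names, component names, and the `localize_faults` function are hypothetical illustrations and are not taken from the paper.

```python
from typing import Dict, Set

# Hypothetical map from each probe to the components it exercises.
# These names are illustrative assumptions, not from the paper.
PROBE_DEPENDENCIES: Dict[str, Set[str]] = {
    "browse_catalog": {"web", "catalog_ejb", "database"},
    "view_cart": {"web", "cart_ejb"},
    "checkout": {"web", "cart_ejb", "order_ejb", "database"},
}


def localize_faults(
    results: Dict[str, bool],
    deps: Dict[str, Set[str]] = PROBE_DEPENDENCIES,
) -> Set[str]:
    """Return suspect components given probe outcomes (True = probe passed).

    Assumes a single faulty component: a suspect must lie on every
    failed probe's path and on no passed probe's path.
    """
    failed = [deps[name] for name, ok in results.items() if not ok]
    if not failed:
        return set()  # every probe passed, nothing to localize
    # Components exercised by at least one passing probe are exonerated.
    passed = set().union(*(deps[name] for name, ok in results.items() if ok))
    # Components common to all failed probes remain candidates.
    return set.intersection(*failed) - passed


if __name__ == "__main__":
    outcome = {"browse_catalog": False, "view_cart": True, "checkout": False}
    print(localize_faults(outcome))
```

In this sketch a passing probe exonerates every component it touches, which is what lets a small number of well-chosen probes narrow the suspect set; an adaptive scheme would pick the next probe based on the suspects that remain.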