Machine Learning
Bug isolation via remote program sampling
PLDI '03 Proceedings of the ACM SIGPLAN 2003 conference on Programming language design and implementation
Self-testing software probe system for failure detection and diagnosis
CASCON '94 Proceedings of the 1994 conference of the Centre for Advanced Studies on Collaborative research
The Temporal and Topological Characteristics of BGP Path Changes
ICNP '03 Proceedings of the 11th IEEE International Conference on Network Protocols
Locating internet routing instabilities
Proceedings of the 2004 conference on Applications, technologies, architectures, and protocols for computer communications
Using computers to diagnose computer problems
HOTOS'03 Proceedings of the 9th conference on Hot Topics in Operating Systems - Volume 9
Path-based faliure and evolution management
NSDI'04 Proceedings of the 1st conference on Symposium on Networked Systems Design and Implementation - Volume 1
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Approximation algorithms for combinatorial problems
Journal of Computer and System Sciences
Detecting application-level failures in component-based Internet services
IEEE Transactions on Neural Networks
Using queries for distributed monitoring and forensics
Proceedings of the 1st ACM SIGOPS/EuroSys European Conference on Computer Systems 2006
Discrete control for safe execution of IT automation workflows
Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
Snitch: interactive decision trees for troubleshooting misconfigurations
SYSML'07 Proceedings of the 2nd USENIX workshop on Tackling computer systems problems with machine learning techniques
Nail-it-down: nailing and fixing configuration faults in cloud environments
Proceedings of the ACM International Conference on Computing Frontiers
Hi-index | 0.00 |
Root cause localization, the process of identifying the source of problems in a system using purely external observations, is a significant challenge in many large-scale systems. In this paper, we propose an abstract model that captures the common issues underlying root cause localization and hence provides the ability to leverage solutions across different systems.