Lessons from Giant-Scale Services
IEEE Internet Computing
Networked Windows NT System Field Failure Data Analysis
PRDC '99 Proceedings of the 1999 Pacific Rim International Symposium on Dependable Computing
Why Do Internet Services Fail, and What Can Be Done About It?
System administrators are users, too: designing workspaces for managing internet-scale systems
CHI '03 Extended Abstracts on Human Factors in Computing Systems
Commercial Fault Tolerance: A Tale of Two Systems
IEEE Transactions on Dependable and Secure Computing
Empirical Characterization of Session-Based Workload and Reliability for Web Servers
Empirical Software Engineering
IBM Journal of Research and Development - IBM BladeCenter systems
An online evolutionary approach to developing internet services
EW 10 Proceedings of the 10th workshop on ACM SIGOPS European workshop
Why do internet services fail, and what can be done about it?
USITS'03 Proceedings of the 4th conference on USENIX Symposium on Internet Technologies and Systems - Volume 4
Automatic configuration of internet services
Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
Fault-tolerant performance checking application for distributed computing and supply chain networks
Journal of Computational Methods in Sciences and Engineering - Selected papers from the International Conference on Computer Science, Software Engineering, Information Technology, e-Business, and Applications, 2004
Autonomous Decentralized System for Service Assurance and Its Application
ISAS '07 Proceedings of the 4th international symposium on Service Availability
Active Diagnosis of High-Level Faults in Distributed Internet Services
APNOMS '08 Proceedings of the 11th Asia-Pacific Symposium on Network Operations and Management: Challenges for Next Generation Network Operations and Service Management
Achieving Self-Healing in Autonomic Software Systems: a Case-Based Reasoning Approach
Proceedings of the 2005 conference on Self-Organization and Autonomic Informatics (I)
Case-based reasoning for autonomous service failure diagnosis and remediation in software systems
ECCBR'06 Proceedings of the 8th European conference on Advances in Case-Based Reasoning
The Journal of Supercomputing
An analysis of the architectures and causes of failure at three large-scale Internet services can help developers plan reliable systems offering maximum availability.