Architecture and Dependability of Large-Scale Internet Services
IEEE Internet Computing
Managing Web server performance with AutoTune agents
IBM Systems Journal
Automated Cluster-Based Web Service Performance Tuning
HPDC '04 Proceedings of the 13th IEEE International Symposium on High Performance Distributed Computing
STRIDER: A Black-box, State-based Approach to Change and Configuration Management and Support
LISA '03 Proceedings of the 17th USENIX conference on System administration
SmartFrog Meets LCFG: Autonomous Reconfiguration with Central Policy Control
LISA '03 Proceedings of the 17th USENIX conference on System administration
Quantifying the Performability of Cluster-Based Services
IEEE Transactions on Parallel and Distributed Systems
Discovering Correctness Constraints for Self-Management of System Configuration
ICAC '04 Proceedings of the First International Conference on Autonomic Computing
Performance modeling and system management for multi-component online services
NSDI'05 Proceedings of the 2nd conference on Symposium on Networked Systems Design & Implementation - Volume 2
The collective: a cache-based system management architecture
NSDI'05 Proceedings of the 2nd conference on Symposium on Networked Systems Design & Implementation - Volume 2
Understanding and dealing with operator mistakes in internet services
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Configuration debugging as search: finding the needle in the haystack
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Automatic misconfiguration troubleshooting with peerpressure
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Why do internet services fail, and what can be done about it?
USITS'03 Proceedings of the 4th conference on USENIX Symposium on Internet Technologies and Systems - Volume 4
Towards automatic reverse engineering of software security configurations
Proceedings of the 15th ACM conference on Computer and communications security
Boosting the performance of computing systems through adaptive configuration tuning
Proceedings of the 2009 ACM symposium on Applied Computing
Barricade: defending systems against operator mistakes
Proceedings of the 5th European conference on Computer systems
JustRunIt: experiment-based management of virtualized data centers
USENIX'09 Proceedings of the 2009 conference on USENIX Annual technical conference
MassConf: automatic configuration tuning by leveraging user community information
Proceedings of the 2nd ACM/SPEC International Conference on Performance engineering
An empirical study on configuration errors in commercial and open source systems
SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
URL: A unified reinforcement learning approach for autonomic cloud management
Journal of Parallel and Distributed Computing
DejaVu: accelerating resource allocation in virtualized environments
ASPLOS XVII Proceedings of the seventeenth international conference on Architectural Support for Programming Languages and Operating Systems
X-ray: automating root-cause diagnosis of performance anomalies in production software
OSDI'12 Proceedings of the 10th USENIX conference on Operating Systems Design and Implementation
Rapid adjustment and adoption to MIaaS clouds
Proceedings of the Industrial Track of the 13th ACM/IFIP/USENIX International Middleware Conference
Gunther: search-based auto-tuning of mapreduce
Euro-Par'13 Proceedings of the 19th international conference on Parallel Processing
Hi-index | 0.00 |
Recent research has found that operators frequently misconfigure Internet services, causing various availability and performance problems. In this paper, we propose a software infrastructure that eliminates several types of misconfiguration by automating the generation of configuration files in Internet services, even as the services evolve. The infrastructure comprises a custom scripting language, configuration file templates, communicating runtime monitors, and heuristic algorithms to detect dependencies between configuration parameters and select ideal configurations. To demonstrate our infrastructure experimentally, we apply it to a realistic online auction service. Our results show that the infrastructure can simplify operation significantly while eliminating 58% of the misconfigurations found in a previous study of the same service. Furthermore, our results show that the infrastructure can efficiently determine the configuration parameters that lead to high performance as the service evolves through a hardware upgrade and the scheduled maintenance of a few nodes.