Cluster-based scalable network services
Proceedings of the sixteenth ACM symposium on Operating systems principles
Locality-aware request distribution in cluster-based network servers
Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
Web content adaptation to improve server overload behavior
WWW '99 Proceedings of the eighth international conference on World Wide Web
A statistical approach to predictive detection
Computer Networks: The International Journal of Computer and Telecommunications Networking - Special issue on selected topics in network and systems management
Managing energy and server resources in hosting centers
SOSP '01 Proceedings of the eighteenth ACM symposium on Operating systems principles
Performance Guarantees for Web Server End-Systems: A Control-Theoretical Approach
IEEE Transactions on Parallel and Distributed Systems
Kernel Mechanisms for Service Differentiation in Overloaded Web Servers
Proceedings of the General Track: 2002 USENIX Annual Technical Conference
On admission control for profit maximization of networked service providers
WWW '03 Proceedings of the 12th international conference on World Wide Web
Dynamic Virtual Clusters in a Grid Site Manager
HPDC '03 Proceedings of the 12th IEEE International Symposium on High Performance Distributed Computing
A method for transparent admission control and request scheduling in e-commerce web sites
Proceedings of the 13th international conference on World Wide Web
Adaptive overload control for busy internet servers
USITS'03 Proceedings of the 4th conference on USENIX Symposium on Internet Technologies and Systems - Volume 4
Model-based resource provisioning in a web service utility
USITS'03 Proceedings of the 4th conference on USENIX Symposium on Internet Technologies and Systems - Volume 4
eQoS: Provisioning of Client-Perceived End-to-End QoS Guarantees in Web Servers
IEEE Transactions on Computers
Cataclysm: Scalable overload policing for internet applications
Journal of Network and Computer Applications
Improving the performances of JMS-based applications
International Journal of Autonomic Computing
Modeling and predicting end-to-end response times in multi-tier internet applications
ITC20'07 Proceedings of the 20th international teletraffic conference on Managing traffic performance in converged networks
DynaQoS: model-free self-tuning fuzzy control of virtualized resources for QoS provisioning
Proceedings of the Nineteenth International Workshop on Quality of Service
From Autonomic to Self-Self Behaviors: The JADE Experience
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Optimal resource allocation in synchronized multi-tier Internet services
Performance Evaluation
Performance Evaluation of Overload Control in Multi-cluster Grids
GRID '11 Proceedings of the 2011 IEEE/ACM 12th International Conference on Grid Computing
A self-healing web server using differentiated services
ICSOC'06 Proceedings of the 4th international conference on Service-Oriented Computing
A proactive distributed qos control framework for cluster web site
HPCC'06 Proceedings of the Second international conference on High Performance Computing and Communications
Distributed workload and response time management for web applications
Proceedings of the 7th International Conference on Network and Services Management
Can dynamic provisioning and rejuvenation systems coexist in peace?
DSOM'05 Proceedings of the 16th IFIP/IEEE Ambient Networks international conference on Distributed Systems: operations and Management
Towards transparent and distributed workload management for large scale web servers
Future Generation Computer Systems
Hi-index | 0.01 |
In this paper we present the Cataclysm server platform for handling extreme overloads in hosted Internet applications. The primary contribution of our work is to develop a low overhead, highly scalable admission control technique for Internet applications. Cataclysm provides several desirable features, such as guarantees on response time by conducting accurate size-based admission control, revenue maximization at multiple time-scales via preferential admission of important requests and dynamic capacity provisioning, and the ability to be operational even under extreme overloads. Cataclysm can transparently trade-off the accuracy of its decision making with the intensity of the workload allowing it to handle incoming rates of several tens of thousands of requests/second. We implement a prototype Cataclysm hosting platform on a Linux cluster and demonstrate the benefits of our integrated approach using a variety of workloads.