Dynamic resource provisioning aims to keep the end-to-end response time of a web application within a predefined SLA. Although the topic has been well studied for monolithic applications, provisioning resources for applications composed of multiple services remains a challenge: when the SLA is violated, one must decide which service(s) should be reprovisioned for optimal effect. We propose to assign an SLA only to the front-end service; other services are not given any particular response time objectives. Services are autonomously responsible for their own provisioning operations and collaboratively negotiate performance objectives with each other to decide which service(s) to provision. We demonstrate through extensive experiments that our system can add, remove, and shift both servers and caches within an entire multi-service application under varying workloads to meet the SLA target and improve resource utilization.