Dynamic capacity provisioning is a useful technique for handling the multi-time-scale variations seen in Internet workloads. In this article, we propose a novel dynamic provisioning technique for multi-tier Internet applications that employs (1) a flexible queuing model that determines how many resources to allocate to each tier of the application, and (2) a combination of predictive and reactive methods that determines when to provision these resources, at both large and small time scales. We also propose a novel data center architecture based on virtual machine monitors to reduce provisioning overheads. Our experiments on a forty-machine Xen/Linux-based hosting platform demonstrate the responsiveness of our technique in handling dynamic workloads. In one scenario where a flash crowd caused the workload of a three-tier application to double, our technique doubled the application capacity within five minutes, thus maintaining response-time targets. Our technique also reduced the overhead of switching servers across applications from several minutes to less than a second, while meeting the performance targets of residual sessions.
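To make the "how much" and "when" questions concrete, the following is a minimal sketch rather than the technique evaluated in the article. The abstract does not specify the queuing model, so the sketch assumes each server behaves as a G/G/1 queue and uses Kingman's upper bound on mean waiting time to derive a per-server capacity. It also assumes perfect load balancing within a tier, that every request visits every tier once, and a simple threshold rule for the reactive correction. All function names, parameters, and numbers are hypothetical.

```python
import math


def per_server_capacity(s, d, var_a, var_b):
    """Highest arrival rate (req/s) one server can take while Kingman's
    G/G/1 bound keeps the mean response time at or below d seconds.

    s: mean service time (s); var_a, var_b: variances of inter-arrival
    and service times (s^2). All names here are illustrative assumptions.
    """
    if d <= s:
        raise ValueError("response-time target must exceed mean service time")
    return 1.0 / (s + (var_a + var_b) / (2.0 * (d - s)))


def servers_needed(rate, s, d, var_a, var_b):
    """Servers a tier needs for `rate` req/s, assuming perfect load balancing."""
    return math.ceil(rate / per_server_capacity(s, d, var_a, var_b))


def rate_to_provision(predicted, observed, tolerance=0.1):
    """Reactive correction (assumed rule): trust the forecast unless the
    observed rate exceeds it by more than `tolerance`, then use what is seen."""
    return observed if observed > predicted * (1.0 + tolerance) else predicted


def provision(tiers, predicted, observed):
    """Per-tier server counts; assumes each request visits every tier once."""
    rate = rate_to_provision(predicted, observed)
    return {name: servers_needed(rate, **p) for name, p in tiers.items()}


if __name__ == "__main__":
    # Hypothetical three-tier application; the parameters are made up.
    tiers = {
        "web": dict(s=0.01, d=0.05, var_a=0.0004, var_b=0.0004),
        "app": dict(s=0.02, d=0.10, var_a=0.0004, var_b=0.0012),
        "db": dict(s=0.005, d=0.025, var_a=0.0004, var_b=0.0),
    }
    print(provision(tiers, predicted=990, observed=1000))  # forecast holds
    print(provision(tiers, predicted=990, observed=1980))  # flash crowd
```

In this sketch the forecast sets the baseline allocation, and the reactive check overrides it only when the observed load departs from the forecast by more than the tolerance. That is one simple way to combine planning at a long time scale with correction at a short one.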