A measurement-based admission control algorithm for integrated services packet networks
SIGCOMM '95 Proceedings of the conference on Applications, technologies, architectures, and protocols for computer communication
SEDA: an architecture for well-conditioned, scalable internet services
SOSP '01 Proceedings of the eighteenth ACM symposium on Operating systems principles
Performance Guarantees for Web Server End-Systems: A Control-Theoretical Approach
IEEE Transactions on Parallel and Distributed Systems
Session-Based Admission Control: A Mechanism for Peak Load Management of Commercial Web Sites
IEEE Transactions on Computers
Kernel Mechanisms for Service Differentiation in Overloaded Web Servers
Proceedings of the General Track: 2002 USENIX Annual Technical Conference
Dynamic Virtual Clusters in a Grid Site Manager
HPDC '03 Proceedings of the 12th IEEE International Symposium on High Performance Distributed Computing
SODA: A Service-On-Demand Architecture for Application Service Hosting Utility Platforms
HPDC '03 Proceedings of the 12th IEEE International Symposium on High Performance Distributed Computing
Priority service and max-min fairness
IEEE/ACM Transactions on Networking (TON)
A method for transparent admission control and request scheduling in e-commerce web sites
Proceedings of the 13th international conference on World Wide Web
Replicating Web Applications On-Demand
SCC '04 Proceedings of the 2004 IEEE International Conference on Services Computing
Cataclysm: policing extreme overloads in internet applications
WWW '05 Proceedings of the 14th international conference on World Wide Web
Computing on the edge: a platform for replicating internet applications
Web content caching and distribution
Session-Based Adaptive Overload Control for Secure Dynamic Web Applications
ICPP '05 Proceedings of the 2005 International Conference on Parallel Processing
Dynamic placement for clustered web applications
Proceedings of the 15th international conference on World Wide Web
Controlling Quality of Service in Multi-Tier Web Applications
ICDCS '06 Proceedings of the 26th IEEE International Conference on Distributed Computing Systems
Provisioning servers in the application tier for e-commerce systems
ACM Transactions on Internet Technology (TOIT)
A scalable application placement controller for enterprise data centers
Proceedings of the 16th international conference on World Wide Web
Agile dynamic provisioning of multi-tier Internet applications
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Resource allocation optimization for quantitative service differentiation on server clusters
Journal of Parallel and Distributed Computing
Autonomic Provisioning of Backend Databases in Dynamic Content Web Servers
ICAC '06 Proceedings of the 2006 IEEE International Conference on Autonomic Computing
Efficient application placement in a dynamic hosting platform
Proceedings of the 18th international conference on World wide web
An adaptive feedback controller for SIP server memory overload protection
ICAC '09 Proceedings of the 6th international conference on Autonomic computing
Automated control in cloud computing: challenges and opportunities
ACDC '09 Proceedings of the 1st workshop on Automated control for datacenters and clouds
Dynamic resource allocation for shared data centers using online measurements
IWQoS'03 Proceedings of the 11th international conference on Quality of service
Journal of Network and Computer Applications
Dolly: virtualization-driven database provisioning for the cloud
Proceedings of the 7th ACM SIGPLAN/SIGOPS international conference on Virtual execution environments
Distributed workload and response time management for web applications
Proceedings of the 7th International Conference on Network and Services Management
A weighted-fair-queuing (WFQ)-based dynamic request scheduling approach in a multi-core system
Future Generation Computer Systems
Adaptive parallel job scheduling with resource admissible allocation on two-level hierarchical grids
Future Generation Computer Systems
Supporting CPU-based guarantees in cloud SLAs via resource-level QoS metrics
Future Generation Computer Systems
A control theory foundation for self-managing computing systems
IEEE Journal on Selected Areas in Communications
Decentralized scalable fairshare scheduling
Future Generation Computer Systems
Deadline-constrained workflow scheduling algorithms for Infrastructure as a Service Clouds
Future Generation Computer Systems
Hi-index | 0.00 |
The rapid expansion of cloud offerings poses fundamental tasks for workload management in a large scale server farm. In order to achieve satisfactory Quality of Service (QoS) and reduce operation cost, we present a fully distributed workload management system in a large scale server environment, e.g., cloud. Different from existing centralized control approaches, the workload management logic hierarchically spreads on each back-end server and front-end proxy. The control solution is designed to offer both overload protection and resource efficiency for the back-end servers, while achieving service differentiation based on Service Level Agreement (SLA). The proposed system can directly work with legacy software stack, because the implementation requires no changes to the target operating system, application servers, or web applications. Our evaluation shows that it achieves both overload protection and service classification under dynamic heavy workload. Furthermore, it also demonstrates negligible management overhead, satisfactory fault-tolerance and fast convergence.