Wide-area traffic: the failure of Poisson modeling
SIGCOMM '94 Proceedings of the conference on Communications architectures, protocols and applications
Multilevel k-way hypergraph partitioning
Proceedings of the 36th annual ACM/IEEE Design Automation Conference
On the scale and performance of cooperative Web proxy caching
Proceedings of the seventeenth ACM symposium on Operating systems principles
Partitioning Customers Into Service Groups
Management Science
On choosing a task assignment policy for a distributed server system
Journal of Parallel and Distributed Computing - Special issue on software support for distributed computing
Trajectory sampling for direct traffic observation
Proceedings of the conference on Applications, Technologies, Architectures, and Protocols for Computer Communication
On maximizing service-level-agreement profits
ACM SIGMETRICS Performance Evaluation Review
Content-Aware Dispatching Algorithms for Cluster-Based Web Servers
Cluster Computing
Measuring the capacity of a streaming media server in a Utility Data Center environment
Proceedings of the tenth ACM international conference on Multimedia
Application specific data replication for edge services
WWW '03 Proceedings of the 12th international conference on World Wide Web
Integrated resource management for cluster-based Internet services
ACM SIGOPS Operating Systems Review - OSDI '02: Proceedings of the 5th symposium on Operating systems design and implementation
Dynamic cluster reconfiguration for power and performance
Compilers and operating systems for low power
Data Center Fundamentals
Data streaming algorithms for efficient and accurate estimation of flow size distribution
Proceedings of the joint international conference on Measurement and modeling of computer systems
Finding (Recently) Frequent Items in Distributed Data Streams
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Dynamic Provisioning of Multi-tier Internet Applications
ICAC '05 Proceedings of the Second International Conference on Automatic Computing
Fast payload-based flow estimation for traffic monitoring and network security
Proceedings of the 2005 ACM symposium on Architecture for networking and communications systems
Quorum: flexible quality of service for internet services
NSDI'05 Proceedings of the 2nd conference on Symposium on Networked Systems Design & Implementation - Volume 2
SLA Decomposition: Translating Service Level Objectives to System Level Thresholds
ICAC '07 Proceedings of the Fourth International Conference on Autonomic Computing
Understanding internet video sharing site workload: a view from data center design
Proceedings of the 17th international conference on World Wide Web
Admission control for statistical QoS: theory and practice
IEEE Network: The Magazine of Global Internetworking
SnowFlock: Virtual Machine Cloning as a First-Class Cloud Primitive
ACM Transactions on Computer Systems (TOCS)
Executing Data-Intensive Workloads in a Cloud
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Green data centers: Using hierarchies for scalable energy efficiency in large web clusters
Information Processing Letters
Hi-index | 0.00 |
In data centers hosting scaling Internet applications, operators face the tradeoff dilemma between resource efficiency and Quality of Service (QoS), and the root cause lies in workload dynamics. In this paper, we address the problem with the design of Resilient Workload Manager (ROM). ROM explicitly segregates base workload and trespassing workload, the two naturally different components in application workload, and manages them separately in two resource zones with specialized optimization techniques. As a comprehensive workload management framework, ROM covers workload, data, resource, and Quality of Service of the target applications. It features a fast workload factoring algorithm for distributing incoming application requests, not only on volume but also on content, between the two resource zones; integrated two-dimensional workload shaping, resource planning, and request dispatching schemes for efficient utilization of base workload zone resource; and a simple and high-performance system architecture for dynamic provisioning in trespassing workload zone. Through extensive evaluation, we showed ROM can achieve resource efficiency (e.g., 54.9% server saving) guarantee QoS (based on client-side perceived service quality), reduce data access overhead in the trespassing workload zone during peak load (up to two orders of magnitude), and be adaptive at processing speed (running faster at peak load periods than at regular periods).