Dynamic estimation of CPU demand of web traffic

Authors:
Giovanni Pacifici;Wolfgang Segmuller;Mike Spreitzer;Asser Tantawi
Affiliations:
IBM T.J. Watson Research Center, Yorktown Heights, NY;IBM T.J. Watson Research Center, Yorktown Heights, NY;IBM T.J. Watson Research Center, Yorktown Heights, NY;IBM T.J. Watson Research Center, Yorktown Heights, NY
Venue:
valuetools '06 Proceedings of the 1st international conference on Performance evaluation methodolgies and tools
Year:
2006

Citing 3
Cited 19

Sharc: Managing CPU and Network Bandwidth in Shared Clusters

IEEE Transactions on Parallel and Distributed Systems
Resource overbooking and application profiling in shared hosting platforms

OSDI '02 Proceedings of the 5th symposium on Operating systems design and implementationCopyright restrictions prevent ACM from being able to make the PDFs for this conference available for downloading
Performance management for cluster-based web services

IEEE Journal on Selected Areas in Communications

A scalable application placement controller for enterprise data centers

Proceedings of the 16th international conference on World Wide Web
Server virtualization in autonomic management of heterogeneous workloads

ACM SIGOPS Operating Systems Review
Enabling resource sharing between transactional and batch workloads using dynamic application placement

Proceedings of the 9th ACM/IFIP/USENIX International Conference on Middleware
Run-time resource management in SOA virtualized environments

Proceedings of the 1st international workshop on Quality of service-oriented software systems
Rate-based SIP flow management for SLA satisfaction

IM'09 Proceedings of the 11th IFIP/IEEE international conference on Symposium on Integrated Network Management
Joint admission control and resource allocation in virtualized servers

Journal of Parallel and Distributed Computing
Self-adaptive resource management for large-scale shared clusters

Journal of Computer Science and Technology
Untangling mixed information to calibrate resource utilization in virtual machines

Proceedings of the 8th ACM international conference on Autonomic computing
Gossip-based resource allocation for green computing in large clouds

Proceedings of the 7th International Conference on Network and Services Management
Automated simulation-based capacity planning for enterprise data fabrics

Proceedings of the 4th International ICST Conference on Simulation Tools and Techniques
Statistical inference of software performance models for parametric performance completions

QoSA'10 Proceedings of the 6th international conference on Quality of Software Architectures: research into Practice - Reality and Gaps
Efficient experiment selection in automated software performance evaluations

EPEW'11 Proceedings of the 8th European conference on Computer Performance Engineering
Resource-aware adaptive scheduling for mapreduce clusters

Middleware'11 Proceedings of the 12th ACM/IFIP/USENIX international conference on Middleware
Dual time-scale distributed capacity allocation and load redirect algorithms for cloud systems

Journal of Parallel and Distributed Computing
Resource-aware adaptive scheduling for MapReduce clusters

Proceedings of the 12th International Middleware Conference
Algorithms for Web service selection with static and dynamic requirements

Service Oriented Computing and Applications
On-the-fly capacity planning

Proceedings of the 2013 ACM SIGPLAN international conference on Object oriented programming systems languages & applications
On the effects of omitting information exchange between autonomous resource management agents

AIMS'13 Proceedings of the 7th IFIP WG 6.6 international conference on Autonomous Infrastructure, Management, and Security: emerging management mechanisms for the future internet - Volume 7943
Constructing performance model of JMS middleware platform

Proceedings of the 5th ACM/SPEC international conference on Performance engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

Managing the resources in a large Web serving system requires knowledge of the resource needs for service request-s of various kinds, and these needs may change over time. Assessing resource needs is commonly performed using techniques such as offline profiling, application instrumentation, and kernel-based instrumentation. Little attention has been given to the dynamic estimation of dynamic resource needs, relying only on external and high-level measurements such as overall resource utilization and request rates. We consider the problem of dynamically estimating dynamic CPU demands of multiple kinds of requests using CPU utilization and throughput measurements. We formulate the problem as a linear regression problem and obtain its basic solution. However, in practice one is faced with issues such as insignificant flows, collinear flows, space and temporal variations, and background noise. In order to deal with such issues, we present several mechanisms such as data aging, flow rejection, flow combining, noise reduction, and smoothing. We implemented these techniques in a Work Profiler component that we delivered as part of a broader system management product. We present experimental results from using this component in scenarios inspired by real-world usage of that product; our technique produces estimates that are roughly within a factor of 2 of the right answer, for the request flows that draw significant CPU power.