In network (e.g., Web) servers, it is often desirable to isolate the performance of different classes of requests from each other. That is, the goal is to guarantee that a certain minimum proportion of server resources is available to a class of requests, independent of the load imposed by other requests. Recent work demonstrates how to achieve this performance isolation in servers consisting of a single, centralized node; however, achieving performance isolation in a distributed, cluster-based server remains a problem.

This paper introduces a new abstraction, the cluster reserve, which represents a resource principal in a cluster-based network server. We present a design and evaluate a prototype implementation that extends existing techniques for performance isolation on a single-node server to cluster-based servers.

In our design, the dynamic cluster-wide resource management problem is formulated as a constrained optimization problem, with the resource allocations on individual machines as independent variables, and the desired cluster-wide resource allocations as constraints. Periodically collected resource usages serve as further inputs to the problem.

Experimental results show that cluster reserves are effective in providing performance isolation in cluster-based servers. We demonstrate that, in a number of different scenarios, cluster reserves ensure performance isolation while enabling high utilization of the server resources.
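
To make the roles of variables, constraints, and inputs in this formulation concrete, the following is a minimal sketch and not the paper's algorithm. It assumes every node has the same capacity (1.0), that each class's cluster-wide reservation is given as a fraction of the whole cluster, and that recent per-node usage is available as a fraction of one node. The function names `split_reserves` and `overcommitted_nodes` are hypothetical. Instead of solving an optimization problem, the sketch uses a simple proportional heuristic: it satisfies the cluster-wide constraint for each class by construction, then reports any node whose per-node allocations add up to more than its capacity.

```python
from typing import Dict, List


def split_reserves(targets: Dict[str, float],
                   usage: Dict[str, List[float]]) -> Dict[str, List[float]]:
    """Split each class's cluster-wide reservation across nodes.

    targets: desired fraction of the whole cluster for each request class.
    usage:   recently measured usage of each class on each node,
             as a fraction of one node (hypothetical input format).
    Returns the per-node allocation of each class, as a fraction of one node.
    """
    alloc: Dict[str, List[float]] = {}
    for cls, share in targets.items():
        per_node = usage[cls]
        n = len(per_node)
        budget = share * n  # cluster-wide amount, measured in node units
        total = sum(per_node)
        if total == 0:
            # No demand observed anywhere: spread the reservation evenly.
            alloc[cls] = [share] * n
        else:
            # Place the reservation where the class's load actually is.
            alloc[cls] = [budget * u / total for u in per_node]
    return alloc


def overcommitted_nodes(alloc: Dict[str, List[float]]) -> List[int]:
    """Return indices of nodes whose allocations sum to more than 1.0."""
    n = len(next(iter(alloc.values())))
    return [j for j in range(n)
            if sum(a[j] for a in alloc.values()) > 1.0 + 1e-9]


if __name__ == "__main__":
    targets = {"A": 0.5, "B": 0.5}
    usage = {"A": [0.6, 0.2], "B": [0.3, 0.5]}
    alloc = split_reserves(targets, usage)
    print(alloc)
    print("overcommitted nodes:", overcommitted_nodes(alloc))
```

In the example, both classes receive their full cluster-wide share, but the allocations on the first node exceed its capacity. The heuristic cannot resolve that conflict on its own, which is why the per-node capacity limits have to be handled together with the cluster-wide constraints, as in the optimization formulation described above.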