Open issues and challenges in providing quality of service guarantees in high-speed networks
ACM SIGCOMM Computer Communication Review
Lazy receiver processing (LRP): a network subsystem architecture for server systems
OSDI '96 Proceedings of the second USENIX symposium on Operating systems design and implementation
Exploiting process lifetime distributions for dynamic load balancing
ACM Transactions on Computer Systems (TOCS)
Cluster-based scalable network services
Proceedings of the sixteenth ACM symposium on Operating systems principles
CPU reservations and time constraints: efficient, predictable scheduling of independent activities
Proceedings of the sixteenth ACM symposium on Operating systems principles
Self-similarity in World Wide Web traffic: evidence and possible causes
IEEE/ACM Transactions on Networking (TON)
Supporting quality of service in HTTP servers
PODC '98 Proceedings of the seventeenth annual ACM symposium on Principles of distributed computing
Locality-aware request distribution in cluster-based network servers
Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
Resource containers: a new facility for resource management in server systems
OSDI '99 Proceedings of the third symposium on Operating systems design and implementation
A feedback-driven proportion allocator for real-rate scheduling
OSDI '99 Proceedings of the third symposium on Operating systems design and implementation
Proportional differentiated services: delay differentiation and packet scheduling
Proceedings of the conference on Applications, technologies, architectures, and protocols for computer communication
Proceedings of the seventeenth ACM symposium on Operating systems principles
Cluster reserves: a mechanism for resource management in cluster-based network servers
Proceedings of the 2000 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
On maximizing service-level-agreement profits
Proceedings of the 3rd ACM conference on Electronic Commerce
Managing energy and server resources in hosting centers
SOSP '01 Proceedings of the eighteenth ACM symposium on Operating systems principles
Value-based scheduling in real-time database systems
The VLDB Journal — The International Journal on Very Large Data Bases
Cluster Load Balancing for Fine-Grain Network Services
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Kernel Mechanisms for Service Differentiation in Overloaded Web Servers
Proceedings of the General Track: 2002 USENIX Annual Technical Conference
Harvest, Yield, and Scalable Tolerant Systems
HOTOS '99 Proceedings of the The Seventh Workshop on Hot Topics in Operating Systems
Admission Control for Soft-Transactions in ACCORD
RTAS '97 Proceedings of the 3rd IEEE Real-Time Technology and Applications Symposium (RTAS '97)
Scalable, distributed data structures for internet service construction
OSDI'00 Proceedings of the 4th conference on Symposium on Operating System Design & Implementation - Volume 4
Neptune: scalable replication management and programming support for cluster-based network services
USITS'01 Proceedings of the 3rd conference on USENIX Symposium on Internet Technologies and Systems - Volume 3
Scalable content-aware request distribution in cluster-based networks servers
ATEC '00 Proceedings of the annual conference on USENIX Annual Technical Conference
Isolation with flexibility: a resource management framework for central servers
ATEC '00 Proceedings of the annual conference on USENIX Annual Technical Conference
The eclipse operating system: providing quality of service via reservation domains
ATEC '98 Proceedings of the annual conference on USENIX Annual Technical Conference
Eliminating receive livelock in an interrupt-driven kernel
ATEC '96 Proceedings of the 1996 annual conference on USENIX Annual Technical Conference
The multispace: an evolutionary platform for infrastructural services
ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference
Web server support for tiered services
IEEE Network: The Magazine of Global Internetworking
Optimizing data aggregation for cluster-based internet services
Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming
A method for transparent admission control and request scheduling in e-commerce web sites
Proceedings of the 13th international conference on World Wide Web
Supporting Cluster-Based Network Services on Functionally Symmetric Software Architecture
Proceedings of the 2004 ACM/IEEE conference on Supercomputing
Quantifying the Performability of Cluster-Based Services
IEEE Transactions on Parallel and Distributed Systems
Adaptive middleware for dynamic component-level deployment
ARM '05 Proceedings of the 4th workshop on Reflective and adaptive middleware systems
Controllable fair queuing for meeting performance goals
Performance Evaluation - Performance 2005
QoS for internet services: done right
Proceedings of the 11th workshop on ACM SIGOPS European workshop
Selective early request termination for busy internet services
Proceedings of the 15th international conference on World Wide Web
Self-Sizing of Clustered Databases
WOWMOM '06 Proceedings of the 2006 International Symposium on on World of Wireless, Mobile and Multimedia Networks
Journal of Parallel and Distributed Computing
IQU: practical queue-based user association management for WLANs
Proceedings of the 12th annual international conference on Mobile computing and networking
The leganet system: Freshness-aware transaction routing in a database cluster
Information Systems
Quality-of-service differentiation on the internet: a taxonomy
Journal of Network and Computer Applications - Special issue: Network and information security: A computational intelligence approach
A scalable application placement controller for enterprise data centers
Proceedings of the 16th international conference on World Wide Web
HOTDEP'06 Proceedings of the 2nd conference on Hot Topics in System Dependability - Volume 2
Performance modeling and system management for multi-component online services
NSDI'05 Proceedings of the 2nd conference on Symposium on Networked Systems Design & Implementation - Volume 2
Quorum: flexible quality of service for internet services
NSDI'05 Proceedings of the 2nd conference on Symposium on Networked Systems Design & Implementation - Volume 2
Model-based resource provisioning in a web service utility
USITS'03 Proceedings of the 4th conference on USENIX Symposium on Internet Technologies and Systems - Volume 4
Xen and co.: communication-aware CPU scheduling for consolidated xen-based hosting platforms
Proceedings of the 3rd international conference on Virtual execution environments
SLA-Driven Clustering of QoS-Aware Application Servers
IEEE Transactions on Software Engineering
Agile dynamic provisioning of multi-tier Internet applications
ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Hardware counter driven on-the-fly request signatures
Proceedings of the 13th international conference on Architectural support for programming languages and operating systems
Replication degree customization for high availability
Proceedings of the 3rd ACM SIGOPS/EuroSys European Conference on Computer Systems 2008
Development platform for parallel image processing
SSIP'06 Proceedings of the 6th WSEAS International Conference on Signal, Speech and Image Processing
Resource allocation optimization for quantitative service differentiation on server clusters
Journal of Parallel and Distributed Computing
Journal of Systems and Software
Non-intrusive performance management for computer services
Proceedings of the ACM/IFIP/USENIX 2006 International Conference on Middleware
An integrated approach with feedback control for robust Web QoS design
Computer Communications
Managing responsiveness of virtual desktops using passive monitoring
IM'09 Proceedings of the 11th IFIP/IEEE international conference on Symposium on Integrated Network Management
Pricing-based strategies for autonomic control of web servers for time-varying request arrivals
Engineering Applications of Artificial Intelligence
Proceedings of the fifteenth edition of ASPLOS on Architectural support for programming languages and operating systems
Self-optimization of clustered message-oriented middleware
OTM'07 Proceedings of the 2007 OTM Confederated international conference on On the move to meaningful internet systems: CoopIS, DOA, ODBASE, GADA, and IS - Volume Part I
Resource allocation algorithms for virtualized service hosting platforms
Journal of Parallel and Distributed Computing
JustRunIt: experiment-based management of virtualized data centers
USENIX'09 Proceedings of the 2009 conference on USENIX Annual technical conference
AWAIT: Efficient overload management for busy multi-tier web services under bursty workloads
ICWE'10 Proceedings of the 10th international conference on Web engineering
Designing next generation data-centers with advanced communication protocols and systems services
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
HotDep'06 Proceedings of the Second conference on Hot topics in system dependability
Utility driven elastic services
Proceedings of the 11th IFIP WG 6.1 international conference on Distributed applications and interoperable systems
Non-intrusive performance management for computer services
Middleware'06 Proceedings of the 7th ACM/IFIP/USENIX international conference on Middleware
Implementation and evaluation of a middleware for self-organizing decentralized web services
SelfMan'06 Proceedings of the Second IEEE international conference on Self-Managed Networks, Systems, and Services
User-Perceived web qos measurement and evaluation system
APWeb'06 Proceedings of the 8th Asia-Pacific Web conference on Frontiers of WWW Research and Development
Developing an optimized application hosting framework in Clouds
Journal of Computer and System Sciences
Hi-index | 0.01 |
Client request rates for Internet services tend to be bursty and thus it is important to maintain efficient resource utilization under a wide range of load conditions. Network service clients typically seek services interactively and maintaining reasonable response time is often imperative for such services. In addition, providing differentiated service qualities and resource allocation to multiple service classes can also be desirable at times. This paper presents an integrated resource management framework (part of Neptune system) that provides flexible service quality specification, efficient resource utilization, and service differentiation for cluster-based services. This framework introduces the metric of quality-aware service yield to combine the overall system efficiency and individual service response time in one flexible model. Resources are managed through a two-level request distribution and scheduling scheme. At the cluster level, a fully decentralized request distribution architecture is employed to achieve high scalability and availability. Inside each service node, an adaptive scheduling policy maintains efficient resource utilization under a wide range of load conditions. Our trace-driven evaluations demonstrate the performance, scalability, and service differentiation achieved by the proposed techniques.