A scalable e-commerce server should be able to provide different levels of quality of service (QoS) to different types of requests based on clients' navigation patterns and the server capacity. E-commerce workloads are composed of sessions. In this paper, we propose a session-based two-dimensional (2D) service differentiation model for online transactions, with an intersession dimension and an intrasession dimension. The intersession model aims to provide different levels of QoS to sessions from different customer classes, and the intrasession model aims to provide different levels of QoS to requests in different states of a session. A primary performance metric of online transactions is slowdown, which measures the waiting time of a request relative to its service time. We present a processing rate allocation scheme for 2D proportional slowdown differentiation. We then introduce service slowdown as a systemwide QoS metric of an e-commerce server. It is defined as the weighted sum of request slowdown in different sessions and in different session states. We formulate the problem of 2D service differentiation as an optimization of processing rate allocation with the objective of minimizing the service slowdown of the server. We prove that the rate allocation scheme derived from this optimization guarantees that client requests' slowdown is square-root proportional to their prespecified differentiation weights in both the intersession and intrasession dimensions. We evaluate this square-root proportional rate allocation scheme and a proportional rate allocation scheme via extensive simulations. The results confirm that both schemes can achieve predictable, controllable, and fair 2D service differentiation on e-commerce servers. The square-root proportional rate allocation scheme provides 2D service differentiation at a minimum cost of service slowdown.
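The two metrics defined in the abstract can be illustrated with a minimal Python sketch. It is not the paper's rate allocation scheme; it only encodes the stated definitions of slowdown and service slowdown, plus a check of the square-root proportionality property read literally from the abstract. All function names, the `(customer_class, session_state)` keying, and the sample numbers are assumptions made for illustration.

```python
import math
from typing import Dict, Tuple

# Hypothetical key: (customer class, session state), one cell of the 2D model.
Cell = Tuple[str, str]


def slowdown(waiting_time: float, service_time: float) -> float:
    """Slowdown of a request: its waiting time relative to its service time."""
    if service_time <= 0:
        raise ValueError("service_time must be positive")
    return waiting_time / service_time


def service_slowdown(mean_slowdown: Dict[Cell, float],
                     weights: Dict[Cell, float]) -> float:
    """Systemwide metric: weighted sum of request slowdown over all cells.

    How the weights are derived is not given in the abstract; here they
    are simply supplied by the caller (an assumption).
    """
    return sum(weights[cell] * s for cell, s in mean_slowdown.items())


def is_sqrt_proportional(s_a: float, s_b: float,
                         w_a: float, w_b: float,
                         rel_tol: float = 1e-9) -> bool:
    """Check s_a / s_b == sqrt(w_a / w_b), a literal reading of
    'slowdown is square-root proportional to the differentiation weights'."""
    return math.isclose(s_a / s_b, math.sqrt(w_a / w_b), rel_tol=rel_tol)


# Illustrative numbers only, not results from the paper.
example_slowdowns = {("premium", "checkout"): 4.0, ("basic", "browse"): 2.0}
example_weights = {("premium", "checkout"): 0.5, ("basic", "browse"): 0.25}
example_total = service_slowdown(example_slowdowns, example_weights)
```

With these sample values, a request that waits 6 time units for 2 units of service has a slowdown of 3, and the weighted sum over the two cells is the quantity the optimization in the abstract seeks to minimize.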