Locality-aware request distribution in cluster-based network servers
Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
Network dispatcher: a connection router for scalable Internet services
WWW7 Proceedings of the seventh international conference on World Wide Web 7
On choosing a task assignment policy for a distributed server system
Journal of Parallel and Distributed Computing - Special issue on software support for distributed computing
The state of the art in locally distributed Web-server systems
ACM Computing Surveys (CSUR)
A scalable and highly available system for serving dynamic data at frequently accessed web sites
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
EQUILOAD: a load balancing policy for clustered web servers
Performance Evaluation
High-Performance Web Site Design Techniques
IEEE Internet Computing
Lessons Learned Administering Netscape's Internet Site
IEEE Internet Computing
A scalable and highly available web server
COMPCON '96 Proceedings of the 41st IEEE International Computer Conference
Interposed request routing for scalable network storage
OSDI'00 Proceedings of the 4th conference on Symposium on Operating System Design & Implementation - Volume 4
PBS: a unified priority-based scheduler
Proceedings of the 2007 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Scalable content-aware request distribution in cluster-based networks servers
ATEC '00 Proceedings of the annual conference on USENIX Annual Technical Conference
Efficient support for P-HTTP in cluster-based web servers
ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference
Session Initiation Protocol (SIP) Server Overload Control: Design and Evaluation
Principles, Systems and Applications of IP Telecommunications. Services and Security for Next Generation Networks
RSVP: a new resource reservation protocol
IEEE Communications Magazine - Part Anniversary
Hi-index | 0.00 |
This paper introduces several novel load-balancing algorithms for distributing Session Initiation Protocol (SIP) requests to a cluster of SIP servers. Our load balancer improves both throughput and response time versus a single node while exposing a single interface to external clients. We present the design, implementation, and evaluation of our system using a cluster of Intel ×86 machines running Linux. We compare our algorithms to several well-known approaches and present scalability results for up to 10 nodes. Our best algorithm, Transaction Least-Work-Left (TLWL), achieves its performance by integrating several features: knowledge of the SIP protocol, dynamic estimates of back-end server load, distinguishing transactions from calls, recognizing variability in call length, and exploiting differences in processing costs for different SIP transactions. By combining these features, our algorithm provides finer-grained load balancing than standard approaches, resulting in throughput improvements of up to 24% and response-time improvements of up to two orders of magnitude. We present a detailed analysis of occupancy to show how our algorithms significantly reduce response time.