Scalable content-aware request distribution in cluster-based network servers

Authors:
Mohit Aron;Darren Sanders;Peter Druschel;Willy Zwaenepoel
Affiliations:
Department of Computer Science, Rice University, Houston, TX;Department of Computer Science, Rice University, Houston, TX;Department of Computer Science, Rice University, Houston, TX;Department of Computer Science, Rice University, Houston, TX
Venue:
ATEC '00 Proceedings of the annual conference on USENIX Annual Technical Conference
Year:
2000


Abstract

We present a scalable architecture for content-aware request distribution in web server clusters. In this architecture, a level-4 switch acts as the point of contact for the server on the Internet and distributes the incoming requests to a number of back-end nodes. The switch does not perform any content-based distribution. This function is performed by each of the back-end nodes, which may forward the incoming request to another back-end based on the requested content. In terms of scalability, this architecture compares favorably to existing approaches where a front-end node performs content-based distribution. In our architecture, the expensive operations of TCP connection establishment and handoff are distributed among the back-ends, rather than being centralized in the front-end node. Only a minimal additional latency penalty is paid for much improved scalability. We have implemented this new architecture, and we demonstrate its superior scalability by comparing it to a system that performs content-aware distribution in the front-end, under both synthetic and trace-driven workloads.
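
The request flow described in the abstract can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the authors' implementation: the class names, the round-robin choice at the switch, and the hash-of-path policy in `choose_target` are all hypothetical stand-ins, and the TCP handoff is modeled as a simple counter increment rather than a real connection transfer.

```python
import itertools
import zlib


class BackEnd:
    """A back-end node that serves requests and makes content-based decisions."""

    def __init__(self, node_id, cluster_size):
        self.node_id = node_id
        self.cluster_size = cluster_size
        self.served = []      # paths this node ended up serving
        self.handoffs = 0     # connections this node passed on to a peer

    def choose_target(self, path):
        # Hypothetical policy (an assumption, not the paper's): a stable hash
        # of the requested path selects the node that should serve it.
        return zlib.crc32(path.encode("utf-8")) % self.cluster_size

    def accept(self, path, cluster):
        # The connection is established on this node, so the request content
        # is visible here and the content-based decision is made here.
        target = self.choose_target(path)
        if target != self.node_id:
            self.handoffs += 1          # stands in for a handoff to a peer
        cluster[target].served.append(path)
        return target


class Layer4Switch:
    """Front end that never inspects request content.

    Round-robin selection is an assumption made for this sketch.
    """

    def __init__(self, cluster):
        self.cluster = cluster
        self._next = itertools.cycle(range(len(cluster)))

    def dispatch(self, path):
        first_hop = self.cluster[next(self._next)]
        return first_hop.accept(path, self.cluster)


def run_demo(paths, cluster_size=3):
    """Send each path through the switch and record which node served it."""
    cluster = [BackEnd(i, cluster_size) for i in range(cluster_size)]
    switch = Layer4Switch(cluster)
    placement = {}
    for path in paths:
        placement.setdefault(path, set()).add(switch.dispatch(path))
    return cluster, placement
```

Calling `run_demo(["/a.html", "/b.gif", "/a.html"])` returns the nodes and a mapping from each path to the set of nodes that served it. In this sketch every path maps to exactly one node regardless of which back-end the switch picked first, and the handoff work is spread across the back-ends instead of accumulating at a single front-end.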