A case for caching file objects inside internetworks
SIGCOMM '93 Conference proceedings on Communications architectures, protocols and applications
The case for persistent-connection HTTP
SIGCOMM '95 Proceedings of the conference on Applications, technologies, architectures, and protocols for computer communication
Web server workload characterization: the search for invariants
Proceedings of the 1996 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Eliminating receive livelock in an interrupt-driven kernel
ACM Transactions on Computer Systems (TOCS)
Cluster-based scalable network services
Proceedings of the sixteenth ACM symposium on Operating systems principles
Locality-aware request distribution in cluster-based network servers
Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
Network dispatcher: a connection router for scalable Internet services
WWW7 Proceedings of the seventh international conference on World Wide Web 7
Soft timers: efficient microsecond software timer support for network processing
Proceedings of the seventeenth ACM symposium on Operating systems principles
Measuring the capacity of a Web server under realistic loads
World Wide Web
SWEB: Towards a Scalable World Wide Web Server on Multicomputers
IPPS '96 Proceedings of the 10th International Parallel Processing Symposium
A scalable and highly available web server
COMPCON '96 Proceedings of the 41st IEEE International Computer Conference
The case for geographical push-caching
HOTOS '95 Proceedings of the Fifth Workshop on Hot Topics in Operating Systems (HotOS-V)
Distributed Packet Rewriting and its Application to Scalable Server Architectures
ICNP '98 Proceedings of the Sixth International Conference on Network Protocols
On the performance of TCP splicing for URL-aware redirection
USITS'99 Proceedings of the 2nd conference on USENIX Symposium on Internet Technologies and Systems - Volume 2
Efficient support for content-based routing in web server clusters
USITS'99 Proceedings of the 2nd conference on USENIX Symposium on Internet Technologies and Systems - Volume 2
Exploring the bounds of web latency reduction from caching and prefetching
USITS'97 Proceedings of the USENIX Symposium on Internet Technologies and Systems on USENIX Symposium on Internet Technologies and Systems
Salamander: a push-based distribution substrate for internet applications
USITS'97 Proceedings of the USENIX Symposium on Internet Technologies and Systems on USENIX Symposium on Internet Technologies and Systems
A hierarchical internet object cache
ATEC '96 Proceedings of the 1996 annual conference on USENIX Annual Technical Conference
HACC: an architecture for cluster-based web servers
WINSYM'99 Proceedings of the 3rd conference on USENIX Windows NT Symposium - Volume 3
Optimistic deltas for WWW latency reduction
ATEC '97 Proceedings of the annual conference on USENIX Annual Technical Conference
Efficient support for P-HTTP in cluster-based web servers
ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference
Cluster reserves: a mechanism for resource management in cluster-based network servers
Proceedings of the 2000 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Critical path analysis of TCP transactions
Proceedings of the conference on Applications, Technologies, Architectures, and Protocols for Computer Communication
A client-aware dispatching algorithm for web clusters providing multiple services
Proceedings of the 10th international conference on World Wide Web
Efficiency vs. portability in cluster-based network servers
PPoPP '01 Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming
Critical path analysis of TCP transactions
IEEE/ACM Transactions on Networking (TON)
The state of the art in locally distributed Web-server systems
ACM Computing Surveys (CSUR)
Content management on server farm with layer-7 routing
Proceedings of the 2002 ACM symposium on Applied computing
Dynamically Selecting Optimal Distribution Strategies for Web Documents
IEEE Transactions on Computers
A load cluster management system using SNMP and web
International Journal of Network Management
Web transaction analysis and optimization (TAO)
WOSP '02 Proceedings of the 3rd international workshop on Software and performance
Cluster Computing
Content-Aware Dispatching Algorithms for Cluster-Based Web Servers
Cluster Computing
Affinity-based management of main memory database clusters
ACM Transactions on Internet Technology (TOIT)
Modeling and performance analysis of QoS-aware load balancing of web-server clusters
Computer Networks: The International Journal of Computer and Telecommunications Networking
Programming the Web with High-Level Programming Languages
ESOP '01 Proceedings of the 10th European Symposium on Programming Languages and Systems
A Fast Connection-Time Redirection Mechanism for Internet Application Scalability
HiPC '02 Proceedings of the 9th International Conference on High Performance Computing
Cluster Load Balancing for Fine-Grain Network Services
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
Handling Multiple Bottlenecks in Web Servers Using Adaptive Inbound Controls
PIHSN '02 Proceedings of the 7th IFIP/IEEE International Workshop on Protocols for High Speed Networks
Kernel Mechanisms for Service Differentiation in Overloaded Web Servers
Proceedings of the General Track: 2002 USENIX Annual Technical Conference
Modular TCP Handoff Design in STREAMS-Based TCP/IP Implementation
ICN '01 Proceedings of the First International Conference on Networking-Part 2
KECho - Event Communication for Distributed Kernel Services
ARCS '02 Proceedings of the International Conference on Architecture of Computing Systems: Trends in Network and Pervasive Computing
Content-Adaptive Request Distribution Strategy for Internet Server Cluster
AISA '02 Proceedings of the First International Workshop on Advanced Internet Services and Applications
Evaluating the Impact of Communication Architecture on the Performability of Cluster-Based Services
HPCA '03 Proceedings of the 9th International Symposium on High-Performance Computer Architecture
Integrated resource management for cluster-based Internet services
ACM SIGOPS Operating Systems Review - OSDI '02: Proceedings of the 5th symposium on Operating systems design and implementation
Critical path analysis of TCP transactions
ACM SIGCOMM Computer Communication Review - Workshop on data communication in Latin America and the Caribbean
Performance Guarantees for Cluster-Based Internet Services
ICDCS '03 Proceedings of the 23rd International Conference on Distributed Computing Systems
The Content Driven Mobile Internet
Wireless Personal Communications: An International Journal
A Tiered System for Serving Differentiated Content
World Wide Web
Synchronization analysis for decentralizing composite Web services
Proceedings of the 2003 ACM symposium on Applied computing
Cyclone: A High-Performance Cluster-Based Web Server with Socket Cloning
Cluster Computing
Content-aware cooperative caching for cluster-based web servers
Journal of Systems and Software
Research on scheduling algorithms in Web cluster servers
Journal of Computer Science and Technology
Adaptive hash routing for a cluster of client-side web proxies
Journal of Parallel and Distributed Computing
Workload-Aware Load Balancing for Clustered Web Servers
IEEE Transactions on Parallel and Distributed Systems
PRESS: A Clustered Server Based on User-Level Communication
IEEE Transactions on Parallel and Distributed Systems
Quantifying the Performability of Cluster-Based Services
IEEE Transactions on Parallel and Distributed Systems
Integrated resource management for cluster-based internet services
OSDI '02 Proceedings of the 5th symposium on Operating systems design and implementationCopyright restrictions prevent ACM from being able to make the PDFs for this conference available for downloading
Energy conservation in heterogeneous server clusters
Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of parallel programming
Weblins: a scalable WWW cluster-based server
Advances in Engineering Software
System support for web hosting services on server clusters
Computers and Electrical Engineering
Architecting Web sites for high performance
Scientific Programming
Making scheduling "cool": temperature-aware workload placement in data centers
ATEC '05 Proceedings of the annual conference on USENIX Annual Technical Conference
Using fault injection and modeling to evaluate the performability of cluster-based services
USITS'03 Proceedings of the 4th conference on USENIX Symposium on Internet Technologies and Systems - Volume 4
Anypoint: extensible transport switching on the edge
USITS'03 Proceedings of the 4th conference on USENIX Symposium on Internet Technologies and Systems - Volume 4
Heterogeneity-Aware Workload Distribution in Donation-Based Grids
International Journal of High Performance Computing Applications
Implementation and use of the PLT scheme Web server
Higher-Order and Symbolic Computation
Fingerprint verification on medical image reporting system
Computer Methods and Programs in Biomedicine
CEA'07 Proceedings of the 2007 annual Conference on International Conference on Computer Engineering and Applications
A request distribution method for clustered VOD servers considering buffer sharing effects
Journal of Systems Architecture: the EUROMICRO Journal
Coscheduled distributed-Web servers on system area network
Journal of Parallel and Distributed Computing
A content-based load balancing algorithm with admission control for cluster web servers
Future Generation Computer Systems
Cataclysm: Scalable overload policing for internet applications
Journal of Network and Computer Applications
Front end device for content networking
Proceedings of the conference on Design, automation and test in Europe
Journal of Systems and Software
DDoS-shield: DDoS-resilient scheduling to counter application layer attacks
IEEE/ACM Transactions on Networking (TON)
Weblins: A scalable WWW cluster-based server
Advances in Engineering Software
A practical learning-based approach for dynamic storage bandwidth allocation
IWQoS'03 Proceedings of the 11th international conference on Quality of service
A predictive and probabilistic load-balancing algorithm for cluster-based web servers
Applied Soft Computing
Resource provisioning in SLA-based cluster computing
JSSPP'10 Proceedings of the 15th international conference on Job scheduling strategies for parallel processing
An up-to-date survey in web load balancing
World Wide Web
Adaptive request distribution in cluster-based web system
KES'11 Proceedings of the 15th international conference on Knowledge-based and intelligent information and engineering systems - Volume Part I
Scalable distributed scheduling for quality of service
ISPA'06 Proceedings of the 2006 international conference on Frontiers of High Performance Computing and Networking
Design of security session reuse in content-based load distribution server
ICCSA'06 Proceedings of the 2006 international conference on Computational Science and Its Applications - Volume Part V
Analysis of security session reusing in distribution server system
ICCSA'06 Proceedings of the 2006 international conference on Computational Science and Its Applications - Volume Part V
Efficient prioritized service recovery using content-aware routing mechanism in web server cluster
ICN'05 Proceedings of the 4th international conference on Networking - Volume Part II
Adaptive admission control algorithm in a QoS-aware Web system
Information Sciences: an International Journal
Design, implementation, and performance of a load balancer for SIP server clusters
IEEE/ACM Transactions on Networking (TON)
Distributed and Parallel Databases
Hi-index | 0.00 |
We present a scalable architecture for content-aware request distribution in web server clusters. In this architecture, a level-4 switch acts as the point of contact for the server on the Inernet and distributes the incoming requests to a number of back-end nodes. The switch does not perform any contect-based distribution. This function is performed by each of the back-end nodes, which may forward the incoming request to another back-end based on the requested contect. In terms of scalability, this architecture compares favorably to existing approaches where a front-end node performs contect-based distribution. In our architecture, the expensive operations of TCP connection estabilishment and handoff are distributes among the back-ends, rather than being centralized in the front-end node. Only a minimal additional latency penatly is paid for much improved scalability. We have implemented this new architecture, and we demonstrate its superior scalability by comparing it to a system that performs contect-aware distribution in the front-end, both under synthetic and trace-drive workloads.