Locality-aware request distribution in cluster-based network servers

  • Authors:
  • Vivek S. Pai;Mohit Aron;Gaurav Banga;Michael Svendsen;Peter Druschel;Willy Zwaenepoel;Erich Nahum

  • Affiliations:
  • Department of Electrical and Computer Engineering, Rice University;Department of Computer Science, Rice University;Department of Computer Science, Rice University;Department of Computer Science, Rice University;Department of Computer Science, Rice University;Department of Computer Science, Rice University;IBM T.J. Watson Research Center

  • Venue:
  • Proceedings of the Eighth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS VIII)
  • Year:
  • 1998

Abstract

We consider cluster-based network servers in which a front-end directs incoming requests to one of a number of back-ends. Specifically, we consider content-based request distribution: the front-end uses the content requested, in addition to information about the load on the back-end nodes, to choose which back-end will handle the request. Content-based request distribution can improve locality in the back-ends' main memory caches, increase secondary storage scalability by partitioning the server's database, and provide the ability to employ back-end nodes that are specialized for certain types of requests.

As a specific policy for content-based request distribution, we introduce a simple, practical strategy for locality-aware request distribution (LARD). With LARD, the front-end distributes incoming requests in a manner that achieves high locality in the back-ends' main memory caches as well as load balancing. Locality is increased by dynamically subdividing the server's working set over the back-ends. Trace-based simulation results and measurements on a prototype implementation demonstrate substantial performance improvements over state-of-the-art approaches that use only load information to distribute requests. On workloads whose working sets do not fit in a single server node's main memory cache, the achieved throughput exceeds that of the state-of-the-art approach by a factor of two to four.

With content-based distribution, incoming requests must be handed off to a back-end in a manner transparent to the client, after the front-end has inspected the content of the request. To this end, we introduce an efficient TCP handoff protocol that can hand off an established TCP connection in a client-transparent manner.
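The abstract's description of LARD suggests a front-end policy that routes each request for a given target to the node that last served that target (preserving cache locality), and reassigns the target to a lightly loaded node only when the current node becomes overloaded. Below is a minimal sketch of such a policy; the threshold names (T_LOW, T_HIGH), the dispatcher class, and the specific reassignment condition are illustrative assumptions for this example, not the paper's actual interface or implementation.

```python
# Illustrative sketch of a locality-aware request distribution policy
# in the spirit of LARD. Thresholds and class names are assumptions
# made for this example.

T_LOW = 25    # a node below this load counts as lightly loaded (assumed value)
T_HIGH = 65   # a node above this load counts as overloaded (assumed value)


class LardDispatcher:
    def __init__(self, nodes):
        self.load = {n: 0 for n in nodes}   # outstanding requests per back-end
        self.server = {}                    # target (e.g. URL path) -> assigned back-end

    def _least_loaded(self):
        return min(self.load, key=self.load.get)

    def assign(self, target):
        """Choose the back-end node that should serve `target`."""
        node = self.server.get(target)
        if node is None:
            # First request for this target: send it to the least loaded node.
            node = self._least_loaded()
            self.server[target] = node
        else:
            # Reassign the target if its node is overloaded while a lightly
            # loaded node exists, or if its node is severely overloaded.
            lightly_loaded_exists = any(l < T_LOW for l in self.load.values())
            if (self.load[node] > T_HIGH and lightly_loaded_exists) \
                    or self.load[node] >= 2 * T_HIGH:
                node = self._least_loaded()
                self.server[target] = node
        self.load[node] += 1
        return node

    def complete(self, node):
        """Call when a back-end finishes a request, to decay its load."""
        self.load[node] -= 1
```

For example, a front-end that parses HTTP requests could call `dispatcher.assign(path)` for each request and hand the connection to the returned node (via a mechanism such as the TCP handoff protocol mentioned above), calling `complete(node)` when the back-end reports that the response has been sent.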