Coscheduled distributed-Web servers on system area network

Authors:
Jin-Ha Kim;Gyu Sang Choi;Chita R. Das
Affiliations:
Department of Computer Science and Engineering, The Pennsylvania State, University University Park, PA 16802, United States;Department of Computer Science and Engineering, The Pennsylvania State, University University Park, PA 16802, United States;Department of Computer Science and Engineering, The Pennsylvania State, University University Park, PA 16802, United States
Venue:
Journal of Parallel and Distributed Computing
Year:
2008

Citing 27
Cited 2

Coscheduling based on runtime identification of activity working sets

International Journal of Parallel Programming
Effective distributed scheduling of parallel workloads

Proceedings of the 1996 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Self-similarity in World Wide Web traffic: evidence and possible causes

Proceedings of the 1996 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Internet Web servers: workload characterization and performance implications

IEEE/ACM Transactions on Networking (TON)
Scheduling with implicit information in distributed systems

SIGMETRICS '98/PERFORMANCE '98 Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Locality-aware request distribution in cluster-based network servers

Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
Summary of WWW characterizations

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Estimating the heavy tail index from scaling properties

Methodology and Computing in Applied Probability
Alternatives to coscheduling a network of workstations

Journal of Parallel and Distributed Computing - Special issue on software support for distributed computing
Manageability, availability, and performance in porcupine: a highly scalable, cluster-based mail service

ACM Transactions on Computer Systems (TOCS)
Distributed cooperative Apache web server

Proceedings of the 10th international conference on World Wide Web
Analysis of SRPT scheduling: investigating unfairness

Proceedings of the 2001 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Characterizing reference locality in the WWW

DIS '96 Proceedings of the fourth international conference on on Parallel and distributed information systems
The state of the art in locally distributed Web-server systems

ACM Computing Surveys (CSUR)
An implementation and analysis of the virtual interface architecture

SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Modeling and analysis of dynamic coscheduling in parallel and distributed environments

SIGMETRICS '02 Proceedings of the 2002 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Understanding the Linux Kernel

Understanding the Linux Kernel
Myrinet: A Gigabit-per-Second Local Area Network

IEEE Micro
Dynamic Coscheduling on Workstation Clusters

IPPS/SPDP '98 Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing
Ninja: A Framework for Network Services

ATEC '02 Proceedings of the General Track of the annual conference on USENIX Annual Technical Conference
User-Level Communication in Cluster-Based Servers

HPCA '02 Proceedings of the 8th International Symposium on High-Performance Computer Architecture
A Cluster-Based Web System Providing Differentiated and Guaranteed Services

Cluster Computing
Adaptive Parallel Job Scheduling with Flexible Coscheduling

IEEE Transactions on Parallel and Distributed Systems
Scalable content-aware request distribution in cluster-based networks servers

ATEC '00 Proceedings of the annual conference on USENIX Annual Technical Conference
The multispace: an evolutionary platform for infrastructural services

ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference
Efficient support for P-HTTP in cluster-based web servers

ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference
Flash: an efficient and portable web server

ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference

Dynamic information-based scalable hashing on a cluster of web cache servers

Concurrency and Computation: Practice & Experience
A message negotiation approach to e-services by utility function and multi-criteria decision analysis

Computers & Mathematics with Applications

Quantified Score

Hi-index	0.00

Visualization

Abstract

As cluster-based Web servers are increasingly adopted to host a variety of network-based services, improving the performance of such servers has become critical to satisfy the customers' demands. Especially, the user response time is an important factor so that clients feel satisfied with the Web services. In this paper, we investigate the feasibility of minimizing the response time of a server by exploiting the advantages of both user-level communication and coscheduling. We, thus, propose a coscheduled server model based on the recently proposed distributed PRESS Web server, where the remote cache accesses can be coscheduled on different nodes to reduce the response time. We experiment this concept using two known coscheduling techniques, called dynamic coscheduling (DCS) and DCS with immediate blocking. We have developed a comprehensive simulation testbed that captures the underlying communication layer in a cluster, the characteristics of various coscheduling algorithms, and the characteristics of the distributed server model to estimate the average delay and throughput with different system configurations. The accuracy of the VIA communication layer and the DCS mechanism is verified using measurements on a 16-node Linux cluster. Extensive simulation of four server models (PRESS over VIA, coscheduled PRESS model with DCS, with DCS and blocking, and Adaptive) using 32-node cluster configurations indicates that the average response time of a distributed server can be minimized significantly by coscheduling the communicating processes. The use of the DCS scheme reduced the average latency up to four times to the PRESS over VIA model that uses only user-level communication.