Coscheduling based on runtime identification of activity working sets
International Journal of Parallel Programming
Effective distributed scheduling of parallel workloads
Proceedings of the 1996 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Self-similarity in World Wide Web traffic: evidence and possible causes
Proceedings of the 1996 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Internet Web servers: workload characterization and performance implications
IEEE/ACM Transactions on Networking (TON)
Scheduling with implicit information in distributed systems
SIGMETRICS '98/PERFORMANCE '98 Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Locality-aware request distribution in cluster-based network servers
Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
Summary of WWW characterizations
WWW7 Proceedings of the seventh international conference on World Wide Web 7
Estimating the heavy tail index from scaling properties
Methodology and Computing in Applied Probability
Alternatives to coscheduling a network of workstations
Journal of Parallel and Distributed Computing - Special issue on software support for distributed computing
ACM Transactions on Computer Systems (TOCS)
Distributed cooperative Apache web server
Proceedings of the 10th international conference on World Wide Web
Analysis of SRPT scheduling: investigating unfairness
Proceedings of the 2001 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Characterizing reference locality in the WWW
DIS '96 Proceedings of the fourth international conference on on Parallel and distributed information systems
The state of the art in locally distributed Web-server systems
ACM Computing Surveys (CSUR)
An implementation and analysis of the virtual interface architecture
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Modeling and analysis of dynamic coscheduling in parallel and distributed environments
SIGMETRICS '02 Proceedings of the 2002 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Understanding the Linux Kernel
Understanding the Linux Kernel
Dynamic Coscheduling on Workstation Clusters
IPPS/SPDP '98 Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing
Ninja: A Framework for Network Services
ATEC '02 Proceedings of the General Track of the annual conference on USENIX Annual Technical Conference
User-Level Communication in Cluster-Based Servers
HPCA '02 Proceedings of the 8th International Symposium on High-Performance Computer Architecture
Adaptive Parallel Job Scheduling with Flexible Coscheduling
IEEE Transactions on Parallel and Distributed Systems
Scalable content-aware request distribution in cluster-based networks servers
ATEC '00 Proceedings of the annual conference on USENIX Annual Technical Conference
The multispace: an evolutionary platform for infrastructural services
ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference
Efficient support for P-HTTP in cluster-based web servers
ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference
Flash: an efficient and portable web server
ATEC '99 Proceedings of the annual conference on USENIX Annual Technical Conference
Dynamic information-based scalable hashing on a cluster of web cache servers
Concurrency and Computation: Practice & Experience
Computers & Mathematics with Applications
Hi-index | 0.00 |
As cluster-based Web servers are increasingly adopted to host a variety of network-based services, improving the performance of such servers has become critical to satisfy the customers' demands. Especially, the user response time is an important factor so that clients feel satisfied with the Web services. In this paper, we investigate the feasibility of minimizing the response time of a server by exploiting the advantages of both user-level communication and coscheduling. We, thus, propose a coscheduled server model based on the recently proposed distributed PRESS Web server, where the remote cache accesses can be coscheduled on different nodes to reduce the response time. We experiment this concept using two known coscheduling techniques, called dynamic coscheduling (DCS) and DCS with immediate blocking. We have developed a comprehensive simulation testbed that captures the underlying communication layer in a cluster, the characteristics of various coscheduling algorithms, and the characteristics of the distributed server model to estimate the average delay and throughput with different system configurations. The accuracy of the VIA communication layer and the DCS mechanism is verified using measurements on a 16-node Linux cluster. Extensive simulation of four server models (PRESS over VIA, coscheduled PRESS model with DCS, with DCS and blocking, and Adaptive) using 32-node cluster configurations indicates that the average response time of a distributed server can be minimized significantly by coscheduling the communicating processes. The use of the DCS scheme reduced the average latency up to four times to the PRESS over VIA model that uses only user-level communication.