Design and implementation of an efficient web cluster with content-based request distribution and file caching

Authors:
Mei-Ling Chiang;Yu-Chen Lin;Lian-Feng Guo
Affiliations:
Department of Information Management, National Chi-Nan University, Puli, Taiwan, ROC (all three authors)
Venue:
Journal of Systems and Software
Year:
2008

Abstract

We have implemented an efficient and scalable web cluster named LVS-CAD/FC (LVS with Content-Aware Dispatching and File Caching). In LVS-CAD/FC, a kernel-level, one-way, content-aware web switch based on TCP Rebuilding examines HTTP requests from clients and distributes them to web servers, and a fast Multiple TCP Rebuilding mechanism efficiently supports persistent connections. In addition, a file-based web cache stores a small set of the most frequently accessed web files in server RAM to reduce disk I/O, and a lightweight redirect method efficiently redirects requests to this cache. We further propose new policies for content-based, workload-aware request distribution, in which the web switch considers both the content of requests and the workload characteristics when dispatching requests. In particular, more frequently accessed web files are duplicated in more servers' file-based caches, so that hot web files can be served by more servers. Our goals are to improve cluster performance by achieving better memory utilization and higher cache hit rates while balancing load among servers. Experimental results from a practical implementation on Linux show that LVS-CAD/FC is efficient and scales well. Moreover, LVS-CAD/FC with the proposed policies can achieve 66.89% better performance than the Linux Virtual Server with a content-blind web switch.
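
The replication idea in the abstract (hotter files are cached on more servers, and the switch routes a request by its content) can be illustrated with a small user-space sketch. This is not the paper's kernel-level implementation or its actual policies, which the abstract does not spell out. The function names `plan_replicas` and `dispatch`, the `hot_set_size` parameter, the proportional replica formula, and the least-connections tie-break are all assumptions made for illustration.

```python
def plan_replicas(access_counts, servers, hot_set_size):
    """Decide which servers cache each hot file (illustrative policy only).

    Assumption: the replica count of a file scales with its access count
    relative to the hottest file, with at least one replica per hot file.
    """
    # Keep the hot_set_size most frequently accessed files; ties break by path.
    hot = sorted(access_counts.items(), key=lambda kv: (-kv[1], kv[0]))[:hot_set_size]
    if not hot:
        return {}
    top_count = hot[0][1]
    cached_files = {s: 0 for s in servers}  # number of files cached per server
    placement = {}
    for path, count in hot:
        replicas = max(1, (len(servers) * count) // top_count)
        # Spread replicas over the servers that currently cache the fewest files.
        chosen = sorted(servers, key=lambda s: (cached_files[s], s))[:replicas]
        for s in chosen:
            cached_files[s] += 1
        placement[path] = chosen
    return placement


def dispatch(path, placement, active_conns, servers):
    """Pick a server for one request, using its URL path (content-aware).

    Requests for cached files go to one of the servers holding that file;
    other requests may go to any server. Among candidates, the server with
    the fewest active connections wins (an assumed load-balancing rule).
    """
    candidates = placement.get(path, servers)
    target = min(candidates, key=lambda s: (active_conns[s], s))
    active_conns[target] += 1
    return target


if __name__ == "__main__":
    # Hypothetical access log summary and server names.
    counts = {"/index.html": 900, "/logo.png": 450, "/news.html": 300, "/rare.pdf": 5}
    servers = ["s1", "s2", "s3"]
    placement = plan_replicas(counts, servers, hot_set_size=3)
    conns = {s: 0 for s in servers}
    for request in ["/index.html", "/logo.png", "/index.html", "/rare.pdf"]:
        print(request, "->", dispatch(request, placement, conns, servers))
```

In this sketch the hottest file ends up in every server's cache, so requests for it can be spread across the whole cluster, while colder files occupy a single cache slot. A content-blind switch, by contrast, would ignore `path` and could send a request to a server that does not hold the file in RAM.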