Optimizing Network Performance In Replicated Hosting

Authors:
Ningning Hu;Peter Steenkiste;Oliver Spatscheck;Jia Wang
Affiliations:
Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA;AT&T Labs, NJ;AT&T Labs, NJ
Venue:
WCW '05 Proceedings of the 10th International Workshop on Web Content Caching and Distribution
Year:
2005

Citing 0
Cited 3

Exploiting internet route sharing for large scale available bandwidth estimation

IMC '05 Proceedings of the 5th ACM SIGCOMM conference on Internet Measurement
Path similarity evaluation using Bloom filters

Computer Networks: The International Journal of Computer and Telecommunications Networking
Managing traffic demand uncertainty in replica server placement with robust optimization

NETWORKING'06 Proceedings of the 5th international IFIP-TC6 conference on Networking Technologies, Services, and Protocols; Performance of Computer and Communication Networks; Mobile and Wireless Communications Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Most important commercial Web sites maintain multiple replicas of their server infrastructure to increase both reliability and performance. In this paper, we study how many replicas should be used and where they should be placed in order to improve client network performance, including both the latency (e.g., round-trip time) between clients and the replicas, and the bandwidth performance between them. This study is based on a large scale measurement study from an 18-node infrastructure, which reveals for the first time the distribution of today驴s Internet end-user access bandwidth. For example, we find that 50% of end users have access bandwidth less than 4.2Mbps. Using a greedy algorithm, we show that the first five replicas dominate latency optimization in our measurement infrastructure, while the first two replicas dominate bandwidth optimization. We also found that geographic diversity does not help as much for bandwidth optimization as it does for latency. To determine the proper trade-off between latency and bandwidth, we use a simplified TCP model to show that, when content size is less than 10KB, the deployment should focus on optimizing latency, while for content sizes larger than 1MB, the deployment should focus on optimizing bandwidth.