Implementation and comparison of distributed caching schemes

  • Authors:
  • S Selvakumar;P Prabhakar

  • Affiliations:
  • Department of Computer Science and Engineering, Regional Engineering College, Tiruchirappalli 620015, Tamil Nadu, India;Software Engineer, Hewlett Packard, Indian Software Operation, Bangalore, India

  • Venue:
  • Computer Communications
  • Year:
  • 2001

Quantified Score

Hi-index 0.24

Visualization

Abstract

Shared Web Caches allow multiple clients to quickly access a pool of popular web pages. An organization that provides shared caching to its web clients will typically have a collection of shared caches rather than a single cache. If a collection of shared caches is used, it is required to coordinate the caches so that all cached pages in the collection are shared among the clients of the organization. In this paper, two protocol schemes for coordinating the collection of shared caches are investigated. The first scheme is based on Internet Caching Protocol (ICP) (D. Wessels, K. Claffy, ICP and the Squid WebCache, National Laboratory for Applied Network Research, http://www.nlanr.net/(wessels/papers/icp-squid.ps.gz). In the ICP scheme, the web caches query other caches for the web pages and fetch the web pages from the neighbors if they have cached the requested page. The second scheme is the hash routing scheme in which the client (browser) has to find the hash value for the URL of the requested page and send the request to the corresponding cache server (D. Wessels, K. Claffy, IEEE Network magazine, November-December (1997); G. Thalur, C.V. Ravishankar, IEEE/ACM Trans. Networking (1997); V. Valloppillil, J. Cohen, Hierarchical HTTP Routing Protocol, Internet Draft, http://ww.nlanr.net/Cache/ICP/draft-vinod-icp-traffic-dist00.txt; Super proxy script, How to make Distributed Proxy Servers by URL Hashing, White Paper, http://naragw.sharp.co.jp/sps, August (1996)). These two schemes have been implemented, and compared with respect to the page retrieval latency and the adaptability of the cache servers when a peer cache server fails. Our analysis shows that the hash routing schemes have significant performance advantages over ICP with respect to the average latency under normal conditions but when failure rate of the cache server is significant the ICP provides good adaptability. Also, we observe that the hashing function used in the hash routing scheme must have certain features such as quick calculation of the hash value and uniform distribution of the web pages (among cache servers).