Optimal Web cache sizing: scalable methods for exact solutions

  • Authors:
  • T. Kelly; D. Reeves

  • Affiliations:
  • Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI 48109, USA (both authors)

  • Venue:
  • Computer Communications
  • Year:
  • 2001


Abstract

This paper describes two approaches to the problem of determining exact optimal storage capacity for Web caches based on expected workload and the monetary costs of memory and bandwidth. The first approach considers memory/bandwidth tradeoffs in an idealized model. It assumes that workload consists of independent references drawn from a known distribution (e.g. Zipf) and caches employ a "Perfect LFU" removal policy. We derive conditions under which a shared higher-level "parent" cache serving several lower-level "child" caches is economically viable. We also characterize circumstances under which globally optimal storage capacities in such a hierarchy can be determined through a decentralized computation in which caches individually minimize local monetary expenditures.

The second approach is applicable if the workload at a single cache is represented by an explicit request sequence and the cache employs any one of a large family of removal policies that includes LRU. The miss costs associated with individual requests may be completely arbitrary, and the cost of cache storage need only be monotonic. We use an efficient single-pass simulation algorithm to compute aggregate miss cost as a function of cache size in O(M log M) time and O(M) memory, where M is the number of requests in the workload. Because it allows us to compute arbitrarily weighted hit rates at all cache sizes with modest computational resources, this algorithm permits us to measure cache performance with no loss of precision. The same basic algorithm also permits us to compute complete stack distance transformations in O(M log N) time and O(N) memory, where N is the number of unique items referenced. Experiments on very large reference streams show that our algorithm computes stack distances more quickly than several alternative approaches, demonstrating that it is a useful tool for measuring temporal locality in cache workloads.
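To illustrate the kind of single-pass stack-distance computation the abstract describes, here is a minimal sketch using a Fenwick (binary indexed) tree over request timestamps. This is not the authors' exact algorithm or data structure; it is a standard alternative that runs in O(M log M) time and O(M) memory, slightly weaker than the O(M log N) time and O(N) memory bounds reported in the paper. The convention assumed here: a request's stack distance is the number of distinct items referenced since its previous access (so a request hits in an LRU cache of size C exactly when its distance is less than C), and a first reference has infinite distance, represented as None.

```python
class Fenwick:
    """Fenwick tree supporting point updates and prefix sums, 1-based."""
    def __init__(self, n):
        self.tree = [0] * (n + 1)

    def add(self, i, delta):
        while i < len(self.tree):
            self.tree[i] += delta
            i += i & -i

    def prefix(self, i):
        # Sum of positions 1..i.
        s = 0
        while i > 0:
            s += self.tree[i]
            i -= i & -i
        return s


def stack_distances(trace):
    """Return per-request LRU stack distances for a request sequence.

    Entry t is the number of distinct items referenced since the previous
    access to trace[t], or None on a cold (first-reference) miss.
    Runs in O(M log M) time for a trace of M requests.
    """
    bit = Fenwick(len(trace))
    last = {}   # item -> 1-based time of its most recent access
    out = []
    for t, item in enumerate(trace, start=1):
        if item in last:
            prev = last[item]
            # Count distinct items whose latest access falls strictly
            # between prev and t: these sit above `item` in the LRU stack.
            out.append(bit.prefix(t - 1) - bit.prefix(prev))
            bit.add(prev, -1)   # prev is no longer item's latest access
        else:
            out.append(None)    # infinite stack distance
        bit.add(t, 1)           # mark t as item's latest access
        last[item] = t
    return out
```

With the resulting distance histogram, the hit rate at every cache size C follows immediately: it is the fraction of requests whose distance is below C, which is the precise, all-sizes measurement the abstract refers to.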