Modeling correlations in web traces and implications for designing replacement policies

Authors:
Konstantinos Psounis;An Zhu;Balaji Prabhakar;Rajeev Motwani
Affiliations:
Department of Electrical Engineering, University of Southern California, Los Angeles, CA;Department of Computer Science, Stanford University, Stanford, CA;Departments of Electrical Engineering and Computer Science, Stanford University, Stanford, CA;Department of Computer Science, Stanford University, Stanford, CA
Venue:
Computer Networks: The International Journal of Computer and Telecommunications Networking
Year:
2004

Citing 21
Cited 6

An inter-reference gap model for temporal locality in program behavior

Proceedings of the 1995 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Using predictive prefetching to improve World Wide Web latency

ACM SIGCOMM Computer Communication Review
Generating representative Web workloads for network and server performance evaluation

SIGMETRICS '98/PERFORMANCE '98 Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Proxy caching that estimates page load delays

Selected papers from the sixth international conference on World Wide Web
On the existence of a spectrum of policies that subsumes the least recently used (LRU) and least frequently used (LFU) policies

SIGMETRICS '99 Proceedings of the 1999 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
Towards estimation error guarantees for distinct values

PODS '00 Proceedings of the nineteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
On self-organizing sequential search heuristics

Communications of the ACM
Temporal locality and its impact on Web proxy cache performance

Performance Evaluation - Special issue on internet performance modelling
Characterizing reference locality in the WWW

DIS '96 Proceedings of the fourth international conference on on Parallel and distributed information systems
Markov Decision Processes: Discrete Stochastic Dynamic Programming

Markov Decision Processes: Discrete Stochastic Dynamic Programming
Introduction to Stochastic Dynamic Programming: Probability and Mathematical

Introduction to Stochastic Dynamic Programming: Probability and Mathematical
Neuro-Dynamic Programming

Neuro-Dynamic Programming
Probabilistic methods for web caching

Performance Evaluation
Operating Systems Theory

Operating Systems Theory
Operating System Concepts

Operating System Concepts
Efficient randomized web-cache replacement schemes using samples from past eviction times

IEEE/ACM Transactions on Networking (TON)
ProWGen: a synthetic workload generation tool for simulation evaluation of web proxy caches

Computer Networks: The International Journal of Computer and Telecommunications Networking
Sources and Characteristics of Web Temporal Locality

MASCOTS '00 Proceedings of the 8th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems
Popularity-Aware Greedy Dual-Size Web Proxy Caching Algorithms

ICDCS '00 Proceedings of the The 20th International Conference on Distributed Computing Systems ( ICDCS 2000)
Properties and applications of the least-recently-used stack model

Properties and applications of the least-recently-used stack model
Cost-aware WWW proxy caching algorithms

USITS'97 Proceedings of the USENIX Symposium on Internet Technologies and Systems on USENIX Symposium on Internet Technologies and Systems

Modeling spatially correlated data in sensor networks

ACM Transactions on Sensor Networks (TOSN)
Mistreatment-resilient distributed caching

Computer Networks: The International Journal of Computer and Telecommunications Networking
Distributed Selfish Caching

IEEE Transactions on Parallel and Distributed Systems
Approximate analysis of LRU in the case of short term correlations

Computer Networks: The International Journal of Computer and Telecommunications Networking
Traffic modeling and proportional partial caching for peer-to-peer systems

IEEE/ACM Transactions on Networking (TON)
A feedback control approach to mitigating mistreatment in distributed caching groups

NETWORKING'06 Proceedings of the 5th international IFIP-TC6 conference on Networking Technologies, Services, and Protocols; Performance of Computer and Communication Networks; Mobile and Wireless Communications Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

A number of web cache-related algorithms, such as replacement and prefetching policies, rely on specific characteristics present in the sequence of requests for efficient performance. Further, there is an increasing need to synthetically generate long traces of web requests for studying the performance of algorithms and systems related to the web. These reasons motivate us to obtain a simple and accurate model of web request traces.Our Markovian model precisely captures the degrees to which temporal correlations and document popularity influence web trace requests. We describe a mathematical procedure to extract the model parameters from real traces and generate synthetic traces using these parameters. This procedure is verified by standard statistical analysis. We also validate the model by comparing the hit ratios for real traces and their synthetic counterparts under various caching algorithms.As an important by-product, the model provides guidelines for designing efficient replacement algorithms. We obtain optimal algorithms given the parameters of the model. We also introduce a spectrum of practicable, high-performance algorithms that adapt to the degree of temporal correlation present in the request sequence, and discuss related implementation concerns.