Design and evaluation of web proxies by leveraging self-similarity of web traffic

  • Authors:
  • Rachid El Abdouni Khayari

  • Affiliations:
University of the Armed Forces Munich, Department of Computer Science, Neubiberg, Germany

  • Venue:
  • Computer Networks: The International Journal of Computer and Telecommunications Networking - Special issue: Network modelling and simulation
  • Year:
  • 2006


Abstract

In this paper a new concept for the analysis of communication systems and their performance is presented. Our view is that insight into the system workload can help in developing new methods for improving the perceived system performance. From measurements, we derived the typical request patterns; these insights were then used to develop new methods for improving system performance, and the new approaches were validated by simulation.

First, we present a fitting algorithm that works directly on measurement data instead of on an intermediate heavy-tailed distribution. This method provides good approximations of the object-size distribution as well as of the performance measures in an M|G|1 queue. The results of the fitting procedure also yield a classification of the considered events: they partition the space of object sizes into distinct classes.

Furthermore, we develop a new caching algorithm, class-based least recently used (C-LRU), which aims at a good balance between small and large documents in the cache. Similarly, the new scheduling algorithm, class-based interleaving weighted fair queueing (CI-WFQ), exploits the distribution of the requested object sizes to set its parameters such that good mean response times are obtained and starvation does not occur. We have found both methods suitable for use in Web proxy servers; in many cases they improve on existing strategies. The methods were compared using trace-driven simulations. Both algorithms are parameterized by information on the requested object-size distribution and can therefore be seen as potentially adaptive to the considered workload.
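The class-based idea behind C-LRU can be illustrated with a minimal sketch. This is not the paper's implementation; it only assumes the abstract's description: the cache is partitioned by object-size class (with boundaries taken from the fitted size distribution), each class gets a capacity share, and LRU eviction is applied within a class, so large objects cannot displace all small ones. The class names, capacity shares, and the `request` interface below are illustrative choices, not taken from the paper.

```python
from collections import OrderedDict

class CLRUCache:
    """Sketch of class-based LRU (C-LRU): the cache is split into
    size classes, each with its own capacity share and LRU order."""

    def __init__(self, capacity, class_bounds, shares):
        # class_bounds: ascending upper size bound per class;
        # shares: fraction of total capacity per class (sum to 1.0).
        assert len(class_bounds) == len(shares)
        self.bounds = class_bounds
        self.caps = [capacity * s for s in shares]
        self.used = [0.0] * len(shares)
        self.lists = [OrderedDict() for _ in shares]  # key -> size, in LRU order

    def _class_of(self, size):
        # Smallest class whose bound covers the object size.
        for i, bound in enumerate(self.bounds):
            if size <= bound:
                return i
        return len(self.bounds) - 1

    def request(self, key, size):
        """Serve a request; return True on a cache hit, False on a miss.
        On a miss the object is inserted, evicting within its own class."""
        c = self._class_of(size)
        if key in self.lists[c]:
            self.lists[c].move_to_end(key)  # mark as most recently used
            return True
        if size > self.caps[c]:
            return False  # larger than its class partition: bypass the cache
        while self.used[c] + size > self.caps[c]:
            _, evicted_size = self.lists[c].popitem(last=False)  # class-local LRU
            self.used[c] -= evicted_size
        self.lists[c][key] = size
        self.used[c] += size
        return False
```

For example, with a 20-unit cache split evenly between objects of size at most 4 and everything larger, caching one 8-unit object never evicts any of the small objects, which is exactly the balance the class partitioning is meant to enforce.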