An empirical investigation of Web session workloads: Can self-similarity be explained by deterministic chaos?

  • Authors:
  • Scott Dick;Omolbanin Yazdanbaksh;Xiuli Tang;Toan Huynh;James Miller

  • Affiliations:
  • -;-;-;-;-

  • Venue:
  • Information Processing and Management: an International Journal
  • Year:
  • 2014

Quantified Score

Hi-index 0.00

Visualization

Abstract

Several studies of Web server workloads have hypothesized that these workloads are self-similar. The explanation commonly advanced for this phenomenon is that the distribution of Web server requests may be heavy-tailed. However, there is another possible explanation: self-similarity can also arise from deterministic, chaotic processes. To our knowledge, this possibility has not previously been investigated, and so existing studies on Web workloads lack an adequate comparison against this alternative. We conduct an empirical study of workloads from two different Web sites: one public university, and one private company, using the largest datasets that have been described in the literature. Our study employs methods from nonlinear time series analysis to search for chaotic behavior in the web logs of these two sites. While we do find that the deterministic components (i.e. the well-known ''weekend effect'') are significant components in these time series, we do not find evidence of chaotic behavior. Predictive modeling experiments contrasting heavy-tailed with deterministic models showed that both approaches were equally effective in modeling our datasets.