Modeling object characteristics of dynamic Web content

  • Authors:
  • W. Shi; E. Collins; V. Karamcheti

  • Affiliations:
  • Department of Computer Science, Wayne State University, 420 State Hall, 5143 Cass Ave, Detroit, MI; Department of Computer Science, New York University, Room 704, 715 Broadway, New York, NY; Department of Computer Science, New York University, Room 704, 715 Broadway, New York, NY

  • Venue:
  • Journal of Parallel and Distributed Computing - Scalable web services and architecture
  • Year:
  • 2003

Abstract

Requests for dynamic and personalized content have become a significant part of Internet traffic, driven both by the growth of dynamic Web services and by a "trickle-down" effect stemming from the effectiveness of caches and content-distribution networks at serving static content. To serve such requests efficiently, several server-side and cache-side techniques have recently been proposed. Although these techniques, which exploit different forms of reuse at the sub-document level, appear promising, two significant impediments to their widespread deployment are (1) the absence of good models describing the characteristics of dynamic Web content and (2) the lack of effective synthetic content generators, which would reduce the effort involved in verifying the effectiveness of a proposed solution.

This paper addresses both of these shortcomings. Its primary contribution is a set of models that capture the characteristics of dynamic content in terms of both independent parameters, such as the distributions of object sizes and freshness times, and derived parameters, such as content reusability across time and across linked documents. These models are derived from an analysis of content from six representative news and e-commerce sites, using both size-based and level-based splitting techniques to infer document objects. A secondary contribution is a Java-based dynamic content emulator, which uses these models to generate Edge Side Includes (ESI) based dynamic content and to serve requests for whole documents as well as separate objects.
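
To make the idea of a model-driven content generator concrete, here is a minimal Python sketch. It is not the authors' Java emulator. The log-normal size distribution, the exponential freshness distribution, their parameters, the `/object/<id>` URL scheme, and all function and class names are assumptions chosen purely for illustration; the abstract does not state which distributions the paper fits. Only the `<esi:include src="..."/>` tag comes from the ESI language itself.

```python
import random
from dataclasses import dataclass


@dataclass
class SyntheticObject:
    object_id: int
    size_bytes: int      # sampled object size
    freshness_s: float   # sampled freshness time, in seconds


def sample_objects(count, seed=0):
    """Draw `count` objects from assumed (hypothetical) distributions."""
    rng = random.Random(seed)  # seeded so runs are repeatable
    objects = []
    for i in range(count):
        # Assumption: log-normal sizes, clamped to at least one byte.
        size = max(1, int(rng.lognormvariate(6.0, 1.0)))
        # Assumption: exponential freshness times with a one-hour mean.
        freshness = rng.expovariate(1.0 / 3600.0)
        objects.append(SyntheticObject(i, size, freshness))
    return objects


def render_template(objects):
    """Build a whole-document template that pulls in each object via ESI."""
    lines = ["<html><body>"]
    for obj in objects:
        # Hypothetical URL scheme for fetching a separate object.
        lines.append(f'<esi:include src="/object/{obj.object_id}"/>')
    lines.append("</body></html>")
    return "\n".join(lines)


def render_object(obj):
    """Return filler content whose length matches the sampled size."""
    return "x" * obj.size_bytes


if __name__ == "__main__":
    objs = sample_objects(3, seed=42)
    print(render_template(objs))
```

A server built on this sketch would return the template for a whole-document request and the output of `render_object` for a separate-object request, regenerating an object once its freshness time has elapsed.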