Adaptive pull-based policies for wide area data delivery

Authors:
Laura Bright;Avigdor Gal;Louiqa Raschid
Affiliations:
Portland State University, Portland, OR; Technion - Israel Institute of Technology, Haifa, Israel; University of Maryland, College Park, MD
Venue:
ACM Transactions on Database Systems (TODS)
Year:
2006

Abstract

Wide area data delivery requires timely propagation of up-to-date information to thousands of clients over a wide area network. Applications include web caching, RSS source monitoring, and email access via a mobile network. Data sources vary widely in their update patterns and may experience different update rates at different times or unexpected changes to update patterns. Traditional data delivery solutions are either push-based, which require servers to push updates to clients, or pull-based, which require clients to check for updates at servers. While push-based solutions ensure timely data delivery, they are not always feasible to implement and may not scale to a large number of clients. In this article, we present adaptive pull-based policies that explicitly aim to reduce the overhead of contacting remote servers, compared to existing pull-based policies, while meeting freshness requirements. We model updates to data sources using update histories and present two novel history-based policies to estimate when updates occur: one based on individual history and one based on aggregate history. These policies are presented within an architectural framework that supports their deployment either client-side or server-side. We further develop two adaptive policies to handle objects that initially may have insufficient history or objects that experience changes in update patterns. Extensive experimental evaluation using three data traces from diverse applications shows that history-based policies can reduce contact between clients and servers by up to 60% compared to existing pull-based policies while providing a comparable level of data freshness. Our experiments further demonstrate that our adaptive policies can select the best policy to match the behavior of an object and perform better than any individual policy; thus, they dominate standalone policies.
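
To make the idea of a history-based pull policy concrete, the following is a minimal sketch and not the policies defined in the article. It assumes a simple estimator (last observed update plus the mean gap between past updates) and a fixed fallback interval when an object has too little history. The function names `estimate_next_update` and `next_poll_time` and the parameter `default_interval` are hypothetical and introduced only for illustration.

```python
from statistics import mean


def estimate_next_update(update_times):
    """Predict the next update time from an object's own update history.

    Assumption for illustration: next update = last update + mean gap
    between consecutive past updates. Returns None if fewer than two
    updates have been observed, since no gap can be computed.
    """
    if len(update_times) < 2:
        return None
    gaps = [later - earlier
            for earlier, later in zip(update_times, update_times[1:])]
    return update_times[-1] + mean(gaps)


def next_poll_time(update_times, now, default_interval=60.0):
    """Choose when the client should next contact the server.

    Falls back to polling after `default_interval` (a hypothetical
    parameter) when history is insufficient or the predicted update
    time has already passed.
    """
    predicted = estimate_next_update(update_times)
    if predicted is None or predicted <= now:
        return now + default_interval
    return predicted
```

With a regular history such as updates at times 0, 10, 20, and 30, this sketch defers the next poll until time 40 rather than polling at a fixed short interval. That is the kind of saving in client-server contact the abstract refers to, although the article's own estimators and adaptive selection are more involved.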