Best-effort refresh strategies for content-based RSS feed aggregation

  • Authors:
  • Roxana Horincar;Bernd Amann;Thierry Artières

  • Affiliations:
  • LIP6, University Pierre et Marie Curie, Paris, France;LIP6, University Pierre et Marie Curie, Paris, France;LIP6, University Pierre et Marie Curie, Paris, France

  • Venue:
  • WISE'10 Proceedings of the 11th international conference on Web information systems engineering
  • Year:
  • 2010

Quantified Score

Hi-index 0.02

Visualization

Abstract

During the past several years RSS-based content syndication has become a standard technique for efficiently and timely disseminating information on the web. From a data processing perspective RSS feeds are standard XML resources which are periodically refreshed by feed aggregators for generating continuous streams of items. In this article, we study the problem of information loss in the context of a content-based feed aggregation system and we propose a new best-effort refresh strategy for RSS feeds under limited bandwidth. This strategy is evaluated experimentally and compared to other state-of-the-art crawling strategies for web pages.