Optimal prefetching via data compression

  • Authors: Jeffrey Scott Vitter; P. Krishnan
  • Affiliations: Duke Univ., Durham, NC; Bell Labs, Holmdel, NJ
  • Venue: Journal of the ACM (JACM)
  • Year: 1996

Abstract

Caching and prefetching are important mechanisms for speeding up access time to data on secondary storage. Recent work in competitive online algorithms has uncovered several promising new algorithms for caching. In this paper, we apply a form of the competitive philosophy for the first time to the problem of prefetching to develop an optimal universal prefetcher in terms of fault rate, with particular applications to large-scale databases and hypertext systems. Our prediction algorithms for prefetching are novel in that they are based on data compression techniques that are both theoretically optimal and good in practice. Intuitively, in order to compress data effectively, one has to be able to predict future data well, and thus good data compressors should be able to predict well for purposes of prefetching. We show for powerful models such as Markov sources and mth-order Markov sources that the page fault rate incurred by our prefetching algorithms is optimal in the limit for almost all sequences of page requests.
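
To make the prediction-for-prefetching intuition concrete, here is a minimal sketch in Python. It is an illustration only, not the paper's algorithm: the abstract's prefetchers are built on data compression techniques, while this sketch uses a plain first-order Markov model of the request sequence. The names `MarkovPrefetcher`, `record`, and `predict` are hypothetical, and the size-1 prefetch cache is a simplifying assumption.

```python
from collections import Counter, defaultdict


class MarkovPrefetcher:
    """First-order Markov predictor over page requests (illustration only)."""

    def __init__(self):
        # For each page, count which pages have followed it so far.
        self.transitions = defaultdict(Counter)
        self.prev = None

    def record(self, page):
        """Observe one page request and update the transition counts."""
        if self.prev is not None:
            self.transitions[self.prev][page] += 1
        self.prev = page

    def predict(self, k=1):
        """Return up to k pages judged most likely to be requested next."""
        if self.prev is None:
            return []
        return [p for p, _ in self.transitions[self.prev].most_common(k)]


if __name__ == "__main__":
    prefetcher = MarkovPrefetcher()
    cache, faults, requests = set(), 0, "ABABABCABAB"
    for page in requests:
        if page not in cache:  # requested page was not prefetched: page fault
            faults += 1
        prefetcher.record(page)
        cache = set(prefetcher.predict(k=1))  # prefetch one page into a size-1 cache
    print(f"{faults} faults on {len(requests)} requests")  # prints: 5 faults on 11 requests
```

Conditioning the prediction on the last m requests instead of one corresponds to the mth-order Markov sources in the theorem; the paper's guarantee is that, against such sources, the fault rate of its compression-based prefetchers converges in the limit to the optimal fault rate for almost all request sequences.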