Online algorithms for prefetching and caching on parallel disks

  • Authors:
  • Rahul Shah;Peter J. Varman;Jeffrey Scott Vitter

  • Affiliations:
  • Purdue University, West Lafayette, IN;Rice University, Houston, TX;Purdue University, West Lafayette, IN

  • Venue:
  • Proceedings of the sixteenth annual ACM symposium on Parallelism in algorithms and architectures
  • Year:
  • 2004

Abstract

Parallel disks provide a cost-effective way of speeding up I/Os in applications that work with large amounts of data. The main challenge is to achieve as much parallelism as possible, using prefetching to avoid bottlenecks in disk access. Efficient algorithms have been developed for some particular patterns of accessing the disk blocks. In this paper, we consider general request sequences. When the request sequence consists of unique block requests, the problem is called prefetching and is well solved for arbitrary request sequences. When the reference sequence can contain repeated references to the same block, an effective caching policy must be devised as well. While optimal offline algorithms have recently been designed for this problem, no effective online algorithm was previously known. Our main contribution is a deterministic online algorithm, threshold-LRU, which achieves an O((MD/L)^{2/3}) competitive ratio, and a randomized online algorithm, threshold-MARK, which achieves an O(√(MD/L) log(MD/L)) competitive ratio for the caching/prefetching problem on the parallel disk model (PDM), where D is the number of disks, M is the size of the fast memory buffer, and M+L is the amount of lookahead available in the request sequence. The best-known lower bound on the competitive ratio is Ω(√(MD/L)) for lookahead L ≥ M in both the deterministic and randomized models. We also show that if the deterministic online algorithm is allowed twice the memory of the offline algorithm, then a tight competitive ratio of Θ(√(MD/L)) can be achieved. This problem generalizes the well-known paging problem on a single disk to the parallel disk model.
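
To make the setting concrete, the sketch below simulates a toy greedy prefetcher with LRU eviction on the parallel disk model: one I/O step may fetch at most one block from each of the D disks, fast memory holds at most M blocks, and the algorithm may look at the next L requests. The function name `greedy_prefetch_lru_steps`, the `disk_of` mapping, and the greedy batching rule are illustrative assumptions; this is not the paper's threshold-LRU or threshold-MARK algorithm.

```python
from collections import OrderedDict

def greedy_prefetch_lru_steps(requests, disk_of, M, D, L):
    """Count parallel I/O steps for a toy greedy prefetcher with LRU
    eviction on the Parallel Disk Model (PDM).

    PDM rules modelled here: in one I/O step at most one block can be
    fetched from each of the D disks, and fast memory holds at most M
    blocks.  `disk_of[b]` is the disk holding block b, and L is how many
    future requests the algorithm may inspect (its lookahead).

    NOTE: hypothetical illustration of the model only; it is not the
    paper's threshold-LRU or threshold-MARK algorithm.
    """
    assert M >= 1 and D >= 1
    cache = OrderedDict()              # block -> None, oldest (LRU) first
    steps = 0

    for i, block in enumerate(requests):
        if block in cache:
            cache.move_to_end(block)   # hit: refresh recency, no I/O
            continue

        # Miss: one parallel I/O step.  Fetch the missing block and, using
        # the lookahead window, at most one further missing block from each
        # of the remaining disks, as long as memory can hold the batch.
        steps += 1
        batch = {block}
        used_disks = {disk_of[block]}
        for future in requests[i + 1 : i + 1 + L]:
            if len(batch) == min(D, M):
                break
            if future in cache or future in batch:
                continue
            if disk_of[future] not in used_disks:
                batch.add(future)
                used_disks.add(disk_of[future])

        for b in batch:
            if len(cache) >= M:
                cache.popitem(last=False)   # evict least recently used
            cache[b] = None
        cache.move_to_end(block)            # current request is most recent

    return steps
```

With L = 0 the lookahead window is empty and the loop degenerates to demand paging (one I/O step per miss, using a single disk per step), which is one way to see why lookahead is essential for exploiting the parallelism of the D disks.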