SARC: sequential prefetching in adaptive replacement cache

Authors:
Binny S. Gill;Dharmendra S. Modha
Affiliations:
IBM Almaden Research Center, San Jose, CA;IBM Almaden Research Center, San Jose, CA
Venue:
ATEC '05 Proceedings of the annual conference on USENIX Annual Technical Conference
Year:
2005

Citing 26
Cited 27

A fast file system for UNIX

ACM Transactions on Computer Systems (TOCS)
The design of the UNIX operating system

The design of the UNIX operating system
Measurements of a distributed file system

SOSP '91 Proceedings of the thirteenth ACM symposium on Operating systems principles
A study of integrated prefetching and caching strategies

Proceedings of the 1995 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Informed prefetching and caching

SOSP '95 Proceedings of the fifteenth ACM symposium on Operating systems principles
Operating systems (2nd ed.): design and implementation

Operating systems (2nd ed.): design and implementation
An analytic behavior model for disk drives with readahead caches and request reordering

SIGMETRICS '98/PERFORMANCE '98 Proceedings of the 1998 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Automatic I/O hint generation through speculative execution

OSDI '99 Proceedings of the third symposium on Operating systems design and implementation
Sequentiality and prefetching in database systems

ACM Transactions on Database Systems (TODS)
A trace-driven analysis of the UNIX 4.2 BSD file system

Proceedings of the tenth ACM symposium on Operating systems principles
Data prefetch mechanisms

ACM Computing Surveys (CSUR)
I/O reference behavior of production database workloads and the TPC benchmarks—an analysis at the logical level

ACM Transactions on Database Systems (TODS)
Adaptive caching for demand prepaging

Proceedings of the 3rd international symposium on Memory management
Operating Systems Theory

Operating Systems Theory
Operating System Concepts

Operating System Concepts
Performance analysis of a relational data base management system

SIGMOD '79 Proceedings of the 1979 ACM SIGMOD international conference on Management of data
Fido: A Cache That Learns to Fetch

VLDB '91 Proceedings of the 17th International Conference on Very Large Data Bases
Design and Implementation of a Predictive File Prefetching Algorithm

Proceedings of the General Track: 2002 USENIX Annual Technical Conference
The Multics Input/Output system

SOSP '71 Proceedings of the third ACM symposium on Operating systems principles
Rules of Thumb in Data Engineering

ICDE '00 Proceedings of the 16th International Conference on Data Engineering
Outperforming LRU with an Adaptive Replacement Cache Algorithm

Computer
IBM TotalStorage Enterprise Storage Server: A designer's view

IBM Systems Journal
Characteristics of production database workloads and the TPC benchmarks

IBM Systems Journal - End-to-end security
ARC: A Self-Tuning, Low Overhead Replacement Cache

FAST '03 Proceedings of the 2nd USENIX Conference on File and Storage Technologies
A low-overhead high-performance unified buffer management scheme that exploits sequential and looping references

OSDI'00 Proceedings of the 4th conference on Symposium on Operating System Design & Implementation - Volume 4
Ibm totalstorage enterprise storage server model 800

Ibm totalstorage enterprise storage server model 800

A buffer cache management scheme exploiting both temporal and spatial localities

ACM Transactions on Storage (TOS)
DULO: an effective buffer cache management scheme to exploit both temporal and spatial locality

FAST'05 Proceedings of the 4th conference on USENIX Conference on File and Storage Technologies - Volume 4
WOW: wise ordering for writes - combining spatial and temporal locality in non-volatile caches

FAST'05 Proceedings of the 4th conference on USENIX Conference on File and Storage Technologies - Volume 4
Competitive prefetching for concurrent sequential I/O

Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007
Optimal multistream sequential prefetching in a shared cache

ACM Transactions on Storage (TOS)
On multi-level exclusive caching: offline optimality and why promotions are better than demotions

FAST'08 Proceedings of the 6th USENIX Conference on File and Storage Technologies
TaP: table-based prefetching for storage caches

FAST'08 Proceedings of the 6th USENIX Conference on File and Storage Technologies
On the design of a new Linux readahead framework

ACM SIGOPS Operating Systems Review - Research and developments in the Linux kernel
Prefetching with adaptive cache culling for striped disk arrays

ATC'08 USENIX 2008 Annual Technical Conference on Annual Technical Conference
Prefetch throttling and data pinning for improving performance of shared caches

Proceedings of the 2008 ACM/IEEE conference on Supercomputing
Memory resource allocation for file system prefetching: from a supply chain management perspective

Proceedings of the 4th ACM European conference on Computer systems
RPP: reference pattern based prefetching controller

Proceedings of the 2009 ACM symposium on Applied Computing
Using machine learning techniques to enhance the performance of an automatic backup and recovery system

Proceedings of the 3rd Annual Haifa Experimental Systems Conference
Computation mapping for multi-level storage cache hierarchies

Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Cashing in on hints for better prefetching and caching in PVFS and MPI-IO

Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
STOW: a spatially and temporally optimized write caching algorithm

USENIX'09 Proceedings of the 2009 conference on USENIX Annual technical conference
FAST: quick application launch on solid-state drives

FAST'11 Proceedings of the 9th USENIX conference on File and stroage technologies
Management of Multilevel, Multiclient Cache Hierarchies with Application Hints

ACM Transactions on Computer Systems (TOCS)
Cost-aware caching schemes in heterogeneous storage systems

The Journal of Supercomputing
A driver-layer caching policy for removable storage devices

ACM Transactions on Storage (TOS)
Sustainable predictive storage management: on-line grouping for energy and latency reduction

Proceedings of the 4th Annual International Conference on Systems and Storage
Virtual I/O caching: dynamic storage cache management for concurrent workloads

Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Compiler-directed file layout optimization for hierarchical storage systems

SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Improving Cache Management Policies Using Dynamic Reuse Distances

MICRO-45 Proceedings of the 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture
Mortar: filling the gaps in data center memory

Proceedings of the 4th annual Symposium on Cloud Computing
Mortar: filling the gaps in data center memory

Proceedings of the 10th ACM SIGPLAN/SIGOPS international conference on Virtual execution environments
Compiler-directed file layout optimization for hierarchical storage systems

Scientific Programming - Selected Papers from Super Computing 2012

Quantified Score

Hi-index	0.00

Visualization

Abstract

Sequentiality of reference is an ubiquitous access pattern dating back at least to Multics. Sequential workloads lend themselves to highly accurate prediction and prefetching. In spite of the simplicity of the workload, design and analysis of a good sequential prefetching algorithm and associated cache replacement policy turns out to be surprisingly intricate. As first contribution, we uncover and remedy an anomaly (akin to famous Belady's anomaly) that plagues sequential prefetching when integrated with caching. Typical workloads contain a mix of sequential and random streams. As second contribution, we design a self-tuning, low overhead, simple to implement, locally adaptive, novel cache management policy SARC that dynamically and adaptively partitions the cache space amongst sequential and random streams so as to reduce the read misses. As third contribution, we implemented SARC along with two popular state-of-the-art LRU variants on hardware for IBM's flagship storage controller Shark. On Shark hardware with 8 GB cache and 16 RAID-5 arrays that is serving a workload akin to Storage Performance Council's widely adopted SPC-1 benchmark, SARC consistently and dramatically outperforms the two LRU variants shifting the throughput-response time curve to the right and thus fundamentally increasing the capacity of the system. As anecdotal evidence, at the peak throughput, SARC has average response time of 5.18ms as compared to 33.35ms and 8.92ms for the two LRU variants.