A Model of Workloads and its Use in Miss-Rate Prediction for Fully Associative Caches

Authors:
Jaswinder Pal Singh;Harold S. Stone;Dominique F. Thiébaut
Affiliations:
-;-;-
Venue:
IEEE Transactions on Computers
Year:
1992

Citing 14
Cited 26

Line (block) size choice for CPU cache memories

IEEE Transactions on Computers
Footprints in the cache

ACM Transactions on Computer Systems (TOCS)
Influence of program transients in computer cache-memories

Influence of program transients in computer cache-memories
An analytical cache model

ACM Transactions on Computer Systems (TOCS)
The Stack Growth Function: Cache Line Reference Models

IEEE Transactions on Computers
On the Fractal Dimension of Computer Programs and its Application to the Prediction of the Cache Miss Ratio

IEEE Transactions on Computers
Evaluating Associativity in CPU Caches

IEEE Transactions on Computers
Performance Analysis of Cache Memories

Journal of the ACM (JACM)
Transient behavior of cache memories

ACM Transactions on Computer Systems (TOCS)
Cold-start vs. warm-start miss ratios

Communications of the ACM
A simple linear model of demand paging performance

Communications of the ACM
Properties of the working-set model

Communications of the ACM
Correction to "Evaluating Associativity in CPU Caches"

IEEE Transactions on Computers
Using cache memory to reduce processor-memory traffic

ISCA '83 Proceedings of the 10th annual international symposium on Computer architecture

Expected I-cache miss rates via the gap model

ISCA '94 Proceedings of the 21st annual international symposium on Computer architecture
An inter-reference gap model for temporal locality in program behavior

Proceedings of the 1995 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
Scheduling for cache affinity in parallelized communication protocols

Proceedings of the 1995 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems
The effectiveness of affinity-based scheduling in multiprocessor network protocol processing (extended version)

IEEE/ACM Transactions on Networking (TON)
The influence of caches on the performance of heaps

Journal of Experimental Algorithmics (JEA)
A Subsystem-Oriented Performance Analysis Methodology for Shared-Bus Multiprocessors

IEEE Transactions on Parallel and Distributed Systems
Compression-Based Program Characterization for Improving Cache Memory Performance

IEEE Transactions on Computers
Cache-conscious structure layout

Proceedings of the ACM SIGPLAN 1999 conference on Programming language design and implementation
Cache performance analysis of traversals and random accesses

Proceedings of the tenth annual ACM-SIAM symposium on Discrete algorithms
Automatic and efficient evaluation of memory hierarchies for embedded systems

Proceedings of the 32nd annual ACM/IEEE international symposium on Microarchitecture
A Blocked All-Pairs Shortest-Path Algorithm

SWAT '00 Proceedings of the 7th Scandinavian Workshop on Algorithm Theory
Program Modelling via Inter-Reference Gaps and Applications

MASCOTS '95 Proceedings of the 3rd International Workshop on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems
Improving performance by cache driven memory management

HPCA '95 Proceedings of the 1st IEEE Symposium on High-Performance Computer Architecture
The impact of extrinsic cache performance on predictability of real-time systems

RTCSA '95 Proceedings of the 2nd International Workshop on Real-Time Computing Systems and Applications
Highly accurate and efficient evaluation of randomising set index functions

Journal of Systems Architecture: the EUROMICRO Journal
A blocked all-pairs shortest-paths algorithm

Journal of Experimental Algorithmics (JEA)
Architecture based analysis of performance, reliability and security of software systems

Proceedings of the 5th international workshop on Software and performance
Analysis of scratch-pad and data-cache performance using statistical methods

ASP-DAC '06 Proceedings of the 2006 Asia and South Pacific Design Automation Conference
Cache miss behavior: is it √2?

Proceedings of the 3rd conference on Computing frontiers
An analytical model for cache replacement policy performance

SIGMETRICS '06/Performance '06 Proceedings of the joint international conference on Measurement and modeling of computer systems
Quantifying software performance, reliability and security: An architecture-based approach

Journal of Systems and Software
Fast, accurate design space exploration of embedded systems memory configurations

Proceedings of the 2007 ACM symposium on Applied computing
The effectiveness of affinity-based scheduling in multiprocessor networking

INFOCOM'96 Proceedings of the Fifteenth annual joint conference of the IEEE computer and communications societies conference on The conference on computer communications - Volume 1
Two-level cache architecture to reduce memory accesses for IP lookups

Proceedings of the 23rd International Teletraffic Congress
Energy-optimal caches with guaranteed lifetime

Proceedings of the 2012 ACM/IEEE international symposium on Low power electronics and design
Computer performance analysis and the Pi Theorem

Computer Science - Research and Development

Quantified Score

Hi-index	14.98

Visualization

Abstract

A mathematical model for the behavior of programs or workloads is presented and from it is extracted the miss ratio of a finite, fully associative cache (or other first-level memory) using least-recently-used replacement under those workloads. To obtain miss ratios, the function u(t, L), defined to be the number of unique lines of size L referenced before time t, is modeled. Empirical observations show that this function appears to have the form u(t, L)=(W L/sup a/t/sup b/) (d/sup log/ /sup L log t/) where W, a, b, d are constants that are related, respectively, to the working set size, locality of references to nearby addresses (spatial locality), temporal locality (locality in time not attributable to spatial locality), and interactions between spatial locality and temporal locality. The miss ratio of a finite fully associative cache can be approximated as the time derivative of u(t, L) evaluated where the function has a value equal to the size of the cache. When the miss ratios from this model are compared to measured miss ratios for a representative trace, the accuracy is high for large caches. For smaller caches, the model is close but not highly precise.