Static load classification for improving the value predictability of data-cache misses

Authors:
Martin Burtscher;Amer Diwan;Matthias Hauswirth
Affiliations:
Cornell University;University of Colorado;University of Colorado
Venue:
PLDI '02 Proceedings of the ACM SIGPLAN 2002 Conference on Programming language design and implementation
Year:
2002

Citing 25
Cited 3

Cache behavior of combinator graph reduction

ACM Transactions on Programming Languages and Systems (TOPLAS)
Caching considerations for generational garbage collection

LFP '92 Proceedings of the 1992 ACM conference on LISP and functional programming
ATOM: a system for building customized program analysis tools

PLDI '94 Proceedings of the ACM SIGPLAN 1994 conference on Programming language design and implementation
Cache performance of garbage-collected programming languages

Cache performance of garbage-collected programming languages
Memory system performance of programs with intensive heap allocation

ACM Transactions on Computer Systems (TOCS)
Value locality and load value prediction

Proceedings of the seventh international conference on Architectural support for programming languages and operating systems
Exceeding the dataflow limit via value prediction

Proceedings of the 29th annual ACM/IEEE international symposium on Microarchitecture
Measuring the cost of storage management

Lisp and Symbolic Computation
The predictability of data values

MICRO 30 Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture
Can program profiling support value prediction?

MICRO 30 Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture
Highly accurate data value prediction using hybrid predictors

MICRO 30 Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture
Predicting data cache misses in non-numeric applications through correlation profiling

MICRO 30 Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture
Type-based alias analysis

PLDI '98 Proceedings of the ACM SIGPLAN 1998 conference on Programming language design and implementation
Modeling program predictability

Proceedings of the 25th annual international symposium on Computer architecture
Predictive techniques for aggressive load speculation

MICRO 31 Proceedings of the 31st annual ACM/IEEE international symposium on Microarchitecture
Value speculation scheduling for high performance processors

Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
Selective value prediction

ISCA '99 Proceedings of the 26th annual international symposium on Computer architecture
The Jalapeño dynamic optimizing compiler for Java

JAVA '99 Proceedings of the ACM 1999 conference on Java Grande
A High-Bandwidth Memory Pipeline for Wide Issue Processors

IEEE Transactions on Computers
Efficacy and Performance Impact of Value Prediction

PACT '98 Proceedings of the 1998 International Conference on Parallel Architectures and Compilation Techniques
Split Last-Address Predictor

PACT '98 Proceedings of the 1998 International Conference on Parallel Architectures and Compilation Techniques
Exploring Last n Value Prediction

PACT '99 Proceedings of the 1999 International Conference on Parallel Architectures and Compilation Techniques
Hybridizing and Coalescing Load Value Predictors

ICCD '00 Proceedings of the 2000 IEEE International Conference on Computer Design: VLSI in Computers & Processors
Differential FCM: Increasing Value Prediction Accuracy by Improving Table Usage Efficiency

HPCA '01 Proceedings of the 7th International Symposium on High-Performance Computer Architecture
Improving context-based load value prediction (instruction-level parallelism)

Improving context-based load value prediction (instruction-level parallelism)

Hybrid Load-Value Predictors

IEEE Transactions on Computers
Static Identification of Delinquent Loads

Proceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization
Identifying the sources of cache misses in Java programs without relying on hardware counters

Proceedings of the 2012 international symposium on Memory Management

Quantified Score

Hi-index	0.00

Visualization

Abstract

While caches are effective at avoiding most main-memory accesses, the few remaining memory references are still expensive. Even one cache miss per one hundred accesses can double a program's execution time. To better tolerate the data-cache miss latency, architects have proposed various speculation mechanisms, including load-value prediction. A load-value predictor guesses the result of a load so that the dependent instructions can immediately proceed without having to wait for the memory access to complete. To use the prediction resources most effectively, speculation should be restricted to loads that are likely to miss in the cache and that are likely to be predicted correctly. Prior work has considered hardware- and profile-based methods to make these decisions. Our work focuses on making these decisions at compile time. We show that a simple compiler classification is effective at separating the loads that should be speculated from the loads that should not. We present results for a number of C and Java programs and demonstrate that our results are consistent across programming languages and across program inputs.