Frequent value locality and value-centric data cache design

Authors:
Youtao Zhang;Jun Yang;Rajiv Gupta
Affiliations:
Department of Computer Science, The University of Arizona, Tucson, Arizona;Department of Computer Science, The University of Arizona, Tucson, Arizona;Department of Computer Science, The University of Arizona, Tucson, Arizona
Venue:
ASPLOS IX Proceedings of the ninth international conference on Architectural support for programming languages and operating systems
Year:
2000

Citing 9
Cited 40

Value locality and load value prediction

Proceedings of the seventh international conference on Architectural support for programming languages and operating systems
Improving code density using compression techniques

MICRO 30 Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture
Procedure based program compression

MICRO 30 Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture
Can program profiling support value prediction?

MICRO 30 Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture
Improving direct-mapped cache performance by the addition of a small fully-associative cache and prefetch buffers

ISCA '90 Proceedings of the 17th annual international symposium on Computer Architecture
Reconfigurable caches and their application to media processing

Proceedings of the 27th annual international symposium on Computer architecture
Compiler techniques for code compaction

ACM Transactions on Programming Languages and Systems (TOPLAS)
Bidwidth analysis with application to silicon compilation

PLDI '00 Proceedings of the ACM SIGPLAN 2000 conference on Programming language design and implementation
Dynamically Exploiting Narrow Width Operands to Improve Processor Power and Performance

HPCA '99 Proceedings of the 5th International Symposium on High Performance Computer Architecture

Frequent value compression in data caches

Proceedings of the 33rd annual ACM/IEEE international symposium on Microarchitecture
Load and store reuse using register file contents

ICS '01 Proceedings of the 15th international conference on Supercomputing
FV encoding for low-power data I/O

ISLPED '01 Proceedings of the 2001 international symposium on Low power electronics and design
Frequent value locality and its applications

ACM Transactions on Embedded Computing Systems (TECS)
Low-Cost Value Predictors Using Frequent Value Locality

ISHPC '02 Proceedings of the 4th International Symposium on High Performance Computing
Data Compression Transformations for Dynamically Allocated Data Structures

CC '02 Proceedings of the 11th International Conference on Compiler Construction
Energy efficient frequent value data cache design

Proceedings of the 35th annual ACM/IEEE international symposium on Microarchitecture
Catching Accurate Profiles in Hardware

HPCA '03 Proceedings of the 9th International Symposium on High-Performance Computer Architecture
Non redundant data cache

Proceedings of the 2003 international symposium on Low power electronics and design
Power efficient encoding techniques for off-chip data buses

Proceedings of the 2003 international conference on Compilers, architecture and synthesis for embedded systems
Fast Secure Processor for Inhibiting Software Piracy and Tampering

Proceedings of the 36th annual IEEE/ACM International Symposium on Microarchitecture
Improving 64-Bit Java IPF Performance by Compressing Heap References

Proceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization
Adaptive Cache Compression for High-Performance Processors

Proceedings of the 31st annual international symposium on Computer architecture
A Content Aware Integer Register File Organization

Proceedings of the 31st annual international symposium on Computer architecture
Frequent value encoding for low power data buses

ACM Transactions on Design Automation of Electronic Systems (TODAES)
MicroLib: A Case for the Quantitative Comparison of Micro-Architecture Mechanisms

Proceedings of the 37th annual IEEE/ACM International Symposium on Microarchitecture
Improving Memory Encryption Performance in Secure Processors

IEEE Transactions on Computers
A compressed memory hierarchy using an indirect index cache

WMPI '04 Proceedings of the 3rd workshop on Memory performance issues: in conjunction with the 31st international symposium on computer architecture
Zero clustering: an approach to extend zero compression to instruction caches

GLSVLSI '05 Proceedings of the 15th ACM Great Lakes symposium on VLSI
A Robust Main-Memory Compression Scheme

Proceedings of the 32nd annual international symposium on Computer Architecture
An asymmetric clustered processor based on value content

Proceedings of the 19th annual international conference on Supercomputing
Restrictive Compression Techniques to Increase Level 1 Cache Capacity

ICCD '05 Proceedings of the 2005 International Conference on Computer Design
Fire-and-Forget: Load/Store Scheduling with No Store Queue at All

Proceedings of the 39th Annual IEEE/ACM International Symposium on Microarchitecture
Enhancing server availability and security through failure-oblivious computing

OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Compression in cache design

Proceedings of the 21st annual international conference on Supercomputing
Increasing cache capacity through word filtering

Proceedings of the 21st annual international conference on Supercomputing
Leakage energy reduction in cache memory by data compression

ACM SIGARCH Computer Architecture News - Special issue: ALPS '07---advanced low power systems
Early detection and bypassing of trivial operations to improve energy efficiency of processors

Microprocessors & Microsystems
CATCH: a mechanism for dynamically detecting Cache-Content-Duplication and its application to instruction caches

Proceedings of the conference on Design, automation and test in Europe
A Unified Compressed Cache Hierarchy Using Simple Frequent Pattern Compression and Partial Cache Line Prefetching

ICESS '07 Proceedings of the 3rd international conference on Embedded Software and Systems
Energy-efficient encoding techniques for off-chip data buses

ACM Transactions on Embedded Computing Systems (TECS)
Adaptive data compression for high-performance low-power on-chip networks

Proceedings of the 41st annual IEEE/ACM International Symposium on Microarchitecture
Eliminating energy of same-content-cell-columns of on-chip SRAM arrays

Proceedings of the 17th IEEE/ACM international symposium on Low-power electronics and design
CATCH: A mechanism for dynamically detecting cache-content-duplication in instruction caches

ACM Transactions on Architecture and Code Optimization (TACO)
Dynamic dictionary-based data compression for level-1 caches

ARCS'06 Proceedings of the 19th international conference on Architecture of Computing Systems
A space-efficient on-chip compressed cache organization for high performance computing

ISPA'04 Proceedings of the Second international conference on Parallel and Distributed Processing and Applications
Energy-Efficient value-based selective refresh for embedded DRAMs

PATMOS'05 Proceedings of the 15th international conference on Integrated Circuit and System Design: power and Timing Modeling, Optimization and Simulation
Lossless and lossy memory I/O link compression for improving performance of GPGPU workloads

Proceedings of the 21st international conference on Parallel architectures and compilation techniques
Base-delta-immediate compression: practical data compression for on-chip caches

Proceedings of the 21st international conference on Parallel architectures and compilation techniques
Linearly compressed pages: a low-complexity, low-latency main memory compression framework

Proceedings of the 46th Annual IEEE/ACM International Symposium on Microarchitecture

Quantified Score

Hi-index	0.00

Visualization

Abstract

By studying the behavior of programs in the SPECint95 suite we observed that six out of eight programs exhibit a new kind of value locality, the frequent value locality, according to which a few values appear very frequently in memory locations and are therefore involved in a large fraction of memory accesses. In these six programs ten distinct values occupy over 50% of all memory locations and on an average account for nearly 50% of all memory accesses during program execution. This observation holds for smaller blocks of consecutive memory locations and the set of frequent values remains quite stable over the execution of the program.In the six benchmarks with frequent value locality, on an average 50% of all cache misses occur during the reading or writing of the ten most frequently accessed values. We propose a new data cache structure, the frequent value cache (FVC), which employs a value-centric approach to caching data locations for exploiting the frequent value locality phenomenon. FVC is a small direct-mapped cache which is dedicated to holding only frequently occurring values. The value-centric nature of FVC enables us to store data in a compressed form where the compression is achieved by encoding the frequent values using a few bits. Moreover this simple compression scheme preserves the random access to data values in a cache line.Our experiments demonstrate that by augmenting a direct mapped cache (DMC) with a direct mapped FVC of size no more than 3 Kbytes we can obtain reductions in miss rates ranging from 1% to 68%. In fact we observed that higher reductions in miss rates can he achieved by augmenting a DMC with a small FVC as opposed to doubling the size of DMC for the 124.m88ksim and 134.perl benchmarks.