A self-tuning cache architecture for embedded systems

Authors:
Chuanjun Zhang; Frank Vahid; Roman Lysecky
Affiliations:
University of California, Riverside, CA; University of California, Riverside, CA; University of California, Riverside, CA
Venue:
ACM Transactions on Embedded Computing Systems (TECS)
Year:
2004

Abstract

Memory accesses often account for about half of a microprocessor system's power consumption. Customizing a microprocessor cache's total size, line size, and associativity to a particular program is well known to yield large benefits in performance and power. Until recently, such customization was restricted to core-based flows, in which a new chip is fabricated. Several configurable cache architectures have since been proposed for use in prefabricated microprocessor platforms. Tuning those caches to a program nevertheless remains a cumbersome task left to designers, assisted in part by recent computer-aided design (CAD) tuning aids. We propose to move that CAD on-chip, which can greatly increase the acceptance of tunable caches. We introduce on-chip hardware implementing an efficient cache tuning heuristic that can automatically, transparently, and dynamically tune the cache to an executing program. Our heuristic not only reduces the number of configurations that must be examined, but also traverses the search space in a way that minimizes costly cache flushes. By simulating numerous Powerstone and MediaBench benchmarks, we show that such a dynamic self-tuning cache saves on average 40% of total memory access energy compared with a standard nontuned reference cache.
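
The idea of examining only a few configurations instead of the full search space can be illustrated with a minimal Python sketch. It is not the heuristic from the paper: the abstract does not specify the tuning order or the stopping rule, so the greedy one-parameter-at-a-time sweep, the parameter values, and the `toy_energy` model below are all assumptions chosen only for illustration. Cache flushes are not modeled.

```python
# Hypothetical configuration space (illustrative values, not from the paper).
SIZES_KB = [2, 4, 8]
LINE_BYTES = [16, 32, 64]
WAYS = [1, 2, 4]


def toy_energy(size_kb, line_bytes, ways):
    """Made-up energy model standing in for an on-chip measurement."""
    miss_rate = 0.08 * (2 / size_kb) * (32 / line_bytes) ** 0.5 / ways ** 0.3
    per_access = 0.5 + 0.1 * size_kb + 0.2 * ways
    miss_cost = 20 + 0.25 * line_bytes
    return per_access + miss_rate * miss_cost


def tune(energy, order=("size", "line", "ways")):
    """Greedy search (assumed): sweep one parameter at a time, smallest value
    first, and stop sweeping it as soon as energy stops improving."""
    space = {"size": SIZES_KB, "line": LINE_BYTES, "ways": WAYS}
    config = {name: values[0] for name, values in space.items()}
    best = energy(config["size"], config["line"], config["ways"])
    examined = 1
    for name in order:
        for value in space[name][1:]:
            trial = dict(config, **{name: value})
            e = energy(trial["size"], trial["line"], trial["ways"])
            examined += 1
            if e >= best:
                break  # no improvement: keep the previous value
            config, best = trial, e
    return config, best, examined


if __name__ == "__main__":
    config, best, examined = tune(toy_energy)
    total = len(SIZES_KB) * len(LINE_BYTES) * len(WAYS)
    print(f"chosen {config}, energy {best:.3f}, examined {examined}/{total}")
```

With three values per parameter, this sweep evaluates at most 7 of the 27 possible configurations. A greedy search of this kind can miss the global optimum, which is the usual trade-off for examining fewer configurations.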