Cache design trade-offs for power and performance optimization: a case study
ISLPED '95 Proceedings of the 1995 international symposium on Low power design
The filter cache: an energy efficient memory structure
MICRO 30 Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture
ISLPED '99 Proceedings of the 1999 international symposium on Low power electronics and design
Way-predicting set-associative cache for high performance and low energy consumption
ISLPED '99 Proceedings of the 1999 international symposium on Low power electronics and design
Selective cache ways: on-demand cache resource allocation
Proceedings of the 32nd annual ACM/IEEE international symposium on Microarchitecture
Wattch: a framework for architectural-level power analysis and optimizations
Proceedings of the 27th annual international symposium on Computer architecture
SH3: High Code Density, Low Power
IEEE Micro
Proceedings of the 2001 International Conference on Parallel Architectures and Compilation Techniques
Effective instruction scheduling techniques for an interleaved cache clustered VLIW processor
Proceedings of the 35th annual ACM/IEEE international symposium on Microarchitecture
Predictive sequential associative cache
HPCA '96 Proceedings of the 2nd IEEE Symposium on High-Performance Computer Architecture
Design and analysis of low-power cache using two-level filter scheme
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Power-Aware Deterministic Block Allocation for Low-Power Way-Selective Cache Structure
ICCD '04 Proceedings of the IEEE International Conference on Computer Design
A way-halting cache for low-energy high-performance systems
ACM Transactions on Architecture and Code Optimization (TACO)
Word-interleaved cache: an energy efficient data cache architecture
Proceedings of the 13th international symposium on Low power electronics and design
Low Vccmin fault-tolerant cache with highly predictable performance
Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture
Hi-index | 0.00 |
In this paper, we propose Selective Word Reading (SWR) technique for a low power processor without a loss of performance. The development of this technique was motivated by the differences between store unit sizes per storage level. In typical cases, the CPU register stores data with a unit size of one word, the L1 cache stores data using a unit size of four words and the L2 cache stores with a unit size of eight words. In SWR, only the necessary part of the block is activated during the cache access process. Therefore, the L1 cache controller reads one word from the L1 cache, which uses a unit size of four words, and the L2 cache controller reads four words from the L2 cache, which uses a unit size of eight words. The proper numbers of mats in a sub-bank must be chosen for the highest SWR best efficiency. For a four-way set associative L1 cache with a size of 32kB, a block size of 32B and four mats per sub-bank, SWR accomplishes dynamic energy savings of 67.54% without considering the leakage energy and 56.75% when the leakage energy is considered with no performance degradation and negligible area reduction. Additionally, with a 16-way set associative L2 cache with a size of 1MB, a block size of 64B and eight mats per sub-bank, SWR accomplishes dynamic energy savings of 60.41% for the cache overall before the leakage energy is considered and 11.71% after.