Selective word reading for high performance and low power processor

Authors:
Yun Kyo Cho;Seong Tae Jhang;Chu Shik Jhon
Affiliations:
Seoul National University, Gwanak-gu, Seoul, Korea;The University of Suwon, Hwaseong-si, Gyeonggi-do, Korea;Seoul national University, Gwanak-gu, Seoul, Korea
Venue:
Proceedings of the 2011 ACM Symposium on Research in Applied Computation
Year:
2011

Citing 15
Cited 0

Cache design trade-offs for power and performance optimization: a case study

ISLPED '95 Proceedings of the 1995 international symposium on Low power design
The filter cache: an energy efficient memory structure

MICRO 30 Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture
Reducing power in superscalar processor caches using subbanking, multiple line buffers and bit-line segmentation

ISLPED '99 Proceedings of the 1999 international symposium on Low power electronics and design
Way-predicting set-associative cache for high performance and low energy consumption

ISLPED '99 Proceedings of the 1999 international symposium on Low power electronics and design
Selective cache ways: on-demand cache resource allocation

Proceedings of the 32nd annual ACM/IEEE international symposium on Microarchitecture
Wattch: a framework for architectural-level power analysis and optimizations

Proceedings of the 27th annual international symposium on Computer architecture
SH3: High Code Density, Low Power

IEEE Micro
Reactive-Associative Caches

Proceedings of the 2001 International Conference on Parallel Architectures and Compilation Techniques
Effective instruction scheduling techniques for an interleaved cache clustered VLIW processor

Proceedings of the 35th annual ACM/IEEE international symposium on Microarchitecture
Predictive sequential associative cache

HPCA '96 Proceedings of the 2nd IEEE Symposium on High-Performance Computer Architecture
Design and analysis of low-power cache using two-level filter scheme

IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Power-Aware Deterministic Block Allocation for Low-Power Way-Selective Cache Structure

ICCD '04 Proceedings of the IEEE International Conference on Computer Design
A way-halting cache for low-energy high-performance systems

ACM Transactions on Architecture and Code Optimization (TACO)
Word-interleaved cache: an energy efficient data cache architecture

Proceedings of the 13th international symposium on Low power electronics and design
Low Vccmin fault-tolerant cache with highly predictable performance

Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we propose Selective Word Reading (SWR) technique for a low power processor without a loss of performance. The development of this technique was motivated by the differences between store unit sizes per storage level. In typical cases, the CPU register stores data with a unit size of one word, the L1 cache stores data using a unit size of four words and the L2 cache stores with a unit size of eight words. In SWR, only the necessary part of the block is activated during the cache access process. Therefore, the L1 cache controller reads one word from the L1 cache, which uses a unit size of four words, and the L2 cache controller reads four words from the L2 cache, which uses a unit size of eight words. The proper numbers of mats in a sub-bank must be chosen for the highest SWR best efficiency. For a four-way set associative L1 cache with a size of 32kB, a block size of 32B and four mats per sub-bank, SWR accomplishes dynamic energy savings of 67.54% without considering the leakage energy and 56.75% when the leakage energy is considered with no performance degradation and negligible area reduction. Additionally, with a 16-way set associative L2 cache with a size of 1MB, a block size of 64B and eight mats per sub-bank, SWR accomplishes dynamic energy savings of 60.41% for the cache overall before the leakage energy is considered and 11.71% after.