Multi retention level STT-RAM cache designs with a dynamic refresh scheme

Authors:
Zhenyu Sun;Xiuyuan Bi;Hai (Helen) Li;Weng-Fai Wong;Zhong-Liang Ong;Xiaochun Zhu;Wenqing Wu
Affiliations:
Polytechnic Institute of New York University, Metrotech Center, Brooklyn, NY;Polytechnic Institute of New York University, Metrotech Center, Brooklyn, NY;Polytechnic Institute of New York University, Metrotech Center, Brooklyn, NY;National University of Singapore, Computing Drive, Singapore;National University of Singapore, Computing Drive, Singapore;Qualcomm Incorporated, Morehouse Drive, San Diego;Qualcomm Incorporated, Morehouse Drive, San Diego
Venue:
Proceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture
Year:
2011

Citing 7
Cited 11

Circuit and microarchitecture evaluation of 3D stacking magnetic RAM (MRAM) as a universal memory replacement

Proceedings of the 45th annual Design Automation Conference
System-level cost analysis and design exploration for three-dimensional integrated circuits (3D ICs)

Proceedings of the 2009 Asia and South Pacific Design Automation Conference
Energy reduction for STT-RAM using early write termination

Proceedings of the 2009 International Conference on Computer-Aided Design
Power and performance of read-write aware hybrid caches with non-volatile memories

Proceedings of the Conference on Design, Automation and Test in Europe
A forward body-biased low-leakage SRAM cache: device, circuit and architecture considerations

IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Design of last-level on-chip cache using spin-torque transfer RAM (STT RAM)

IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Relaxing non-volatility for fast and energy-efficient STT-RAM caches

HPCA '11 Proceedings of the 2011 IEEE 17th International Symposium on High Performance Computer Architecture

Optimizing NAND flash-based SSDs via retention relaxation

FAST'12 Proceedings of the 10th USENIX conference on File and Storage Technologies
A dual-mode architecture for fast-switching STT-RAM

Proceedings of the 2012 ACM/IEEE international symposium on Low power electronics and design
Improving energy efficiency of write-asymmetric memories by log style write

Proceedings of the 2012 ACM/IEEE international symposium on Low power electronics and design
Asymmetric-access aware optimization for STT-RAM caches with process variations

Proceedings of the 23rd ACM international conference on Great lakes symposium on VLSI
OAP: an obstruction-aware cache management policy for STT-RAM last-level caches

Proceedings of the Conference on Design, Automation and Test in Europe
Cache coherence enabled adaptive refresh for volatile STT-RAM

Proceedings of the Conference on Design, Automation and Test in Europe
D-MRAM cache: enhancing energy efficiency with 3T-1MTJ DRAM/MRAM hybrid memory

Proceedings of the Conference on Design, Automation and Test in Europe
Cross-layer racetrack memory design for ultra high density and low power consumption

Proceedings of the 50th Annual Design Automation Conference
Low-energy volatile STT-RAM cache design using cache-coherence-enabled adaptive refresh

ACM Transactions on Design Automation of Electronic Systems (TODAES)
NVM duet: unified working memory and persistent store architecture

Proceedings of the 19th international conference on Architectural support for programming languages and operating systems
C1C: A configurable, compiler-guided STT-RAM L1 cache

ACM Transactions on Architecture and Code Optimization (TACO)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Spin-transfer torque random access memory (STT-RAM) has received increasing attention because of its attractive features: good scalability, zero standby power, non-volatility and radiation hardness. The use of STT-RAM technology in the last level on-chip caches has been proposed as it minimizes cache leakage power with technology scaling down. Furthermore, the cell area of STT-RAM is only 1/9 ~ 1/3 that of SRAM. This allows for a much larger cache with the same die footprint, improving overall system performance through reducing cache misses. However, deploying STT-RAM technology in L1 caches is challenging because of the long and power-consuming write operations. In this paper, we propose both L1 and lower level cache designs that use STT-RAM. In particular, our designs use STT-RAM cells with various data retention time and write performances, made possible by different magnetic tunneling junction (MTJ) designs. For the fast STT-RAM bits with reduced data retention time, a counter controlled dynamic refresh scheme is proposed to maintain the data validity. Our dynamic scheme saves more than 80% refresh energy compared to the simple refresh scheme proposed in previous works. A L1 cache built with ultra low retention STT-RAM coupled with our proposed dynamic refresh scheme can achieve 9.2% in performance improvement, and saves up to 30% of the total energy when compared to one that uses traditional SRAM. For lower level caches with relative large cache capacity, we propose a data migration scheme that moves data between portions of the cache with different retention characteristics so as to maximize the performance and power benefits. Our experiments show that on the average, our proposed multi retention level STT-RAM cache reduces 30 ~ 70% of the total energy compared to previous works, while improving IPC performance for both 2-level and 3-level cache hierarchy.