Exploring hybrid memory for GPU energy efficiency through software-hardware co-design

  • Authors:
  • Bin Wang;Bo Wu;Dong Li;Xipeng Shen;Weikuan Yu;Yizheng Jiao;Jeffrey S. Vetter

  • Affiliations:
  • Auburn University, Auburn, AL, USA;College of William & Mary, Williamsburg, VA, USA;Oak Ridge National Laboratory, Oak Ridge, TN, USA;College of William & Mary, Williamsburg, VA, USA;Auburn University, Auburn, AL, USA;Auburn University, Auburn, AL, USA;Oak Ridge National Laboratory, Oak Ridge, TN, USA

  • Venue:
  • PACT '13 Proceedings of the 22nd international conference on Parallel architectures and compilation techniques
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Hybrid memory designs, such as DRAM plus Phase Change Memory (PCM), have shown some promise for alleviating power and density issues faced by traditional memory systems. But previous studies have concentrated on CPU systems with a modest level of parallelism. This work studies the problem in a massively parallel setting. Specifically, it investigates the special implications to hybrid memory imposed by the massive parallelism in GPU. It empirically shows that, contrary to promising results demonstrated for CPU, previous designs of PCM-based hybrid memory result in significant degradation to the energy efficiency of GPU. It reveals that the fundamental reason comes from a multi-facet mismatch between those designs and the massive parallelism in GPU. It presents a solution that centers around a close cooperation between compiler-directed data placement and hardware-assisted runtime adaptation. The co-design approach helps tap into the full potential of hybrid memory for GPU without requiring dramatic hardware changes over previous designs, yielding 6% and 49% energy saving on average compared to pure DRAM and pure PCM respectively, and keeping performance loss less than 2%.