Inter-warp instruction temporal locality in deep-multithreaded GPUs

  • Authors:
  • Ahmad Lashgar;Amirali Baniasadi;Ahmad Khonsari

  • Affiliations:
  • School of Electrical and Computer Engineering, University College of Engineering, University of Tehran, Tehran, Iran;Electrical and Computer Engineering Department, University of Victoria, Victoria, British Columbia, Canada;School of Electrical and Computer Engineering, University College of Engineering, University of Tehran, Tehran, Iran,School of Computer Science, Institute for Research in Fundamental Sciences, Teh ...

  • Venue:
  • ARCS'13 Proceedings of the 26th international conference on Architecture of Computing Systems
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

GPUs employ thousands of threads per core to achieve high throughput. These threads exhibit localities in control-flow, instruction and data addresses and values. In this study we investigate inter-warp instruction temporal locality and show that during short intervals a significant share of fetched instructions are fetched unnecessarily. This observation provides several opportunities to enhance GPUs. We discuss different possibilities and evaluate filter cache as a case study. Moreover, we investigate how variations in microarchitectural parameters impacts potential filter cache benefits in GPUs.