Hierarchical overlapped tiling

  • Authors:
  • Xing Zhou;Jean-Pierre Giacalone;María Jesús Garzarán;Robert H. Kuhn;Yang Ni;David Padua

  • Affiliations:
  • University of Illinois at Urbana-Champaign;Intel Corporation;University of Illinois at Urbana-Champaign;Intel Corporation;Intel Corporation;University of Illinois at Urbana-Champaign

  • Venue:
  • Proceedings of the Tenth International Symposium on Code Generation and Optimization
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper introduces hierarchical overlapped tiling, a transformation that applies loop tiling and fusion to conventional loops. Overlapped tiling is a useful transformation to reduce communication overhead, but it may also generate a significant amount of redundant computation. Hierarchical overlapped tiling performs overlapped tiling hierarchically to balance communication overhead and redundant computation, and thus has the potential to provide better performance. In this paper, we describe the hierarchical overlapped tiling optimization and its implementation in an OpenCL compiler. We also evaluate the effectiveness of this optimization using 8 programs that implement different forms of stencil computation. Our results show that hierarchical overlapped tiling achieves an average 37% speedup over traditional tiling on a 32-core workstation.