A Compiler Framework for Tiling Imperfectly-Nested Loops

  • Authors:
  • Yonghong Song;Zhiyuan Li

  • Affiliations:
  • -;-

  • Venue:
  • LCPC '99 Proceedings of the 12th International Workshop on Languages and Compilers for Parallel Computing
  • Year:
  • 1999

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents an integrated compiler framework for tiling a class of nontrivial imperfectly-nested loops such that cache locality is improved. We develop a new memory cost model to analyze data reuse in terms of both the cache and the TLB, based on which we compute the tile size with or without array duplication. We determine whether to duplicate arrays for tiling by comparing the respective exploited reuse factors. The preliminary results with several benchmark programs show that the transformed programs achieve a speedup of 1.09 to 3.82 over the original programs.