Optimizing the memory bandwidth with loop fusion

  • Authors:
  • Paul Marchal;José Ignacio Gómez;Francky Catthoor

  • Affiliations:
  • IMEC/KULEUVEN, Heverlee, Belgium;DACYA U.C.M., Madrid, Spain;IMEC/KULEUVEN, Heverlee, Belgium

  • Venue:
  • Proceedings of the 2nd IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

The memory bandwidth largely determines the performance and energy cost of embedded systems. At the compiler level, several techniques improve the memory bandwidth at the scope of a basic block, but often fail to exploit all. We propose a technique to optimize the memory bandwidth across the boundaries of a basic block. Our technique incrementally fuses loops to better use the available bandwidth. The resulting performance depends on how the data is assigned to the memories of the memory layer. At the same time, the assignment also strongly influences the energy cost. Therefore, we combine in our approach the fusion and assignment decisions. Designers can use our output to trade-off the energy cost with the system's performance.