Cache sensitive modulo scheduling

  • Authors:
  • F. Jesús Sánchez;Antonio González

  • Affiliations:
  • Department of Computer Architecture, Universitat Politècnica de Catalunya, Campus Nord - c./ Jordi Girona, 1-3 - Mòdul D6, 08034 - Barcelona, Spain;Department of Computer Architecture, Universitat Politècnica de Catalunya, Campus Nord - c./ Jordi Girona, l-3 - Mòdul D6, 08034 - Barcelona, Spain

  • Venue:
  • MICRO 30 Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture
  • Year:
  • 1997

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper focuses on the interaction between software prefetching (both binding and nonbinding) and software pipelining for VLIW machines. First, it is shown that evaluating software pipelined schedules without considering memory effects can be rather inaccurate due to stalls caused by dependences with memory instructions (even if a lockup-free cache is considered). It is also shown that the penalty of the stalls is in general higher than the effect of spill code. Second, we show that in general binding schemes are more powerful than nonbinding ones for software pipelined schedules. Finally, the main contribution of this paper is an heuristic scheme that schedules some memory operations according to the locality estimated at compile time and other attributes of the dependence graph. The proposed scheme is shown to outperform other heuristic approaches since it achieves a better trade-off between compute and stall time than the others.