Optimal two level partitioning and loop scheduling for hiding memory latency for DSP applications

  • Authors:
  • Zhong Wang;Michael Kirkpatrick;Edwin Hsing-Mean Sha

  • Affiliations:
  • Dept of Comp Sci & Engr, University of Notre Dame, Notre Dame, IN;Dept of Comp Sci & Engr, University of Notre Dame, Notre Dame, IN;Dept of Comp Sci & Engr, University of Notre Dame, Notre Dame, IN

  • Venue:
  • Proceedings of the 37th Annual Design Automation Conference
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

The large latency of memory accesses in modern computers is a key obstacle in achieving high processor utilization. To hide this latency, this paper proposes a new memory management technique that can be applied to computer architectures with three levels of memory. The technique takes advantage of access pattern information that is available at compile time by prefetching certain data elements from the higher level memory. It as well maintains certain data for a period of time to prevent unnecessary data swapping. Data locality is much improved compared with the usual pattern by partitioning the iteration space and reducing execution in each partition. These combined approaches lead to improvements in average execution times of approximately 35% over the one-level partition algorithm and more than 80% over list scheduling and hardware prefetching.