Moguls: a model to explore the memory hierarchy for bandwidth improvements

  • Authors:
  • Guangyu Sun;Christopher J. Hughes;Changkyu Kim;Jishen Zhao;Cong Xu;Yuan Xie;Yen-Kuang Chen

  • Affiliations:
  • Pennsylvania State University, State College, PA., USA;Intel Labs, Santa Clara, CA., USA;Intel Labs, Santa Clara, CA., USA;Pennsylvania State University, State College, PA., USA;Pennsylvania State University, State College, PA., USA;Pennsylvania State University, State College, PA., USA;Intel Labs, Santa Clara, CA., USA

  • Venue:
  • Proceedings of the 38th annual international symposium on Computer architecture
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

In recent years, the increasing number of processor cores and limited increases in main memory bandwidth have led to the problem of the bandwidth wall, where memory bandwidth is becoming a performance bottleneck. This is especially true for emerging latency-insensitive, bandwidth-sensitive applications. Designing the memory hierarchy for a platform with an emphasis on maximizing bandwidth within a fixed power budget becomes one of the key challenges. To facilitate architects to quickly explore the design space of memory hierarchies, we propose an analytical performance model called Moguls. The Moguls model estimates the performance of an application on a system, using the bandwidth demand of the application for a range of cache capacities and the bandwidth provided by the system with those capacities. We show how to extend this model with appropriate approximations to optimize a cache hierarchy under a power constraint. The results show how many levels of cache should be designed, and what the capacity, bandwidth, and technology of each level should be. In addition, we study memory hierarchy design with hybrid memory technologies, which shows the benefits of using multiple technologies for future computing systems.