MorphCache: A Reconfigurable Adaptive Multi-level Cache hierarchy

Authors:
Shekhar Srikantaiah;Emre Kultursay;Tao Zhang;Mahmut Kandemir;Mary Jane Irwin;Yuan Xie
Affiliations:
The Pennsylvania State University, University Park, PA - 16802;The Pennsylvania State University, University Park, PA - 16802;The Pennsylvania State University, University Park, PA - 16802;The Pennsylvania State University, University Park, PA - 16802;The Pennsylvania State University, University Park, PA - 16802;The Pennsylvania State University, University Park, PA - 16802
Venue:
HPCA '11 Proceedings of the 2011 IEEE 17th International Symposium on High Performance Computer Architecture
Year:
2011

Citing 0
Cited 3

Bandwidth-aware reconfigurable cache design with hybrid memory technologies

Proceedings of the International Conference on Computer-Aided Design
Design space exploration of workload-specific last-level caches

Proceedings of the 2012 ACM/IEEE international symposium on Low power electronics and design
A survey on cache tuning from a power/energy perspective

ACM Computing Surveys (CSUR)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Given the diverse range of application characteristics that chip multiprocessors (CMPs) need to cater to, a "one-cache-topology-fits-all" design philosophy will clearly be inadequate. In this paper, we propose MorphCache, a Reconfigurable Adaptive Multi-level Cache hierarchy. Mor-phCache dynamically tunes a multi-level cache topology in a CMP to allow significantly different cache topologies to exist on the same architecture. Starting from per-core L2 and L3 cache slices as the basic design point, MorphCache alters the cache topology dynamically by merging or splitting cache slices and modifying the accessibility of different cache slice groups to different cores in a CMP. We evaluated MorphCache on a 16 core CMP on a full system simulator and found that it significantly improves both average throughput and harmonic mean of speedups of diverse multithreaded and multiprogrammed workloads. Specifically, our results show that MorphCache improves throughput of the multiprogrammed mixes by 29.9% over a topology with all-shared L2 and L3 caches and 27.9% over a topology with per core private L2 cache and shared L3 cache. In addition, we also compared MorphCache to partitioning a single shared cache at each level using promotion/insertion pseudo-partitioning (PIPP) [28] and managing per-core private cache at each level using dynamic spill receive caches (DSR) [18]. We found that MorphCache improves average throughput by 6.6% over PIPP and by 5.7% over DSR when applied to both L2 and L3 caches.