Filtering directory lookups in CMPs

  • Authors:
  • A. Bosque;V. Viñals;P. Ibáñez;J. M. Llaberıa

  • Affiliations:
  • Department of Computer Architecture, Universitat Politécnica de Catalunya (UPC), Barcelona, Spain;Department of Computer Science and Systems Engineering, University of Zaragoza, Zaragoza, Spain;Department of Computer Science and Systems Engineering, University of Zaragoza, Zaragoza, Spain;Department of Computer Architecture, Universitat Politécnica de Catalunya (UPC), Barcelona, Spain

  • Venue:
  • Microprocessors & Microsystems
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Coherence protocols consume an important fraction of power to determine which coherence action to perform. Specifically, on CMPs with shared cache and directory-based coherence protocol implemented as a duplicate of local caches tags, we have observed that a big fraction of directory lookups cause a miss, because the block looked up is not allocated in any local cache. To reduce the number of directory lookups and therefore the power consumption, we propose to add a filter before the directory access. We introduce two filter implementations. In the first one, filtering information is explicitly kept in the shared cache for every block. In the second one, filtering information is decoupled from the shared cache organization, so the filter size does not depend on the shared cache size. We evaluate our filters in a CMP with 8 in-order processors with 4 threads each and a memory hierarchy with write-through local caches and a shared cache. We show that, for SPLASH2 benchmarks, the proposed filters reduce the number of directory lookups performed by 60% while power consumption is reduced by ~28%. For Specweb2005, the number of directory lookups performed is reduced by 68% (44%), while directory power consumption is reduced by 19% (9%) using the first (second) filter implementation.