Latency Analyses of CC-NUMA and CC-COMA Rings

  • Authors:
  • Xiaodong Zhang;Yong Yan

  • Affiliations:
  • The University of Texas at San Antonio, USA;The University of Texas at San Antonio, USA

  • Venue:
  • ICPP '94 Proceedings of the 1994 International Conference on Parallel Processing - Volume 01
  • Year:
  • 1994

Abstract

This paper focuses on comparative performance modeling and evaluation of CC-NUMA and CC-COMA on a hierarchical ring shared-memory architecture. Intensive performance measurements of the two models have been conducted on the KSR-1. The experimental results support the analytical models and provide practical observations and comparisons of the two cache-coherent memory systems. Our analytical and experimental results show that a CC-COMA system balances the workload well. However, the overhead of frequent data movement may offset the gains obtained from improved load balance. Although a CC-NUMA system may not automatically balance the load at the system level, it gives the user the option of explicitly managing data locality for a possible performance improvement.