HPCA '95 Proceedings of the 1st IEEE Symposium on High-Performance Computer Architecture
This paper focuses on comparative performance modeling and evaluation of CC-NUMA and CC-COMA on a hierarchical-ring shared-memory architecture. Intensive performance measurements of the two models have been conducted on the KSR-1. The experimental results support the analytical models and provide practical observations and comparisons of the two cache-coherent memory systems. Our analytical and experimental results show that a CC-COMA system balances the workload well. However, the overhead of frequent data movement may offset the gains obtained from improved load balance. Although a CC-NUMA system may not automatically balance the load at the system level, it gives the user the option of handling data locality explicitly for a possible performance improvement.