Memory coherence in shared virtual memory systems
PODC '86 Proceedings of the fifth annual ACM symposium on Principles of distributed computing
Lazy release consistency for software distributed shared memory
ISCA '92 Proceedings of the 19th annual international symposium on Computer architecture
MultiView and Millipage — fine-grain sharing in page-based DSMs
OSDI '99 Proceedings of the third symposium on Operating systems design and implementation
False Sharing and Spatial Locality in Multiprocessor Caches
IEEE Transactions on Computers
Software DSM Protocols that Adapt between Single Writer and Multiple Writer
HPCA '97 Proceedings of the 3rd IEEE Symposium on High-Performance Computer Architecture
Dynamically Controlling False Sharing in Distributed Shared Memory
HPDC '96 Proceedings of the 5th IEEE International Symposium on High Performance Distributed Computing
The relative importance of concurrent writers and weak consistency models
ICDCS '96 Proceedings of the 16th International Conference on Distributed Computing Systems (ICDCS '96)
Hi-index | 0.00 |
Page based software DSMs experience high degrees of false sharing especially in irregular applications with fine grain sharing granularity. The overheads due to false sharing is considered to be the dominant factor limiting the performance of software DSMs. Several approaches have been proposed in the literature to reduce/eliminate false sharing. In this paper, we evaluate two of these approaches, viz., the Multiple Writer approach and Emulated Fine Grain Sharing (EmFiGS) approach. Our evaluation strategy is two pronged: firstly, we use a novel implementation independent analysis which uses overhead counts to compare the different approaches. The performance of EmFiGS approach is significantly worse, by a factor of 1.5 to as much as 90 times, compared to the Multiple Writer Approach. In many cases, EmFiGS approach performs worse that even a single writer lazy release protocol which experiences very high overheads due to false sharing. Our analysis shows that the benefits gained by eliminating false sharing are far outweighed by the performance penalty incurred due to the reduced exploitation of spatial locality in the EmFiGS approach. The implementation independent nature of our analysis implies that any implementation of the EmFiGS approach is likely to perform significantly worse than the Multiple Writer approach. Secondly, we use experimental evaluation to validate and complement our analysis. Experimental results match well with our analysis. Also the execution times of the application follow the same trend as in our analysis, reinforcing our conclusions.