A cache coherence scheme with fast selective invalidation
ISCA '88 Proceedings of the 15th Annual International Symposium on Computer architecture
SPLASH: Stanford parallel applications for shared-memory
ACM SIGARCH Computer Architecture News
Lazy release consistency for software distributed shared memory
ISCA '92 Proceedings of the 19th annual international symposium on Computer architecture
Distributed operating systems
The SPLASH-2 programs: characterization and methodological considerations
ISCA '95 Proceedings of the 22nd annual international symposium on Computer architecture
Message passing versus distributed shared memory on networks of workstations
Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
Scope consistency: a bridge between release consistency and entry consistency
Proceedings of the eighth annual ACM symposium on Parallel algorithms and architectures
OSDI '96 Proceedings of the second USENIX symposium on Operating systems design and implementation
Dynamic data prefetching in home-based software DSMs
Journal of Computer Science and Technology
Optimizing Home-Based Software DSM Protocols
Cluster Computing
JIAJIA: A Software DSM System Based on a New Cache Coherence Protocol
HPCN Europe '99 Proceedings of the 7th International Conference on High-Performance Computing and Networking
Reducing System Overheads in Home-based Software DSMs
IPPS '99/SPDP '99 Proceedings of the 13th International Symposium on Parallel Processing and the 10th Symposium on Parallel and Distributed Processing
A Comparison of Two Strategies of Dynamic Data Prefetching in Software DSM
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Write Detection in Home-Based Software DSMs
Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
An Efficient Lock Protocol for Home-Based Lazy Release Consistency
CCGRID '01 Proceedings of the 1st International Symposium on Cluster Computing and the Grid
MigThread: Thread Migration in DSM Systems
ICPPW '02 Proceedings of the 2002 International Conference on Parallel Processing Workshops
Moving Home-Based Lazy Release Consistency for Shared Virtual Memory Systems
ICPP '99 Proceedings of the 1999 International Conference on Parallel Processing
Evaluation of the JIAJIA Software DSM System on High Performance Computer Architectures
HICSS '99 Proceedings of the Thirty-second Annual Hawaii International Conference on System Sciences-Volume 8 - Volume 8
Homeless and home-based Lazy Release Consistency protocols on Distributed Shared Memory
ACSC '04 Proceedings of the 27th Australasian conference on Computer science - Volume 26
Proceedings of the 18th annual international conference on Supercomputing
On Design of Agent Home Scheme for Prefetching Strategy in DSM Systems
AINA '05 Proceedings of the 19th International Conference on Advanced Information Networking and Applications - Volume 1
Understanding The Linux Kernel
Understanding The Linux Kernel
On the Design and Implementation of an Effective Prefetch Strategy for DSM Systems
The Journal of Supercomputing
Distributed Systems: Principles and Paradigms (2nd Edition)
Distributed Systems: Principles and Paradigms (2nd Edition)
Hi-index | 0.00 |
In recent years, cluster computing has been widely investigated and there is no doubt that it can provide a cost-effective computing infrastructure by aggregating computational power, communication, and storage resources. Moreover, it is also considered to be a very attractive platform for low-cost supercomputing. Distributed shared memory (DSM) systems utilize the physical memory of each computing node interconnected in a private network to form a global virtual shared memory. Since this global shared memory is distributed among the computing nodes, accessing the data located in remote computing nodes is an absolute necessity. However, this action will result in significant remote memory access latencies which are major sources of overhead in DSM systems. For these reasons, in order to increase overall system performance and decrease this overhead, a number of strategies have been devised. Prefetching is one such approach which can reduce latencies, although it always increases the workload in the home nodes. In this paper, we propose a scheme named Agent Home Scheme. Its most noticeable feature, when compared to other schemes, is that the agent home distributes the workloads of each computing nodes when sending data. By doing this, we can reduce not only the workload of the home nodes by balancing the workload for each node, but also the waiting time. Experimental results show that the proposed method can obtain about 20% higher performance than the original JIAJIA, about 18% more than History Prefetching Strategy (HPS), and about 10% higher than Effective Prefetch Strategy (EPS).