Virtual memory mapped network interface for the SHRIMP multicomputer
ISCA '94 Proceedings of the 21st annual international symposium on Computer architecture
Construction of staples in lattice gauge theory on a parallel computer
Parallel Computing
Using prediction to accelerate coherence protocols
Proceedings of the 25th annual international symposium on Computer architecture
MPI-LAPI: An Efficient Implementation of MPI for IBM RS/6000 SP Systems
IEEE Transactions on Parallel and Distributed Systems
CANPC '98 Proceedings of the Second International Workshop on Network-Based Parallel Computing: Communication, Architecture, and Applications
Efficient Communication Using Message Prediction for Cluster Multiprocessors
CANPC '00 Proceedings of the 4th International Workshop on Network-Based Parallel Computing: Communication, Architecture, and Applications
Architectural Extensions to Support Efficient Communication Using Message Prediction
HPCS '02 Proceedings of the 16th Annual International Symposium on High Performance Computing Systems and Applications
ATEC '96 Proceedings of the 1996 annual conference on USENIX Annual Technical Conference
High-performance local area communication with fast sockets
ATEC '97 Proceedings of the annual conference on USENIX Annual Technical Conference
Lazy direct-to-cache transfer during receive operations in a message passing environment
Proceedings of the 3rd conference on Computing frontiers
Microprocessors & Microsystems
ISPA'07 Proceedings of the 5th international conference on Parallel and Distributed Processing and Applications
Hi-index | 0.00 |
The focus of this work is on techniques that promise to reduce the message delivery latency in message passing environments. The main contributors to message delivery latency in message passing environments are the copying operations needed to transfer and bind a received message to the consuming process/thread. To reduce this copying overhead and to reach to finer granularity, we introduced architectural extensions comprising of specialized network cache and instructions to manage the operations of this extension. In this work we study the caching environment. Our simulations show that messages can be bound and transferred into the data cache where they persist long enough to be consumed. We also study the structure of the required network cache and show that a small capacity cache is sufficient.