VAXcluster: a closely-coupled distributed system
ACM Transactions on Computer Systems (TOCS)
Experience with Parallel Computing on the AN2 Network
IPPS '96 Proceedings of the 10th International Parallel Processing Symposium
Digital's clusters and scientific parallel applications
COMPCON '96 Proceedings of the 41st IEEE International Computer Conference
Overview of Digital UNIX Cluster Architecture
COMPCON '96 Proceedings of the 41st IEEE International Computer Conference
Shasta: a low overhead, software-only approach for supporting fine-grain shared memory
Proceedings of the seventh international conference on Architectural support for programming languages and operating systems
IEEE Transactions on Parallel and Distributed Systems
Evaluation of hardware write propagation support for next-generation shared virtual memory clusters
ICS '98 Proceedings of the 12th international conference on Supercomputing
UTLB: a mechanism for address translation on network interfaces
Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
ISCA '99 Proceedings of the 26th annual international symposium on Computer architecture
Fast cluster failover using virtual memory-mapped communication
ICS '99 Proceedings of the 13th international conference on Supercomputing
User-space communication: a quantitative study
SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Experiences with VI communication for database storage
ISCA '02 Proceedings of the 29th annual international symposium on Computer architecture
Using the Memory Channel Network
IEEE Micro
CableS: Thread Control and Memory System Extensions for Shared Virtual Memory Clusters
WOMPAT '01 Proceedings of the International Workshop on OpenMP Applications and Tools: OpenMP Shared Memory Parallel Programming
miNI: reducing network interface memory requirements with dynamic handle lookup
ICS '03 Proceedings of the 17th annual international conference on Supercomputing
Journal of Parallel and Distributed Computing
IEEE Transactions on Parallel and Distributed Systems
Efficient remote block-level I/O over an RDMA-capable NIC
Proceedings of the 20th annual international conference on Supercomputing
Hi-index | 0.00 |
The parallel performance of a cluster is often limited by performance compromises in conventional networks. This paper discusses the benefits of a more specialized networking approach and the resulting design of a low-cost, high-performance interconnect optimized specifically to enhance both the parallel performance and high-availability aspects of a cluster. This interconnect, Memory Channel Network for PCI, attaches to the industry-standard PCI bus found in most computers. The first-generation product, running on a standard Digital UNIX Cluster, provides cluster communication latency and overhead improvements of more than 50 and 1000 times respectively, compared to conventional networks.