This paper proposes a scheme for realizing a high-performance communication facility on a commodity network. The scheme requires no special hardware and no hardware-specific device drivers, so it can adapt to many kinds of network interface cards. In this scheme, a reliable lightweight network protocol is handled at the data link layer, which is called directly by the network device driver. An interrupt reaping technique is proposed to eliminate the hardware interrupt overhead incurred while an application waits for a message. PM/Ethernet, an instance of the scheme, is implemented on Linux with minimal modification to the Linux kernel, and existing network device drivers are used without any modification. On 500 MHz Pentium III PCs with Packet Engine's G-NIC II Gigabit Ethernet NICs, it achieves 77.5 MB/s bandwidth and 37.6 µsec round-trip latency, compared with 46.7 MB/s bandwidth and 89.6 µsec round-trip latency for TCP/IP. The NAS parallel benchmark IS results show that MPI on PM/Ethernet achieves 75% better performance than MPI on TCP/IP and is 7.8% slower than MPI on Myrinet PM.
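To put the reported figures side by side, the following minimal sketch computes the relative bandwidth and latency of PM/Ethernet against TCP/IP from the numbers quoted above. The variable and function names are illustrative assumptions and do not come from the paper; only the four measurements are taken from the abstract.

```python
# Measurements quoted in the abstract (500 MHz Pentium III, G-NIC II Gigabit Ethernet).
PM_BANDWIDTH_MBS = 77.5    # PM/Ethernet bandwidth, MB/s
TCP_BANDWIDTH_MBS = 46.7   # TCP/IP bandwidth, MB/s
PM_RTT_USEC = 37.6         # PM/Ethernet round-trip latency, microseconds
TCP_RTT_USEC = 89.6        # TCP/IP round-trip latency, microseconds


def improvement_factor(better: float, worse: float) -> float:
    """Return how many times larger `better` is than `worse` (hypothetical helper)."""
    return better / worse


# Higher bandwidth is better, so divide PM/Ethernet by TCP/IP.
bandwidth_factor = improvement_factor(PM_BANDWIDTH_MBS, TCP_BANDWIDTH_MBS)

# Lower latency is better, so divide TCP/IP by PM/Ethernet.
latency_factor = improvement_factor(TCP_RTT_USEC, PM_RTT_USEC)

# Fraction of the TCP/IP round-trip time that PM/Ethernet removes.
latency_reduction = (TCP_RTT_USEC - PM_RTT_USEC) / TCP_RTT_USEC

print(f"bandwidth: {bandwidth_factor:.2f}x that of TCP/IP")
print(f"latency:   {latency_factor:.2f}x lower than TCP/IP")
print(f"round-trip time reduced by {latency_reduction:.1%}")
```

Under these figures, PM/Ethernet delivers roughly 1.66 times the bandwidth of TCP/IP at roughly 2.38 times lower round-trip latency. The NAS IS percentages are not recomputed here because the abstract does not give the underlying timings.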