Operating two InfiniBand grid clusters over 28 km distance
International Journal of Grid and Utility Computing
We discuss operational and organizational issues of an InfiniBand interconnection between two clusters over a distance of 28 km in day-to-day production use. We describe the setup of the hardware and networking components and how we solved the technical integration problems. We then present a federated authorization system for the clusters at our two participating universities, along with solutions to other organizational integration problems. We also present performance measurements for MPI communication and for file access to Lustre storage systems. The results and a simple performance model show that MPI performance across the long-distance interconnection is intrinsically poor because of its limited bandwidth. However, file access and MPI communication among nodes on the same side are barely affected by the limitations of the interconnection, even at high load. Our organizational and technical setup allows the two clusters to be operated as a single system with lower administration costs and better load balance than in a disconnected setup.
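The abstract does not spell out its simple performance model. As a rough illustration only, the following sketch uses a generic latency-plus-bandwidth model, which may differ from the authors' model. The fiber signal speed (about 200,000 km/s), the fixed local latency, and the link bandwidth are assumptions for the example, not values from the paper, and all function names are hypothetical.

```python
# Generic latency-plus-bandwidth sketch. This is NOT the model from the paper;
# every constant below is an assumption chosen for illustration.

FIBER_SPEED_KM_PER_S = 200_000.0  # assumed: light in glass travels at about 2/3 c


def one_way_delay_s(distance_km, base_latency_s=2e-6):
    """Propagation delay over fiber plus an assumed fixed local latency."""
    return base_latency_s + distance_km / FIBER_SPEED_KM_PER_S


def transfer_time_s(message_bytes, distance_km, bandwidth_bytes_per_s):
    """Time to deliver one message: one-way delay plus serialization time."""
    return one_way_delay_s(distance_km) + message_bytes / bandwidth_bytes_per_s


def effective_bandwidth(message_bytes, distance_km, bandwidth_bytes_per_s):
    """Achieved throughput in bytes per second for a single message."""
    return message_bytes / transfer_time_s(
        message_bytes, distance_km, bandwidth_bytes_per_s
    )


if __name__ == "__main__":
    link = 1e9  # assumed link bandwidth in bytes per second
    for size in (1024, 1024 * 1024):
        local = effective_bandwidth(size, 0.0, link)
        remote = effective_bandwidth(size, 28.0, link)
        print(f"{size:>8} B  local {local:.3e} B/s  28 km {remote:.3e} B/s")
```

Under these assumptions, the propagation delay over 28 km alone is about 140 microseconds one way, so small, latency-bound messages lose far more throughput than large transfers do. This is consistent with the qualitative claim that MPI traffic across the link performs poorly.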