The analysis of cluster interconnect with the network tests2 toolkit

  • Authors:
  • Alexey Salnikov;Dmitry Andreev;Roman Lebedev

  • Affiliations:
  • Lomonosov Moscow State University, Dorodnicyn Computing Centre of RAS, Moscow National research nuclear university "MEPhI";Lomonosov Moscow State University, Dorodnicyn Computing Centre of RAS, Moscow National research nuclear university "MEPhI";Lomonosov Moscow State University, Dorodnicyn Computing Centre of RAS, Moscow National research nuclear university "MEPhI"

  • Venue:
  • EuroMPI'11 Proceedings of the 18th European MPI Users' Group conference on Recent advances in the message passing interface
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

The article discusses MPI-2 tools for benchmarking and extracting information on features of interconnect in HPC clusters. Authors develop a toolkit named "network tests2". This toolkit highlights hidden cluster's topology, illuminates the so-called "jump points" in latency during message transfer, allows user to search defective cluster nodes and so on. The toolkit consists of several programs. The first one is an MPI-program that performs message transfer in several modes to provide certain communication activity or benchmarking of a chosen MPI-function and collects some statistics. The output of this program is a set of communicative matrices which are stored as a NetCDF file. The toolkit includes programs that perform data clustering and provide GUI for visualisation and comparison of results obtained from different clusters. This article touches some results obtained from Russian supercomputers such as Lomonosov T500 system. We also present data on Infiniband Mellanox and Blue Gene/P interconnect technologies.