Over the last decade, InfiniBand has become an increasingly popular interconnect for deploying modern supercomputing systems. However, no existing detection service can discover the underlying network topology in a scalable manner and expose it conveniently to runtime libraries and users of high performance computing systems. In this paper, we design a novel and scalable method to detect the InfiniBand network topology using the Neighbor-Joining (NJ) technique. To the best of our knowledge, this is the first application of the neighbor-joining algorithm to the problem of detecting InfiniBand network topology. We also design a network-topology-aware MPI library that takes advantage of the network topology service. The library places the processes of an MPI job in a network-topology-aware manner, with the dual aim of increasing intra-node communication and reducing long-distance inter-node communication across the InfiniBand fabric.
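As a rough illustration of the neighbor-joining idea mentioned above, the following sketch runs the textbook NJ algorithm on a small matrix of pairwise host distances and recovers a tree whose internal nodes stand in for switches. It is not the paper's implementation: the use of hop counts as the distance metric, the function name `neighbor_joining`, the host labels, and the `sw<N>` naming of inferred nodes are all assumptions made for this example.

```python
from itertools import combinations


def neighbor_joining(names, dist):
    """Textbook neighbor joining.

    names: list of leaf labels (here: hypothetical host names).
    dist:  dict mapping frozenset({a, b}) -> distance (assumed: hop count).
    Returns a list of tree edges (node, node, length).
    """
    d = dict(dist)          # working copy; grows as internal nodes are added
    active = list(names)    # nodes not yet joined
    edges = []
    next_id = 0
    while len(active) > 2:
        n = len(active)
        # Row sums of the current distance matrix.
        r = {a: sum(d[frozenset((a, b))] for b in active if b != a)
             for a in active}
        # Pick the pair minimising Q(i, j) = (n - 2) * d(i, j) - r_i - r_j.
        i, j = min(combinations(active, 2),
                   key=lambda p: (n - 2) * d[frozenset(p)] - r[p[0]] - r[p[1]])
        dij = d[frozenset((i, j))]
        u = f"sw{next_id}"  # inferred internal node (hypothetical label)
        next_id += 1
        # Branch lengths from i and j to the new node u.
        li = 0.5 * dij + (r[i] - r[j]) / (2 * (n - 2))
        lj = dij - li
        edges += [(i, u, li), (j, u, lj)]
        # Distances from u to every remaining node.
        for k in active:
            if k not in (i, j):
                d[frozenset((u, k))] = 0.5 * (
                    d[frozenset((i, k))] + d[frozenset((j, k))] - dij)
        active = [k for k in active if k not in (i, j)] + [u]
    a, b = active
    edges.append((a, b, d[frozenset((a, b))]))
    return edges


# Assumed toy fabric: h0 and h1 share one leaf switch, h2 and h3 another.
hosts = ["h0", "h1", "h2", "h3"]
hops = {frozenset(p): 4 for p in combinations(hosts, 2)}
hops[frozenset(("h0", "h1"))] = 2
hops[frozenset(("h2", "h3"))] = 2

edges = neighbor_joining(hosts, hops)
parent = {a: b for a, b, _ in edges if a in hosts}
```

In this toy input, hosts that are two hops apart end up under the same inferred node, which is the kind of grouping a topology-aware placement could use to keep heavily communicating processes close together.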