Task allocation onto a hypercube by recursive mincut bipartitioning
Journal of Parallel and Distributed Computing
Embedding Rectangular Grids into Square Grids with Dilation Two
IEEE Transactions on Computers
Embedding Rectangular Grids Into Square Grids
IEEE Transactions on Computers
Embeddings among meshes and tori
Journal of Parallel and Distributed Computing
Dilation-5 embedding of 3-dimensional grids into hypercubes
Journal of Parallel and Distributed Computing
Fast and parallel mapping algorithms for irregular problems
The Journal of Supercomputing
Efficient Embeddings of Grids into Grids
WG '98 Proceedings of the 24th International Workshop on Graph-Theoretic Concepts in Computer Science
Rank Reordering Strategy for MPI Topology Creation Functions
Proceedings of the 5th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Implementing the MPI process topology mechanism
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
An overview of the BlueGene/L Supercomputer
Proceedings of the 2002 ACM/IEEE conference on Supercomputing
An Approach for Torus Embedding
ICPP '99 Proceedings of the 1999 International Workshops on Parallel Processing
Mapping Strategies for Switch-Based Cluster Systems of Irregular Topology
ICPADS '01 Proceedings of the Eighth International Conference on Parallel and Distributed Systems
MPI: A Message-Passing Interface Standard
MPI: A Message-Passing Interface Standard
Optimizing task layout on the Blue Gene/L supercomputer
IBM Journal of Research and Development
Performance effects of node mappings on the IBM bluegene/l machine
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
A study of the effects of machine geometry and mapping on distributed transpose performance
Proceedings of the 5th conference on Computing frontiers
Process Mapping for MPI Collective Communications
Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
A Case Study of Communication Optimizations on 3D Mesh Interconnects
Euro-Par '09 Proceedings of the 15th International Euro-Par Conference on Parallel Processing
Mapping semigroup array operations onto multicomputer with torus topology
Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication
Generic topology mapping strategies for large-scale parallel architectures
Proceedings of the international conference on Supercomputing
Mapping communication layouts to network hardware characteristics on massive-scale blue gene systems
Computer Science - Research and Development
Optical Memory and Neural Networks
PaCT'11 Proceedings of the 11th international conference on Parallel computing technologies
Scalable node allocation for improved performance in regular and anisotropic 3D torus supercomputers
EuroMPI'11 Proceedings of the 18th European MPI Users' Group conference on Recent advances in the message passing interface
Avoiding hot-spots on two-level direct networks
Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis
Productive Parallel Linear Algebra Programming with Unstructured Topology Adaption
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Mapping applications with collectives over sub-communicators on torus networks
SC '12 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Designing energy efficient communication runtime systems: a view from PGAS models
The Journal of Supercomputing
Improving performance of openSHMEM reference library by portable PE mapping technique
Proceedings of the 27th international ACM conference on International conference on supercomputing
Optimized process placement for collective I/O operations
Proceedings of the 20th European MPI Users' Group Meeting
Advancing application process affinity experimentation: open MPI's LAMA-based affinity interface
Proceedings of the 20th European MPI Users' Group Meeting
Task mapping in rectangular twisted tori
Proceedings of the High Performance Computing Symposium
Performance analysis of asynchronous Jacobi's method implemented in MPI, SHMEM and OpenMP
International Journal of High Performance Computing Applications
Hi-index | 0.00 |
Mapping virtual processes onto physical processos is one of the most important issues in parallel computing. The problem of mapping of processes/tasks onto processors is equivalent to the graph embedding problem which has been studied extensively. Although many techniques have been proposed for embeddings of two-dimensional grids, hypercubes, etc., there are few efforts on embeddings of three-dimensional grids and tori. Motivated for better support of task mapping for Blue Gene/L supercomputer, in this paper, we present embedding and integration techniques for the embeddings of three-dimensional grids and tori. The topology mapping library that based on such techniques generates high-quality embeddings of two/three-dimensional grids/tori. In addition, the library is used in BG/L MPI library for scalable support of MPI topology functions. With extensive empirical studies on large scale systems against popular benchmarks and real applications, we demonstrate that the library can significantly improve the communication performance and the scalability of applications.