BIP-SMP: high performance message passing over a cluster of commodity SMPs
SC '99 Proceedings of the 1999 ACM/IEEE conference on Supercomputing
Future Generation Computer Systems
MPI: A Message-Passing Interface Standard
MPI: A Message-Passing Interface Standard
Direct Cache Access for High Bandwidth Network I/O
Proceedings of the 32nd annual international symposium on Computer Architecture
LiMIC: Support for High-Performance MPI Intra-node Communication on Linux Cluster
ICPP '05 Proceedings of the 2005 International Conference on Parallel Processing
Data Transfers between Processes in an SMP System: Performance Study and Application to MPI
ICPP '06 Proceedings of the 2006 International Conference on Parallel Processing
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
ICPP '08 Proceedings of the 2008 37th International Conference on Parallel Processing
Performance Analysis and Evaluation of PCIe 2.0 and Quad-Data Rate InfiniBand
HOTI '08 Proceedings of the 2008 16th IEEE Symposium on High Performance Interconnects
CLUSTER '07 Proceedings of the 2007 IEEE International Conference on Cluster Computing
Efficient asynchronous memory copy operations on multi-core systems and I/OAT
CLUSTER '07 Proceedings of the 2007 IEEE International Conference on Cluster Computing
High Throughput Intra-Node MPI Communication with Open-MX
PDP '09 Proceedings of the 2009 17th Euromicro International Conference on Parallel, Distributed and Network-based Processing
Cache-Efficient, Intranode, Large-Message MPI Communication with MPICH2-Nemesis
ICPP '09 Proceedings of the 2009 International Conference on Parallel Processing
Fine-Grained Multithreading Support for Hybrid Threaded MPI Programming
International Journal of High Performance Computing Applications
hwloc: A Generic Framework for Managing Hardware Affinities in HPC Applications
PDP '10 Proceedings of the 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing
Process Distance-Aware Adaptive MPI Collective Communications
CLUSTER '11 Proceedings of the 2011 IEEE International Conference on Cluster Computing
Kernel Assisted Collective Intra-node MPI Communication among Multi-Core and Many-Core CPUs
ICPP '11 Proceedings of the 2011 International Conference on Parallel Processing
IPDPS '12 Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium
Hybrid MPI: efficient message passing for multi-core systems
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Hi-index | 0.00 |
The multiplication of cores in today's architectures raises the importance of intra-node communication in modern clusters and their impact on the overall parallel application performance. Although several proposals focused on this issue in the past, there is still a need for a portable and hardware-independent solution that addresses the requirements of both point-to-point and collective MPIoperations inside shared-memory computing nodes. This paper presents the KNEM module for the Linux kernel that provides MPI implementations with a flexible and scalable interface for performing kernel-assisted single-copy data transfers between local processes. It enables high-performance communication within most existing MPI implementations and brings significant application performance improvements thanks to more efficient point-to-point and collective operations.