Numerical recipes in C (2nd ed.): the art of scientific computing
Numerical recipes in C (2nd ed.): the art of scientific computing
Flattening on the Fly: Efficient Handling of MPI Derived Datatypes
Proceedings of the 6th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Implementing and Benchmarking Derived Datatypes in Metacomputing
HPCN Europe 2001 Proceedings of the 9th International Conference on High-Performance Computing and Networking
Applying MPI Derived Datatypes to the NAS Benchmarks: A Case Study
ICPPW '04 Proceedings of the 2004 International Conference on Parallel Processing Workshops
International Journal of High Performance Computing Applications
Proceedings of the 22nd annual international conference on Supercomputing
Performance engineering: a must for petascale and beyond
Proceedings of the third international workshop on Large-scale system and application performance
Bridging performance analysis tools and analytic performance modeling for HPC
Euro-Par 2010 Proceedings of the 2010 conference on Parallel processing
Using MPI derived datatypes in numerical libraries
EuroMPI'11 Proceedings of the 18th European MPI Users' Group conference on Recent advances in the message passing interface
Performance expectations and guidelines for MPI derived datatypes
EuroMPI'11 Proceedings of the 18th European MPI Users' Group conference on Recent advances in the message passing interface
Performance modeling for systematic performance tuning
State of the Practice Reports
Concurrency and Computation: Practice & Experience
MPI 3 and beyond: why MPI is successful and what challenges it faces
EuroMPI'12 Proceedings of the 19th European conference on Recent Advances in the Message Passing Interface
Micro-applications for communication data access patterns and MPI datatypes
EuroMPI'12 Proceedings of the 19th European conference on Recent Advances in the Message Passing Interface
A hybrid parallelization of air quality model with MPI and OpenMP
EuroMPI'12 Proceedings of the 19th European conference on Recent Advances in the Message Passing Interface
MPI datatype processing using runtime compilation
Proceedings of the 20th European MPI Users' Group Meeting
Enabling highly-scalable remote memory access programming with MPI-3 one sided
SC '13 Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis
Hi-index | 0.00 |
Many parallel applications need to communicate noncontiguous data. Most applications manually copy (pack/unpack) data before communications even though MPI allows a zero-copy specification. In this work, we study two complex use-cases: (1) Fast Fourier Transformation where we express a local memory transpose as part of the datatype, and (2) a conjugate gradient solver with a checkerboard layout that requires multiple nested datatypes. We demonstrate significant speedups up to a factor of 3.8 and 18%, respectively, in both cases. Our work can be used as a template to utilize datatypes for application developers. For MPI implementers, we show two practically relevant access patterns that deserve special optimization.