A balanced code placement framework
ACM Transactions on Programming Languages and Systems (TOPLAS)
Optimizing compilers for modern architectures: a dependence-based approach
Optimizing compilers for modern architectures: a dependence-based approach
A New Approach to Array Redistribution: Strip Mining Redistribution
PARLE '94 Proceedings of the 6th International PARLE Conference on Parallel Architectures and Languages Europe
Proceedings of the 14th International Conference on Parallel Architectures and Compilation Techniques
Data-Flow Analysis for MPI Programs
ICPP '06 Proceedings of the 2006 International Conference on Parallel Processing
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
Barrier matching for programs with textually unaligned barriers
Proceedings of the 12th ACM SIGPLAN symposium on Principles and practice of parallel programming
Implementation and performance analysis of non-blocking collective operations for MPI
Proceedings of the 2007 ACM/IEEE conference on Supercomputing
A Simple, Pipelined Algorithm for Large, Irregular All-gather Problems
Proceedings of the 15th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Engineering A Compiler
MPI-aware compiler optimizations for improving communication-computation overlap
Proceedings of the 23rd international conference on Supercomputing
Communication-Sensitive Static Dataflow for Parallel Message Passing Applications
Proceedings of the 7th annual IEEE/ACM International Symposium on Code Generation and Optimization
Transforming MPI source code based on communication patterns
Future Generation Computer Systems
Overlapping communication and computation by using a hybrid MPI/SMPSs approach
Proceedings of the 24th ACM International Conference on Supercomputing
Overlapping Computation and Communication for Advection on Hybrid Parallel Computers
IPDPS '11 Proceedings of the 2011 IEEE International Parallel & Distributed Processing Symposium
Exploiting Data Similarity to Reduce Memory Footprints
IPDPS '11 Proceedings of the 2011 IEEE International Parallel & Distributed Processing Symposium
Tolerating message latency through the early release of blocked receives
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
Hi-index | 0.00 |