LogGP: incorporating long messages into the LogP model for parallel computation
Journal of Parallel and Distributed Computing
MagPIe: MPI's collective communication operations for clustered wide area systems
Proceedings of the seventh ACM SIGPLAN symposium on Principles and practice of parallel programming
Asymptotically optimal algorithms for job shop scheduling and packet routing
Journal of Algorithms
Adaptive communication algorithms for distributed heterogeneous systems
Journal of Parallel and Distributed Computing - Special issue on software support for distributed computing
Broadcast scheduling optimization for heterogeneous cluster systems
Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architectures
Network performance-aware collective communication for clustered wide-area systems
Parallel Computing - Clusters and computational grids for scientific computing
MPI: The Complete Reference
On multicast algorithms for heterogeneous networks of workstations
Journal of Parallel and Distributed Computing - Special issue on cluster and network-based computing
Broadcast scheduling optimization for heterogeneous cluster systems
Journal of Algorithms
Efficient Collective Communication on Heterogeneous Networks of Workstations
ICPP '98 Proceedings of the 1998 International Conference on Parallel Processing
ECO: Efficient Collective Operations for Communication on Heterogeneous Networks
IPPS '96 Proceedings of the 10th International Parallel Processing Symposium
A World Wide Number Field Sieve Factoring Record: On to 512 Bits
ASIACRYPT '96 Proceedings of the International Conference on the Theory and Applications of Cryptology and Information Security: Advances in Cryptology
Simgrid: A Toolkit for the Simulation of Application Scheduling
CCGRID '01 Proceedings of the 1st International Symposium on Cluster Computing and the Grid
Scheduling Distributed Applications: the SimGrid Simulation Framework
CCGRID '03 Proceedings of the 3st International Symposium on Cluster Computing and the Grid
HCW '99 Proceedings of the Eighth Heterogeneous Computing Workshop
Efficient Gather Operation in Heterogeneous Cluster Systems
HPCS '02 Proceedings of the 16th Annual International Symposium on High Performance Computing Systems and Applications
Reduction Optimization in Heterogeneous Cluster Environments
IPDPS '00 Proceedings of the 14th International Symposium on Parallel and Distributed Processing
Scheduling Algorithms for Efficient Gather Operations in Distributed Heterogeneous Systems
ICPP '00 Proceedings of the 2000 International Workshop on Parallel Processing
MPICH-G2: a Grid-enabled implementation of the Message Passing Interface
Journal of Parallel and Distributed Computing - Special issue on computational grids
IEEE Communications Magazine
Hi-index | 0.00 |
In this paper, we consider the communications involved by the execution of a complex application, deployed on a heterogeneous large-scale distributed platform. Such applications intensively use collective macro-communication schemes, such as scatters, personalized all-to-alls or gather/reduce operations. Rather than aiming at minimizing the execution time of a single macro-communication, we focus on the steady-state operation. We assume that there is a large number of macro-communications to perform in pipeline fashion, and we aim at maximizing the throughput, i.e., the (rational) number of macro-communications which can be initiated every time-step. We target heterogeneous platforms, modeled by a graph where resources have different communication and computation speeds. The situation is simpler for series of scatters or personalized all-to-alls than for series of reduces operations, because of the possibility of combining various partial reductions of the local values, and of interleaving computations with communications. In all cases, we show how to determine the optimal throughput, and how to exhibit a concrete periodic schedule that achieves this throughput.