MagPIe: MPI's collective communication operations for clustered wide area systems
Proceedings of the seventh ACM SIGPLAN symposium on Principles and practice of parallel programming
Broadcast scheduling optimization for heterogeneous cluster systems
Proceedings of the twelfth annual ACM symposium on Parallel algorithms and architectures
Broadcast scheduling optimization for heterogeneous cluster systems
Journal of Algorithms
Exploiting Hierarchy in Heterogeneous Environments
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
An Adaptive Communication System for Heterogeneous Network Computing
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Efficient Multicast Algorithms for Heterogeneous Switch-based Irregular Networks of Workstations
IPDPS '01 Proceedings of the 15th International Parallel & Distributed Processing Symposium
Implementing MPI-2 Extended Collective Operations
Proceedings of the 6th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface
Optimizing data aggregation for cluster-based internet services
Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming
HCW '99 Proceedings of the Eighth Heterogeneous Computing Workshop
Efficient collective communication in distributed heterogeneous systems
Journal of Parallel and Distributed Computing
Efficient Multiple Multicast on Heterogeneous Network of Workstations
The Journal of Supercomputing
On broadcasting in heterogenous networks
SODA '04 Proceedings of the fifteenth annual ACM-SIAM symposium on Discrete algorithms
The Journal of Supercomputing
Pipelining Broadcasts on Heterogeneous Platforms
IEEE Transactions on Parallel and Distributed Systems
Broadcast Trees for Heterogeneous Platforms
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers - Volume 01
A Method for MPI Broadcast in Computational Grids
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 13 - Volume 14
Efficient trigger-broadcasting in heterogeneous clusters
Journal of Parallel and Distributed Computing
Efficient broadcast in heterogeneous networks of workstations using two sub-networks
International Journal of Parallel Programming
Optimizing the steady-state throughput of scatter and reduce operations on heterogeneous platforms
Journal of Parallel and Distributed Computing
A Parallel Computational Model for Heterogeneous Clusters
IEEE Transactions on Parallel and Distributed Systems
The impact of heterogeneity on master-slave scheduling
Parallel Computing
A framework for adaptive collective communications for heterogeneous hierarchical computing systems
Journal of Computer and System Sciences
Scheduling for atomic broadcast operation in heterogeneous networks with one port model
The Journal of Supercomputing
Fast file dissemination in peer-to-peer networks with upstream bandwidth constraint
Future Generation Computer Systems
Scheduling heuristics for efficient broadcast operations on grid environments
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
The impact of heterogeneity on master-slave on-line scheduling
IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing
GCC'05 Proceedings of the 4th international conference on Grid and Cooperative Computing
Dynamically adaptive binomial trees for broadcasting in heterogeneous networks of workstations
VECPAR'04 Proceedings of the 6th international conference on High Performance Computing for Computational Science
Topology-Based hypercube structures for global communication in heterogeneous networks
Euro-Par'05 Proceedings of the 11th international Euro-Par conference on Parallel Processing
Improving multilevel approach for optimizing collective communications in computational grids
EGC'05 Proceedings of the 2005 European conference on Advances in Grid Computing
Hi-index | 0.00 |
Networks of Workstations (NOW) have become an attractive alternative platform for high performance computing. Due to the commodity nature of workstations and interconnects and due to the multiplicity of vendors and platforms, the NOW environments are being gradually redefined as Heterogeneous Networks of Workstations (HNOW) environments. This paper presents a new framework for implementing collective communication operations (as defined by the Message Passing Interface (MPI) standard) efficiently for the emerging HNOW environments. We first classify different types of heterogeneity in HNOW and then focus on one important characteristic: communication capabilities of workstations. Taking this characteristic into account, we propose two new approaches (Speed-Partitioned Ordered Chain (SPOC) and Fastest-Node First (FNF)) to implement collective communication operations with reduced latency. We also investigate methods for deriving optimal trees for broadcast and multicast operations. Generating such trees is shown to be computationally intensive. It is shown that the FNF approach, in spite of its simplicity, can deliver performance within 1% of the performance of the optimal trees. Finally, these new approaches are compared with the approach used in the MPICH implementation on experimental as well as on simulated testbeds. On a 24-node existing HNOW environment with SGI workstations and ATM interconnection, our approaches reduce the latency of broadcast and multicast operations by a factor of up to 3.5 compared to the approach used in the existing MPICH implementation. On a 64-node simulated testbed, our approaches can reduce the latency of broadcast and multicast operations by a factor of up to 4.5. Thus, these results demonstrate that there is significant potential for our approaches to be applied towards designing scalable collective communication libraries for current and future generation HNOW environments.